Clustered SVD strategies in latent semantic indexing

  • Jing Gao
  • Jun Zhang

Research output: Article, peer-reviewed

54 Citations (Scopus)

Abstract

The text retrieval method using latent semantic indexing (LSI) technique with truncated singular value decomposition (SVD) has been intensively studied in recent years. The SVD reduces the noise contained in the original representation of the term-document matrix and improves the information retrieval accuracy. Recent studies indicate that SVD is mostly useful for small homogeneous data collections. For large inhomogeneous datasets, the performance of the SVD based text retrieval technique may deteriorate. We propose to partition a large inhomogeneous dataset into several smaller ones with clustered structure, on which we apply the truncated SVD. Our experimental results show that the clustered SVD strategies may enhance the retrieval accuracy and reduce the computing and storage costs.
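The strategy described in the abstract (partition the collection, then apply a truncated SVD to each part) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not say how the clusters are obtained, how queries are routed, or how scores are merged, so the sketch assumes that cluster labels are already available from some clustering step, and that every cluster is searched and the scores are merged. The function names, the toy matrix, and the scoring rule (latent dot product normalized by the full query norm) are all hypothetical choices made for illustration.

```python
import numpy as np


def truncated_svd(A, k):
    """Rank-k truncated SVD, so that A is approximated by U_k @ diag(s_k) @ Vt_k."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]


def build_clustered_index(A, labels, k):
    """Compute one truncated SVD per document cluster.

    A is a terms x documents matrix; labels[j] is the cluster id of column j.
    The labels are assumed to come from any clustering step (hypothetical input).
    """
    labels = np.asarray(labels)
    index = {}
    for c in np.unique(labels):
        doc_ids = np.flatnonzero(labels == c)
        rank = min(k, len(doc_ids))
        index[int(c)] = (doc_ids,) + truncated_svd(A[:, doc_ids], rank)
    return index


def search(index, q, top=3):
    """Score every document in every cluster and return the best (doc_id, score) pairs."""
    q_norm = np.linalg.norm(q)
    results = []
    for doc_ids, U_k, s_k, Vt_k in index.values():
        q_hat = U_k.T @ q                # query projected into the latent space
        docs_hat = s_k[:, None] * Vt_k   # documents in the latent space, one per column
        # Assumption: normalize by the full query norm so that scores stay
        # comparable across clusters with different latent spaces.
        denom = q_norm * np.linalg.norm(docs_hat, axis=0)
        scores = (docs_hat.T @ q_hat) / np.maximum(denom, 1e-12)
        results.extend(zip(doc_ids.tolist(), scores.tolist()))
    results.sort(key=lambda pair: pair[1], reverse=True)
    return results[:top]


# Toy term-document matrix (6 terms x 6 documents) with two disjoint topics.
A = np.array([
    [2, 1, 1, 0, 0, 0],
    [1, 2, 0, 0, 0, 0],
    [0, 1, 2, 0, 0, 0],
    [0, 0, 0, 2, 1, 0],
    [0, 0, 0, 1, 2, 1],
    [0, 0, 0, 0, 1, 2],
], dtype=float)
labels = np.array([0, 0, 0, 1, 1, 1])  # assumed output of a clustering step

index = build_clustered_index(A, labels, k=2)
query = np.array([1, 1, 0, 0, 0, 0], dtype=float)  # uses terms from the first topic only
hits = search(index, query, top=3)
print(hits)
```

Each cluster stores factors sized to its own documents and needs only a small SVD, which is one way to read the abstract's claim about reduced computing and storage costs. Whether accuracy also improves depends on the data and on the clustering, which this toy example does not measure.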

Original language: English
Pages (from-to): 1051-1063
Number of pages: 13
Journal: Information Processing and Management
Volume: 41
Issue number: 5
DOI
Status: Published - Sep 2005

Bibliographical note

Funding Information:
The research work of the authors was supported in part by the US National Science Foundation under grants CCR-9988165, CCR-0092532, ACR-0202934, and ACR-0234270, and by the US Department of Energy Office of Science under grant DE-FG02-02ER45961.

Funding

Funders: Funder number
Michigan State University-U.S. Department of Energy (MSU-DOE) Plant Research Laboratory: DE-FG02-02ER45961
National Science Foundation (NSF): CCR-0092532, ACR-0234270, ACR-0202934, CCR-9988165

    ASJC Scopus subject areas

    • Information Systems
    • Media Technology
    • Computer Science Applications
    • Management Science and Operations Research
    • Library and Information Sciences
