skip to main content
10.1145/355214.355224acmconferencesArticle/Chapter ViewAbstractPublication PagesiralConference Proceedingsconference-collections
Article
Free Access

Improvement of vector space information retrieval model based on supervised learning

Authors Info & Claims
Published:01 November 2000Publication History

ABSTRACT

This paper proposes and method to improve retrieval performance of the vector space model (VSM) by utilizing user-supplied information of those documents that are relevant to the query in question. In addition to the user's relevance feedback information, incorporated into the retrieval model, which is built by using a sequence of linear transformations, is information such as inter-document similarity values. Then, the high-dimensional and sparse vectors are reduced by SVD (Singular Value Decomposition) and transformed into the low-dimensional vector space, namely the space representing the latent semantic meanings of the words. The method was experimented on through two test collections, Medline collection and Cranfield collection. Improvement of average precision compared with LSI (Latent Semantic Indexing) model were 4.03% (Medline) and 24.87% (Cranfield) for the two training data sets, and 0.01% (Medline) and 4.89% (Cranfield) for the test data, respectively. The proposed method provides an approach that makes it possible to preserve the user-supplied relevance information for a long term in the system and to use the information later.

References

  1. 1.Berry, M. W., Dumais, S. T., O'Brien, G. W. Using linear algebra for intelligent information retrieval. SIAM Review, 37(4),1994, pp. 573-595. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.Bartell, B. T., Cottrell, G. W. and Belew, R. K. Optimizing parameters in a ranked retrieval system using multi-query relevance feedback. Proceedings of the Symposium on Document Analysis and Information Retrieval, Las Vegas, 1994.Google ScholarGoogle Scholar
  3. 3.Deerwester, S., Dnmals, S., Furnas, G. W., Landauer, T. K. and Harshman, R. Indexing by latent semantic analysis.Journal of the American Society for Information Science, Vol. 41, No. 6, 1990, pp. 391-407.Google ScholarGoogle ScholarCross RefCross Ref
  4. 4.Erica, C. and Tamara, G. K. New term weighting formulas for the vector space method in information retrieval. Technical Memorandum O RNL-13756, Oak Ridge National Laboratory, Oak Ridge, Tennessee,1998.Google ScholarGoogle Scholar
  5. 5.Frakes, W. B. and Baeza-Yates, R. Information retrieval: Data structures and algorithms.: Prentice Hall, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6.Vogt, C. C., Cottrell, G. W., Belew, R. K. and Bartell, B. T.User lenses-achieving 100% precision on frequently asked questions. Proceedings of User Modeling'99, Banff,1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. Improvement of vector space information retrieval model based on supervised learning

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      IRAL '00: Proceedings of the fifth international workshop on on Information retrieval with Asian languages
      November 2000
      220 pages
      ISBN:1581133006
      DOI:10.1145/355214
      • Chairmen:
      • Kam-Fai Wong,
      • Dik L. Lee,
      • Jong-Hyeok Lee

      Copyright © 2000 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 November 2000

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader