ABSTRACT
This paper proposes and method to improve retrieval performance of the vector space model (VSM) by utilizing user-supplied information of those documents that are relevant to the query in question. In addition to the user's relevance feedback information, incorporated into the retrieval model, which is built by using a sequence of linear transformations, is information such as inter-document similarity values. Then, the high-dimensional and sparse vectors are reduced by SVD (Singular Value Decomposition) and transformed into the low-dimensional vector space, namely the space representing the latent semantic meanings of the words. The method was experimented on through two test collections, Medline collection and Cranfield collection. Improvement of average precision compared with LSI (Latent Semantic Indexing) model were 4.03% (Medline) and 24.87% (Cranfield) for the two training data sets, and 0.01% (Medline) and 4.89% (Cranfield) for the test data, respectively. The proposed method provides an approach that makes it possible to preserve the user-supplied relevance information for a long term in the system and to use the information later.
- 1.Berry, M. W., Dumais, S. T., O'Brien, G. W. Using linear algebra for intelligent information retrieval. SIAM Review, 37(4),1994, pp. 573-595. Google ScholarDigital Library
- 2.Bartell, B. T., Cottrell, G. W. and Belew, R. K. Optimizing parameters in a ranked retrieval system using multi-query relevance feedback. Proceedings of the Symposium on Document Analysis and Information Retrieval, Las Vegas, 1994.Google Scholar
- 3.Deerwester, S., Dnmals, S., Furnas, G. W., Landauer, T. K. and Harshman, R. Indexing by latent semantic analysis.Journal of the American Society for Information Science, Vol. 41, No. 6, 1990, pp. 391-407.Google ScholarCross Ref
- 4.Erica, C. and Tamara, G. K. New term weighting formulas for the vector space method in information retrieval. Technical Memorandum O RNL-13756, Oak Ridge National Laboratory, Oak Ridge, Tennessee,1998.Google Scholar
- 5.Frakes, W. B. and Baeza-Yates, R. Information retrieval: Data structures and algorithms.: Prentice Hall, 1992. Google ScholarDigital Library
- 6.Vogt, C. C., Cottrell, G. W., Belew, R. K. and Bartell, B. T.User lenses-achieving 100% precision on frequently asked questions. Proceedings of User Modeling'99, Banff,1999. Google ScholarDigital Library
Improvement of vector space information retrieval model based on supervised learning
Recommendations
An information retrieval model based on vector space method by supervised learning
This paper proposes a method to improve retrieval performance of the vector space model (VSM) in part by utilizing user-supplied information of those documents that are relevant to the query in question. In addition to the user's relevance feedback ...
Vector space model adaptation and pseudo relevance feedback for content-based image retrieval
Image retrieval is an important problem for researchers in computer vision and content-based image retrieval (CBIR) fields. Over the last decades, many image retrieval systems were based on image representation as a set of extracted low-level features ...
Re-examining the effects of adding relevance information in a relevance feedback environment
This paper presents an investigation about how to automatically formulate effective queries using full or partial relevance information (i.e., the terms that are in relevant documents) in the context of relevance feedback (RF). The effects of adding ...
Comments