ABSTRACT
We deal, in this paper, with the short queries (containing one or two words) problem. Short queries have no sufficient information to express their semantics in a non ambiguous way. Pseudo-relevance feedback (PRF) approach for query expansion is useful in many Information Retrieval (IR) tasks. However, this approach does not work well in the case of very short queries. Therefore, we present instead of PRF a semantic query enrichment method based on Wikipedia. This method expands short queries by semantically related terms extracted from Wikipedia. Our experiments on cultural heritage corpora show significant improvement in the retrieval performance.
- E. Agirre, P. D. Clough, S. Fernando, M. Hall, A. Otegi, and M. Stevenson. The sheffield and basque country universities entry to chic: Using random walks and similarity to access cultural heritage. In CLEF (Online Working Notes/Labs/Workshop), 2012.Google Scholar
- M. Akasereh, N. Naji, and J. Savoy. Unine at clef 2012. In CLEF (Online Working Notes/Labs/Workshop), 2012.Google Scholar
- V. Petras, N. Ferro, M. Gade, A. Isaac, M. Kleineberg, I. Masiero, M. Nicchio, and J. Stiller. Cultural heritage in clef (chic) overview 2012, 2012.Google Scholar
- S. E. Robertson and S. Walker. Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. SIGIR '94, pages 232--241, Dublin, Ireland, 1994. Google ScholarDigital Library
- Wikipedia. Compound term processing - wikipedia, the free encyclopedia, 2012. {Online; accessed 19-July-2013}.Google Scholar
- Y. Xu, G. J. Jones, and B. Wang. Query dependent pseudo-relevance feedback based on wikipedia. SIGIR '09, pages 59--66, Boston, MA, USA, 2009. Google ScholarDigital Library
- C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to information retrieval. ACM Trans. Inf. Syst., 22(2):179--214, Apr. 2004. Google ScholarDigital Library
Index Terms
- Wikipedia-based semantic query enrichment
Recommendations
Query dependent pseudo-relevance feedback based on wikipedia
SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrievalPseudo-relevance feedback (PRF) via query-expansion has been proven to be e®ective in many information retrieval (IR) tasks. In most existing work, the top-ranked documents from an initial search are assumed to be relevant and used for PRF. One problem ...
Lexical Co-Occurrence and Contextual Window-Based Approach with Semantic Similarity for Query Expansion
Query expansion QE is an efficient method for enhancing the efficiency of information retrieval system. In this work, we try to capture the limitations of pseudo-feedback based QE approach and propose a hybrid approach for enhancing the efficiency of ...
Co-occurrence and Semantic Similarity Based Hybrid Approach for Improving Automatic Query Expansion in Information Retrieval
ICDCIT 2015: Proceedings of the 11th International Conference on Distributed Computing and Internet Technology - Volume 8956Pseudo Relevance feedback PRF based query expansion approaches assumes that the top ranked retrieved documents are relevant. But this assumption is not always true; it may also possible that a PRF document may contain different topics, which may or may ...
Comments