ACM Home Page
Please provide us with feedback. Feedback
Scoring missing terms in information retrieval tasks
Full text PdfPdf (286 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the thirteenth ACM international conference on Information and knowledge management table of contents
Washington, D.C., USA
SESSION: IR-1 (information retrieval): information retrieval models table of contents
Pages: 50 - 58  
Year of Publication: 2004
ISBN:1-58113-874-1
Authors
Egidio Terra  University of Waterloo, Waterloo, Canada
Charles L.A. Clarke  University of Waterloo, Waterloo, Canada
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 75,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1031171.1031182
What is a DOI?

ABSTRACT

An usual approach to address mismatching vocabulary problem is to augment the original query using dictionaries and other lexical resources and/or by looking at pseudo-relevant documents. Either way, terms are added to form a new query that will be used to score all documents in a subsequent retrieval pass, and as consequence the original query's focus may drift because of the newly added terms. We propose a new method to address the mismatching vocabulary problem, expanding original query terms only when necessary and complementing the user query for missing terms while scoring documents. It allows related semantic aspects to be included in a conservative and selective way, thus reducing the possibility of query drift. Our results using replacements for the <i>missing query terms</i> in modified document and passages retrieval methods show significant improvement over the original ones.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
C. Buckley, G. Salton, J. Allan, and A. Singhal. Automatic query expansion using SMART: TREC 3. In In proceedings of Third Text REtrieval Conference Gaithersburg, MD, 1994.
3
4
5
 
6
C. L. A. Clarke, G. V. Cormack, T. R. Lynam, and E. Terra. Advances in Open Domain Question Answering chapter Question answering by passage selection. Kluwer Academic Publishers. To appear, 2004.
7
 
8
9
 
10
J. Firth. Studies In Linguistic Analisys chapter A Synopsis of Linguistic Theory, 1930-1955, pages 1--32. Basil Blackwell, Oxford, 3rd edition, 1957.
11
 
12
13
 
14
J. Lin, A. Fernandes, B. Katz, G. Marton, and S. Tellex. Extracting answers from the web using data annotation an knowledge mining techniques. In The Eleventh Text REtrieval Conference (TREC 2002), Gaithersburg, MD, 2002.
15
16
 
17
J. Rocchio. The SMART Retrieval System: Experiments in Automatic Document Processing chapter Relevance feedback in information retrieval, pages 313--323. Prentice-Hall Inc., 1971.
18
 
19
P. Schäuble and P. Sheridan. Cross-language information retrieval (clir) track overview. In The Sixth Text REtrieval Conference (TREC 6), Gaithersburg, MD, 1997.
20
 
21
E. Terra and C. L. Clarke. Fast computation of lexical affinity models. In Proceedings of the 20th International Conference on Computational Linguistics (COLING), 2004.
 
22
 
23
24
25
 
26
D. L. Yeung, C. L. A. Clarke, G. V. Cormack, T. R. Lynam, and E. Terra. Task-specific query expansion (multitext experiments for trec 2003). In 2002 Text REtrieval Conference Gaithersburg, MD, 2003.
27


Collaborative Colleagues:
Egidio Terra: colleagues
Charles L.A. Clarke: colleagues