ABSTRACT
A common approach to the vocabulary mismatch problem is to augment the original query using dictionaries and other lexical resources, pseudo-relevant documents, or both. Either way, terms are added to form a new query that is used to score all documents in a subsequent retrieval pass, and as a consequence the original query's focus may drift because of the newly added terms. We propose a new method that addresses the vocabulary mismatch problem by expanding original query terms only when necessary, complementing the user query for missing terms while documents are scored. This allows related semantic aspects to be included in a conservative and selective way, thus reducing the possibility of query drift. Our results using replacements for the missing query terms in modified document and passage retrieval methods show significant improvement over the original methods.
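The idea of complementing missing query terms at scoring time can be illustrated with a minimal sketch. It is not the authors' method: the relatedness table `RELATED`, the function `score`, and the raw term-frequency weighting are all assumptions made for illustration. A query term that occurs in the document is scored as usual; only a term that is absent borrows credit from its best related term found in that document, discounted by a similarity weight.

```python
from collections import Counter

# Hypothetical relatedness table (an assumption, not from the paper):
# query term -> {replacement term: similarity in (0, 1]}.
RELATED = {
    "car": {"automobile": 0.8, "vehicle": 0.5},
    "price": {"cost": 0.7},
}


def score(query_terms, doc_tokens, related=RELATED):
    """Score one document against a query using raw term frequency.

    A query term present in the document contributes its frequency.
    A missing query term contributes the best discounted frequency
    among its related terms that do occur in the document.
    """
    tf = Counter(doc_tokens)
    total = 0.0
    for term in query_terms:
        if tf[term] > 0:
            # Term is present: no expansion is needed.
            total += tf[term]
        else:
            # Term is missing: fall back to a replacement, if any occurs.
            candidates = related.get(term, {})
            total += max(
                (sim * tf[alt] for alt, sim in candidates.items()),
                default=0.0,
            )
    return total
```

Because replacements are consulted per document and only for terms that document lacks, a document that already matches the query is scored on the original terms alone, which is the conservative behavior the abstract describes.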