ACM Home Page
Please provide us with feedback. Feedback
Using parsimonious language models on web data
Full text PdfPdf (135 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Singapore, Singapore
POSTER SESSION: Posters group 2: blog, tagging, opinion analysis and web IR table of contents
Pages 763-764  
Year of Publication: 2008
ISBN:978-1-60558-164-4
Authors
Rianne Kaptein  University of Amsterdam, Amsterdam, Netherlands
Rongmei LI  University of Twente, Enschede, Netherlands
Djoerd Hiemstra  University of Twente, Enschede, Netherlands
Jaap Kamps  University of Amsterdam, Amsterdam, Netherlands
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 25,   Downloads (12 Months): 45,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390334.1390491
What is a DOI?

ABSTRACT

In this paper we explore the use of parsimonious language models for web retrieval. These models are smaller thus more efficient than the standard language models and are therefore well suited for large-scale web retrieval. We have conducted experiments on four TREC topic sets, and found that the parsimonious language model results in improvement of retrieval effectiveness over the standard language model for all data-sets and measures. In all cases the improvement is significant, and more substantial than in earlier experiments on newspaper/newswire data.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
D. Hiemstra, S. Robertson, and H. Zaragoza. Parsimonious language models for information retrieval. In Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 178--185. ACM Press, New York NY, 2004.
 
2
J. Kamps. Effective smoothing for a terabyte of text. In The Fourteenth Text REtrieval Conference (TREC 2005). National Institute of Standards and Technology. NIST Special Publication, 2006.
 
3
K. Sparck-Jones, S. Robertson, D. Hiemstra, and H. Zaragoza. Language modelling and relevance. In W. Croft and J. Lafferty, editors, Language Modeling for Information Retrieval, pages 57--71. Kluwer Academic Publishers, 2003.
 
4
TREC. Text REtrieval Conference, 2008. http://trec.nist.gov/.

Collaborative Colleagues:
Rianne Kaptein: colleagues
Rongmei LI: colleagues
Djoerd Hiemstra: colleagues
Jaap Kamps: colleagues