research-article

On statistical analysis and optimization of information retrieval effectiveness metrics

Authors:

Jianhan Zhu

SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval

Pages 226 - 233

https://doi.org/10.1145/1835449.1835489

Published: 19 July 2010

Abstract

This paper presents a new way of thinking about IR metric optimization. It argues that the optimal ranking problem should be factorized into two distinct yet interrelated stages: a relevance prediction stage and a ranking decision stage. During retrieval, the relevance of documents is not known a priori, so the joint probability of relevance is used to measure the uncertainty of documents' relevance in the collection as a whole. The resulting optimization objective in the latter stage is thus the expected value of the IR metric with respect to this probability measure of relevance. By statistically analyzing the expected values of IR metrics under such uncertainty, we discover and explain some interesting properties of IR metrics that were not previously known. Our analysis and optimization framework does not assume a particular (relevance) retrieval model or metric, making it applicable to many existing IR models and metrics. Experiments on one of the resulting applications demonstrate its significance in adapting to various IR metrics.
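The two-stage idea in the abstract can be illustrated with a small sketch. This is not the paper's algorithm: the function names, the toy probabilities, and the brute-force search are all hypothetical, and the reciprocal-rank function additionally assumes that documents are relevant independently of one another, whereas the abstract speaks of a joint probability of relevance. Stage one is represented only by its output, a table of per-document relevance probabilities; stage two picks the ranking that maximizes the expected value of a chosen metric.

```python
from itertools import permutations
from math import log2


def expected_dcg(ranking, p_rel, k=None):
    """Expected DCG@k with binary gains.

    By linearity of expectation, only the marginal probability of
    relevance of each document is needed for this metric.
    """
    k = len(ranking) if k is None else k
    return sum(
        p_rel[doc] / log2(rank + 1)
        for rank, doc in enumerate(ranking[:k], start=1)
    )


def expected_precision_at_k(ranking, p_rel, k):
    """Expected precision@k: mean relevance probability of the top k."""
    return sum(p_rel[doc] for doc in ranking[:k]) / k


def expected_reciprocal_rank(ranking, p_rel):
    """Expected reciprocal rank of the first relevant document.

    ASSUMPTION (ours, for illustration): relevance events are independent,
    so P(no relevant document above rank i) is a product of (1 - p).
    """
    total, p_none_above = 0.0, 1.0
    for rank, doc in enumerate(ranking, start=1):
        total += p_none_above * p_rel[doc] / rank
        p_none_above *= 1.0 - p_rel[doc]
    return total


def best_ranking(p_rel, expected_metric):
    """Ranking decision stage: exhaustive search over all orderings.

    Only feasible for a handful of documents; shown for clarity.
    """
    return max(permutations(p_rel), key=expected_metric)


# Stage 1 output (hypothetical numbers): predicted relevance probabilities.
p_rel = {"d1": 0.9, "d2": 0.5, "d3": 0.2}

# Stage 2: choose the ranking that maximizes the expected metric.
by_dcg = best_ranking(p_rel, lambda r: expected_dcg(r, p_rel))
by_rr = best_ranking(p_rel, lambda r: expected_reciprocal_rank(r, p_rel))
print(by_dcg, by_rr)
```

Swapping the function passed to `best_ranking` is the only change needed to target a different metric, which mirrors the abstract's point that the decision stage is separate from the model that produced the probabilities.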

Cited By

Dai X, Xi Y, Zhang W, Liu Q, Tang R, He X, Hou J, Wang J, Yu Y (2021). Beyond Relevance Ranking: A General Graph Matching Framework for Utility-Oriented Learning to Rank. ACM Transactions on Information Systems 40:2, pp. 1-29. Online publication date: 16-Nov-2021.
https://dl.acm.org/doi/10.1145/3464303
Trichkova-Kashamova E (2020). Modeling and optimization of traffic flows in a network. 2020 International Conference Automatics and Informatics (ICAI), pp. 1-6. Online publication date: 1-Oct-2020.
https://doi.org/10.1109/ICAI50593.2020.9311314
Jannach D, Lerche L, Zanker M (2018). Recommending Based on Implicit Feedback. Social Information Access, pp. 510-569. Online publication date: 3-May-2018.
https://doi.org/10.1007/978-3-319-90092-6_14

Index Terms

On statistical analysis and optimization of information retrieval effectiveness metrics
1. Information systems
  1. Information retrieval
    1. Information retrieval query processing
    2. Retrieval models and ranking

Published In

SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval

July 2010

944 pages

ISBN:9781450301534

DOI:10.1145/1835449

General Chairs: Fabio Crestani (University of Lugano, CH), Stéphane Marchand-Maillet (University of Geneva, CH)

Program Chairs: Hsin-Hsi Chen (National Taiwan University, TW), Efthimis N. Efthimiadis (University of Washington, USA), Jacques Savoy (University of Neuchatel, CH)

Copyright © 2010 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States



Conference

SIGIR '10

Sponsor:

SIGIR

SIGIR '10: The 33rd International ACM SIGIR conference on research and development in Information Retrieval

July 19 - 23, 2010

Geneva, Switzerland

Acceptance Rates

SIGIR '10 Paper Acceptance Rate 87 of 520 submissions, 17%;

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Bibliometrics

Article Metrics

Total Citations: 15
Total Downloads: 617
Downloads (Last 12 months): 8
Downloads (Last 6 weeks): 1

Reflects downloads up to 15 Feb 2025

Citations

Cited By

(2017). An in-depth study on diversity evaluation. Information Processing and Management: an International Journal 53:4, pp. 799-813. Online publication date: 1-Jul-2017.
https://dl.acm.org/doi/10.1016/j.ipm.2017.03.001
Rao V, Jain P, Jawahar C, Kender J, Smith J, Luo J, Boll S, Hsu W (2016). Diverse Yet Efficient Retrieval using Locality Sensitive Hashing. Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, pp. 189-196. Online publication date: 6-Jun-2016.
https://dl.acm.org/doi/10.1145/2911996.2911998
Sloan M, Wang J, Allan J, Croft B, de Vries A, Zhai C (2015). Dynamic Information Retrieval. Proceedings of the 2015 International Conference on The Theory of Information Retrieval, pp. 61-70. Online publication date: 27-Sep-2015.
https://dl.acm.org/doi/10.1145/2808194.2809457
Bellini P, Cenni D, Nesi P (2014). Optimization of information retrieval for cross media contents in a best practice network. International Journal of Multimedia Information Retrieval 3:3, pp. 147-159. Online publication date: 8-May-2014.
https://doi.org/10.1007/s13735-014-0058-8
Zhang W, Wang J, Chen B, Zhao X, Yang Q, King I, Li Q, Pu P, Karypis G (2013). To personalize or not. Proceedings of the 7th ACM conference on Recommender systems, pp. 229-236. Online publication date: 12-Oct-2013.
https://dl.acm.org/doi/10.1145/2507157.2507167
Shi Y, Karatzoglou A, Baltrunas L, Larson M, Hanjalic A, He Q, Iyengar A, Nejdl W, Pei J, Rastogi R (2013). GAPfm. Proceedings of the 22nd ACM international conference on Information & Knowledge Management, pp. 2261-2266. Online publication date: 27-Oct-2013.
https://dl.acm.org/doi/10.1145/2505515.2505653
Jin X, Sloan M, Wang J, Schwabe D, Almeida V, Glaser H, Baeza-Yates R, Moon S (2013). Interactive exploratory search for multi page search results. Proceedings of the 22nd international conference on World Wide Web, pp. 655-666. Online publication date: 13-May-2013.
https://dl.acm.org/doi/10.1145/2488388.2488446
