Research article · DOI: 10.1145/1835449.1835475

Learning to efficiently rank

Published: 19 July 2010

Abstract

It has been shown that learning to rank approaches are capable of learning highly effective ranking functions. However, these approaches have mostly ignored the important issue of efficiency. Given that both efficiency and effectiveness are important for real search engines, models that are optimized for effectiveness may not meet the strict efficiency requirements necessary to deploy in a production environment. In this work, we present a unified framework for jointly optimizing effectiveness and efficiency. We propose new metrics that capture the tradeoff between these two competing forces and devise a strategy for automatically learning models that directly optimize the tradeoff metrics. Experiments indicate that models learned in this way provide a good balance between retrieval effectiveness and efficiency. With specific loss functions, learned models converge to familiar existing ones, which demonstrates the generality of our framework. Finally, we show that our approach naturally leads to a reduction in the variance of query execution times, which is important for query load balancing and user satisfaction.
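The abstract describes learning models that directly optimize a metric capturing the tradeoff between effectiveness and efficiency. As a minimal sketch of that idea (not the paper's actual formulation), the snippet below scores candidate ranking configurations with a hypothetical linear tradeoff, effectiveness minus a cost penalty; the function name `tradeoff`, the penalty weight `alpha`, and all numbers are illustrative assumptions.

```python
# Hypothetical illustration of a joint effectiveness/efficiency objective.
# `tradeoff`, `alpha`, and the toy numbers are assumptions, not the paper's.

def tradeoff(effectiveness: float, cost: float, alpha: float = 0.1) -> float:
    """Higher is better: reward effectiveness, penalize query execution cost."""
    return effectiveness - alpha * cost

# Toy candidate ranking models: (effectiveness score, relative query cost).
candidates = {
    "term-only":      (0.30, 0.2),  # cheap but less effective
    "term+proximity": (0.36, 0.7),  # more effective but expensive
    "learned-subset": (0.34, 0.4),  # a middle ground
}

# Select the model that maximizes the tradeoff metric.
best = max(candidates, key=lambda m: tradeoff(*candidates[m]))
print(best)  # → learned-subset
```

With a small cost penalty, the middle-ground model wins even though it is neither the cheapest nor the most effective, which is the kind of balance the abstract argues for; tuning `alpha` toward zero recovers a purely effectiveness-driven choice.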




    Published In

    SIGIR '10: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval
    July 2010, 944 pages
    ISBN: 9781450301534
    DOI: 10.1145/1835449

    Publisher

    Association for Computing Machinery, New York, NY, United States


    Author Tags

    1. effectiveness and efficiency tradeoff
    2. learning to rank
    3. linear models


    Acceptance Rates

    SIGIR '10 paper acceptance rate: 87 of 520 submissions, 17%
    Overall acceptance rate: 792 of 3,983 submissions, 20%

    Article Metrics

    • Downloads (last 12 months): 20
    • Downloads (last 6 weeks): 2

    Reflects downloads up to 20 Feb 2025

    Cited By

    • (2024) Towards Effective and Efficient Sparse Neural Information Retrieval. ACM Transactions on Information Systems, 42(5), 1-46. DOI: 10.1145/3634912. Published 29 Apr 2024.
    • (2023) Efficient Document-at-a-time and Score-at-a-time Query Evaluation for Learned Sparse Representations. ACM Transactions on Information Systems, 41(4), 1-28. DOI: 10.1145/3576922. Published 22 Mar 2023.
    • (2023) Early Exit Strategies for Learning-to-Rank Cascades. IEEE Access, 11, 126691-126704. DOI: 10.1109/ACCESS.2023.3331088. Published 2023.
    • (2022) H-ERNIE. Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, 1478-1489. DOI: 10.1145/3477495.3531986. Published 6 Jul 2022.
    • (2022) Ensemble Model Compression for Fast and Energy-Efficient Ranking on FPGAs. Advances in Information Retrieval, 260-273. DOI: 10.1007/978-3-030-99736-6_18. Published 5 Apr 2022.
    • (2022) Scalability Challenges in Web Search Engines. Published 10 Mar 2022.
    • (2022) Learning to Rank for Information Retrieval and Natural Language Processing. Published 2 Apr 2022.
    • (2021) Improving search engine efficiency through contextual factor selection. AI Magazine, 42(2), 50-58. DOI: 10.1609/aimag.v42i2.15099. Published 1 Jun 2021.
    • (2021) Learning Early Exit Strategies for Additive Ranking Ensembles. Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2217-2221. DOI: 10.1145/3404835.3463088. Published 11 Jul 2021.
    • (2020) Query-level Early Exit for Additive Learning-to-Rank Ensembles. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2033-2036. DOI: 10.1145/3397271.3401256. Published 25 Jul 2020.
