Article

A regression framework for learning ranking functions using relative relevance judgments

Authors:

Hongyuan ZhaAuthors Info & Claims

SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval

Pages 287 - 294

https://doi.org/10.1145/1277741.1277792

Published: 23 July 2007 Publication History

Abstract

Effective ranking functions are an essential part of commercial search engines. We focus on developing a regression framework for learning ranking functions for improving relevance of search engines serving diverse streams of user queries. We explore supervised learning methodology from machine learning, and we distinguish two types of relevance judgments used as the training data: 1) absolute relevance judgments arising from explicit labeling of search results; and 2) relative relevance judgments extracted from user click throughs of search results or converted from the absolute relevance judgments. We propose a novel optimization framework emphasizing the use of relative relevance judgments. The main contribution is the development of an algorithm based on regression that can be applied to objective functions involving preference data, i.e., data indicating that a document is more relevant than another with respect to a query. Experimental results are carried out using data sets obtained from a commercial search engine. Our results show significant improvements of our proposed methods over some existing methods.

References

[1]

R. Atterer, M. Wunk, and A. Schmidt. Knowing the user's every move: user activity tracking for website usability evaluation and implicit interaction. Proceedings of the 15th International Conference on World Wide Web 203--212,2006.

Digital Library

[2]

A. Berger. Statistical machine learning for information retrieval Ph.D. Thesis, School of Computer Science, Carnegie Mellon University, 2001.

Digital Library

[3]

D. Bertsekas. Nonlinear programming Athena Scienti?c, second edition, 1999.

[4]

C. Burges, T. Shaked, E. Renshaw, A. Lazier, M. Deeds, N. Hamilton, and G. Hullender. Learning to rank using gradient descent. Proceedings of international conference on Machine learning 89--96, 2005.

Digital Library

[5]

H. Chen. Machine Learning for information retrieval: Neural networks, symbolic learning and genetic algorithms. JASIS 46:194--216, 1995.

Digital Library

[6]

W. Cooper, F. Gey and A. Chen. Probabilistic retrieval in the TIPSTER collections: an application of staged logistic regression. Proceedings of TREC 73--88, 1992.

[7]

D. Cossock and T. Zhang. Subset ranking using regression. COLT 2006.

Digital Library

[8]

Y. Freund, R. Iyer, R. Schapire and Y. Singer. An efficient boosting algorithm for combining preferences. Journal of Machine Learning Research 4:933--969, 2003.

Digital Library

[9]

J. Friedman. Greedy function approximation: a gradient boosting machine. Ann. Statist. 29:1189--1232, 2001.

[10]

N. Fuhr. Optimum polynomial retrieval functions based on probability ranking principle. ACM Transactions on Information Systems 7:183--204, 1989.

Digital Library

[11]

F. Gey, A. Chen, J. He and J. Meggs. Logistic regression at TREC4: probabilistic retrieval from full text document collections. Proceedings of TREC 65--72, 1995.

[12]

K. Järvelin and J.Kekäläinen.Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems 20:422--446, 2002.

Digital Library

[13]

T. Joachims. Optimizing search engines using clickthrough data. Proceedings of the ACM Conference on Knowledge Discovery and Data Mining 2002.

Digital Library

[14]

T. Joachims. Evaluating retrieval performance using clickthrough data. Proceedings of the SIGIR Workshop on Mathematical/Formal Methods in Information Retrieval 2002.

[15]

T. Joachims, L. Granka, B. Pang, H. Hembrooke, and G. Gay. Accurately Interpreting Clickthrough Data as Implicit Feedback. Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2005.

Digital Library

[16]

J. Ponte and W. Croft. A language modeling approach to information retrieval. In Proceedings of the ACM Conference on Research and Development in Information Retrieval 1998.

Digital Library

[17]

G. Salton. Automatic Text Processing. Addison Wesley, Reading, MA, 1989.

Digital Library

[18]

H. Turtle and W. B. Croft. Inference networks for document retrieval. In Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 1-24, 1990.

Digital Library

[19]

H. Zha, Z. Zheng, H. Fu and G. Sun. Incorporating query difference for learning retrieval functions in worldwidewebsearch. Proceedings of the 15th ACM Conference on Information and Knowledge Management 2006.

Digital Library

[20]

Diane Kelly and Jaime Teevan. Implicit Feedback for Inferring User Preference: A Bibliography. SIGIR Forum 32:2, 2003.

Digital Library

[21]

F. Radlinski and T. Joachims. Query chains: Learning to rank from implicit feedback. Proceedings of the ACM Conference on Knowledge Discovery and Data Mining (KDD), 2005.

Digital Library

[22]

C. Zhai and J. Lafferty. A risk minimization framework for information retrieval, Information Processing and Management 42:31--55, 2006.

Digital Library

Cited By

Xi YLiu WDai XTang RLiu QZhang WYu Y(2024)Utility-Oriented Reranking with Counterfactual ContextACM Transactions on Knowledge Discovery from Data10.1145/367100418:8(1-22)Online publication date: 4-Jun-2024
https://dl.acm.org/doi/10.1145/3671004
Lin ZPan JZhang SWang XXiao XHuang SXiao LJiang JBaeza-Yates RBonchi F(2024)Understanding the Ranking Loss for Recommendation with Sparse User FeedbackProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671565(5409-5418)Online publication date: 25-Aug-2024
https://dl.acm.org/doi/10.1145/3637528.3671565
Zhu YChen LZheng CShi JXiong DHuang ZRen SChen SHao JHe RSerra ESpezzano F(2024)Collaborative Scope: Encountering the Substitution Effect within the Delivery Scope in Online Food Delivery PlatformProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3680029(5151-5158)Online publication date: 21-Oct-2024
https://dl.acm.org/doi/10.1145/3627673.3680029
Show More Cited By

Index Terms

A regression framework for learning ranking functions using relative relevance judgments
1. Information systems
  1. Information retrieval
    1. Retrieval models and ranking
  2. Information systems applications

Recommendations

Learning to rank with ties
SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval

Designing effective ranking functions is a core problem for information retrieval and Web search since the ranking functions directly impact the relevance of the search results. The problem has been the focus of much of the research at the intersection ...
Smoothing DCG for learning to rank: a novel approach using smoothed hinge functions
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management

Discounted cumulative gain (DCG) is widely used for evaluating ranking functions. It is therefore natural to learn a ranking function that directly optimizes DCG. However, DCG is non-smooth, rendering gradient-based optimization algorithms inapplicable. ...
Genetic Programming-Based Discovery of Ranking Functions for Effective Web Search

Web search engines have become an integral part of the daily life of a knowledge worker, who depends on these search engines to retrieve relevant information from the Web or from the company's vast document databases. Current search engines are very ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval

July 2007

946 pages

ISBN:9781595935977

DOI:10.1145/1277741

General Chairs:
Wessel Kraaij
TNO, The Netherlands
,
Arjen P. de Vries
CWI, The Netherlands
,
Program Chairs:
Charles L. A. Clarke
University of Waterloo, Canada
,
Norbert Fuhr
University of Duisburg-Essen, Germany
,
Noriko Kando
National Institute of Informatics, Japan

Copyright © 2007 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 July 2007

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Article

Conference

SIGIR07

Sponsor:

SIGIR07: The 30th Annual International SIGIR Conference

July 23 - 27, 2007

Amsterdam, The Netherlands

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

137
Total Citations
View Citations
1,591
Total Downloads

Downloads (Last 12 months)18
Downloads (Last 6 weeks)0

Reflects downloads up to 16 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Xi YLiu WDai XTang RLiu QZhang WYu Y(2024)Utility-Oriented Reranking with Counterfactual ContextACM Transactions on Knowledge Discovery from Data10.1145/367100418:8(1-22)Online publication date: 4-Jun-2024
https://dl.acm.org/doi/10.1145/3671004
Lin ZPan JZhang SWang XXiao XHuang SXiao LJiang JBaeza-Yates RBonchi F(2024)Understanding the Ranking Loss for Recommendation with Sparse User FeedbackProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671565(5409-5418)Online publication date: 25-Aug-2024
https://dl.acm.org/doi/10.1145/3637528.3671565
Zhu YChen LZheng CShi JXiong DHuang ZRen SChen SHao JHe RSerra ESpezzano F(2024)Collaborative Scope: Encountering the Substitution Effect within the Delivery Scope in Online Food Delivery PlatformProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3680029(5151-5158)Online publication date: 21-Oct-2024
https://dl.acm.org/doi/10.1145/3627673.3680029
Zhu ZZhang NZhu K(2024)Big portfolio selection by graph-based conditional moments methodJournal of Empirical Finance10.1016/j.jempfin.2024.10153378(101533)Online publication date: Sep-2024
https://doi.org/10.1016/j.jempfin.2024.101533
Shi YZhang HLi NYang T(2024)An overview of sentence ordering taskInternational Journal of Data Science and Analytics10.1007/s41060-024-00550-918:1(1-18)Online publication date: 25-Apr-2024
https://doi.org/10.1007/s41060-024-00550-9
Wang QLi HXiong HWang WBian JLu YWang SCheng ZDou DYin D(2024)A Simple yet Effective Framework for Active Learning to RankMachine Intelligence Research10.1007/s11633-023-1422-z21:1(169-183)Online publication date: 15-Jan-2024
https://doi.org/10.1007/s11633-023-1422-z
Pergantis MKouretsis AGiannakoulopoulos A(2023)Investigating Online Art Search through Quantitative Behavioral Data and Machine Learning TechniquesAnalytics10.3390/analytics20200212:2(359-392)Online publication date: 26-Apr-2023
https://doi.org/10.3390/analytics2020021
Li YXiong HKong LWang QWang SChen GYin DSingh ASun YAkoglu LGunopulos DYan XKumar ROzcan FYe J(2023)S2phere: Semi-Supervised Pre-training for Web Search over Heterogeneous Learning to Rank DataProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599935(4437-4448)Online publication date: 6-Aug-2023
https://dl.acm.org/doi/10.1145/3580305.3599935
Seifikar MPhan Minh LArabzadeh NClarke CSmucker MChen HDuh WHuang HKato MMothe JPoblete B(2023)A Preference Judgment Tool for Authoritative AssessmentProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591801(3100-3104)Online publication date: 19-Jul-2023
https://dl.acm.org/doi/10.1145/3539618.3591801
Li YXiong HWang QKong LLiu HLi HBian JWang SChen GDou DYin D(2023) COLTR : Semi-Supervised Learning to Rank With Co-Training and Over-Parameterization for Web Search IEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.327075035:12(12542-12555)Online publication date: 1-Dec-2023
https://doi.org/10.1109/TKDE.2023.3270750
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten