research-article

An experimental comparison of click position-bias models

Authors:
Nick Craswell

Microsoft Research, Cambridge UK

Microsoft Research, Cambridge UK
View Profile

,
Onno Zoeter

Microsoft Research, Cambridge UK

Microsoft Research, Cambridge UK
View Profile

,
Michael Taylor

Microsoft Research, Cambridge UK

Microsoft Research, Cambridge UK
View Profile

,
Bill Ramsey

Microsoft Research, Redmond USA

Microsoft Research, Redmond USA
View Profile

WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningFebruary 2008Pages 87–94https://doi.org/10.1145/1341531.1341545

Published:11 February 2008Publication History

WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining

Pages 87–94

ABSTRACT

Search engine click logs provide an invaluable source of relevance information, but this information is biased. A key source of bias is presentation order: the probability of click is influenced by a document's position in the results page. This paper focuses on explaining that bias, modelling how probability of click depends on position. We propose four simple hypotheses about how position bias might arise. We carry out a large data-gathering effort, where we perturb the ranking of a major search engine, to see how clicks are affected. We then explore which of the four hypotheses best explains the real-world position effects, and compare these to a simple logistic regression model. The data are not well explained by simple position models, where some users click indiscriminately on rank 1 or there is a simple decay of attention over ranks. A 'cascade' model, where users view results from top to bottom and leave as soon as they see a worthwhile document, is our best explanation for position bias in early ranks

References

Eugene Agichtein, Eric Brill, Susan Dumais, and Robert Ragno. Learning user interaction models for predicting web search result preferences. In SIGIR'06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval pages 3--10, New York, NY, USA, 2006. ACM Press. Google ScholarDigital Library
Ricardo Baeza-Yates, Carlos Hurtado, and Marcelo Mendoza. Improving search engines by query clustering. In JASIST to appear 2007. Google ScholarDigital Library
Georges Dupret, Vanessa Murdock, and Benjamin Piwowarski. Web search engine evaluation using click-through data and a user model. In Proceedings of the Workshop on Query Log Analysis (WWW)2007.Google Scholar
Georges Dupret, Benjamin Piwowarski, Carlos A. Hurtado, and Marcelo Mendoza. A statistical model of query log generation. In String Processing and Information Retrieval, 13th International Conference, SPIRE 2006 pages 217--228, 2006. Google ScholarDigital Library
Thorsten Joachims. Optimizing search engines using clickthrough data. In KDD'02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining pages 133--142, New York, NY, USA, 2002. ACM Press. Google ScholarDigital Library
Thorsten Joachims, Laura Granka, Bing Pan, Helene Hembrooke, and Geri Gay. Accurately interpreting clickthrough data as implicit feedback. In SIGIR'05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval pages 154--161, New York, NY, USA, 2005. ACM Press. Google ScholarDigital Library
Sandeep Pandey, Sourashis Roy, Christopher Olston, Junghoo Cho, and Soumen Chakrabarti. Shuffling a stacked deck: the case for partially randomized ranking of search engine results. In VLDB'05: Proceedings of the 31st international conference on Very large data bases pages 781--792. VLDB Endowment, 2005. Google ScholarDigital Library
F. Radlinski and T. Joachims. Minimally invasive randomization for collecting unbiased preferences from clickthrough logs. In Conference of the Association for the Advancement of Artificial Intelligence (AAAI) pages 1406--1412, 2006. Google ScholarDigital Library
Matthew Richardson, Ewa Dominowska, and Robert Ragno. Predicting clicks: estimating the click-through rate for new ads. In WWW'07: Proceedings of the 16th international conference on World Wide Web pages 521--530, New York, NY, USA, 2007. ACM Press. Google ScholarDigital Library

Index Terms

An experimental comparison of click position-bias models
1. Information systems
  1. Information retrieval
    1. Retrieval models and ranking

Recommendations

Random walks on the click graph
SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval

Search engines can record which documents were clicked for which query, and use these query-document pairs as "soft" relevance judgments. However, compared to the true judgments, click logs give noisy and sparse relevance information. We apply a Markov ...
Read More
Characterizing search intent diversity into click models
WWW '11: Proceedings of the 20th international conference on World wide web

Modeling a user's click-through behavior in click logs is a challenging task due to the well-known position bias problem. Recent advances in click models have adopted the examination hypothesis which distinguishes document relevance from position bias. ...
Read More
A collaborative filtering approach to ad recommendation using the query-ad click graph
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management

Search engine logs contain a large amount of click-through data that can be leveraged as soft indicators of relevance. In this paper we address the sponsored search retrieval problem which is to find and rank relevant ads to a search query. We propose a ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining
February 2008
270 pages
ISBN:9781595939272
DOI:10.1145/1341531
General Chair:
Marc Najork
Microsoft, USA
,
Program Chairs:
Andrei Broder
Yahoo!, USA
,
Soumen Chakrabarti
IIT Bombay, India
Copyright © 2008 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 11 February 2008
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
click data
user behavior
web search models
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate498of2,863submissions,17%
Upcoming Conference
WSDM '25

Sponsor:

sigir

sigir

sigir

sigir

The Eighteenth ACM International Conference on Web Search and Data Mining

April 7 - 11, 2025

Hannover , Germany
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 568
  Total Citations
  View Citations
- 3,505
  Total Downloads
- Downloads (Last 12 months)259
- Downloads (Last 6 weeks)32
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

An experimental comparison of click position-bias models

WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining

ABSTRACT

References

Cited By

Index Terms

Recommendations

Random walks on the click graph

Characterizing search intent diversity into click models

A collaborative filtering approach to ad recommendation using the query-ad click graph