Classification-enhanced ranking

Authors:

Paul N. Bennett,

Susan T. Dumais

WWW '10: Proceedings of the 19th international conference on World wide web

Pages 111 - 120

https://doi.org/10.1145/1772690.1772703

Published: 26 April 2010

Abstract

Many have speculated that classifying web pages can improve a search engine's ranking of results. Intuitively, results should be more relevant when they match the class of a query. We present a simple framework for classification-enhanced ranking that uses clicks in combination with the classification of web pages to derive a class distribution for the query. We then define a variety of features that capture the match between the class distributions of a web page and a query, the ambiguity of a query, and the coverage of a retrieved result relative to a query's set of classes. Experimental results demonstrate that a ranker learned with these features significantly improves ranking over a competitive baseline. Furthermore, our methodology is agnostic with respect to the classification space and can be used to derive query classes for a variety of different taxonomies.
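
The idea in the abstract can be illustrated with a minimal sketch. The abstract does not give formulas, so everything below is an assumption made for illustration: the query class distribution is taken as a click-weighted average of the class distributions of clicked pages, match is a dot product, ambiguity is Shannon entropy, and coverage is the fraction of the query's classes that a page also carries. The function names, the `threshold` parameter, and the example URLs are hypothetical and are not taken from the paper.

```python
import math
from collections import defaultdict


def query_class_distribution(clicks, page_classes):
    """Click-weighted average of the class distributions of clicked pages.

    clicks: dict mapping URL -> click count for a single query.
    page_classes: dict mapping URL -> {class label: probability}.
    Assumption: weighting by raw click count; the paper may differ.
    """
    totals = defaultdict(float)
    n_clicks = 0
    for url, count in clicks.items():
        if count <= 0 or url not in page_classes:
            continue  # skip pages without clicks or without a classification
        n_clicks += count
        for label, prob in page_classes[url].items():
            totals[label] += count * prob
    if n_clicks == 0:
        return {}
    return {label: mass / n_clicks for label, mass in totals.items()}


def match_score(query_dist, page_dist):
    """Assumed match feature: dot product of the two class distributions."""
    return sum(p * page_dist.get(label, 0.0) for label, p in query_dist.items())


def ambiguity(query_dist):
    """Assumed ambiguity feature: Shannon entropy (bits) of the query classes."""
    return -sum(p * math.log2(p) for p in query_dist.values() if p > 0)


def coverage(query_dist, page_dist, threshold=0.1):
    """Assumed coverage feature: share of the query's classes the page covers.

    A class counts as present when its probability is at least `threshold`.
    """
    query_set = {c for c, p in query_dist.items() if p >= threshold}
    if not query_set:
        return 0.0
    page_set = {c for c, p in page_dist.items() if p >= threshold}
    return len(query_set & page_set) / len(query_set)


# Hypothetical click log for one ambiguous query.
clicks = {"cars.example/jaguar": 3, "zoo.example/jaguar": 1}
page_classes = {
    "cars.example/jaguar": {"Autos": 1.0},
    "zoo.example/jaguar": {"Animals": 1.0},
}
q = query_class_distribution(clicks, page_classes)
features = {
    "match": match_score(q, page_classes["cars.example/jaguar"]),
    "ambiguity": ambiguity(q),
    "coverage": coverage(q, page_classes["cars.example/jaguar"]),
}
```

Because the functions only consume class labels and probabilities, the same sketch applies to any taxonomy, which mirrors the abstract's claim that the methodology is agnostic to the classification space. In a learned ranker, values such as those in `features` would be added to the existing feature vector for each query-result pair.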

Index Terms

Classification-enhanced ranking
1. Computing methodologies
  1. Machine learning
2. Information systems
  1. Information retrieval
    1. Document representation

Published In

WWW '10: Proceedings of the 19th international conference on World wide web

April 2010

1407 pages

ISBN: 9781605587998

DOI: 10.1145/1772690

General Chairs:
Michael Rappa, North Carolina State University, USA
Paul Jones, University of North Carolina at Chapel Hill, USA

Program Chairs:
Juliana Freire, University of Utah, USA
Soumen Chakrabarti, Indian Institute of Technology, India

Copyright © 2010 International World Wide Web Conference Committee (IW3C2).

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

WWW '10: The 19th International World Wide Web Conference

April 26 - 30, 2010

Raleigh, North Carolina, USA

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
