research-article

Survey and evaluation of query intent detection methods

Authors:
David J. Brenes

Indigo Group, Oviedo, Spain

Indigo Group, Oviedo, Spain
View Profile

,
Daniel Gayo-Avello

University of Oviedo, Oviedo, Spain

University of Oviedo, Oviedo, Spain
View Profile

,
Kilian Pérez-González

University of Oviedo, Spain

University of Oviedo, Spain
View Profile

WSCD '09: Proceedings of the 2009 workshop on Web Search Click DataFebruary 2009Pages 1–7https://doi.org/10.1145/1507509.1507510

Published:09 February 2009Publication History

WSCD '09: Proceedings of the 2009 workshop on Web Search Click Data

Pages 1–7

ABSTRACT

User interactions with search engines reveal three main underlying intents, namely navigational, informational, and transactional. By providing more accurate results depending on such query intents the performance of search engines can be greatly improved. Therefore, query classification has been an active research topic for the last years. However, while query topic classification has deserved a specific bakeoff, no evaluation campaign has been devoted to the study of automatic query intent detection. In this paper some of the available query intent detection techniques are reviewed, an evaluation framework is proposed, and it is used to compare those methods in order to shed light on their relative performance and drawbacks. As it will be shown, manually prepared gold-standard files are much needed, and traditional pooling is not the most feasible evaluation method. In addition to this, future lines of work in both query intent detection and its evaluation are proposed.

References

Baeza-Yates, R., Calderon-Benavides, L., and Gonzalez-Caro, C. The Intention Behind Web Queries. Lecture Notes in Computer Science 4209, (2006), 98. Google ScholarDigital Library
Bailey, P., Craswell, N., Soboroff, I., Thomas, P., de Vries, A. P., and Yilmaz, E. Relevance assessment: are judges exchangeable and does it matter. Proceedings of the 31^st annual international ACM SIGIR conference on Research and development in information retrieval, (2008), 667--674. Google ScholarDigital Library
Beitzel, S. M., Jensen, E. C., Chowdhury, A., and Frieder, O. Varying approaches to topical web query classification. Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, (2007), 783--784. Google ScholarDigital Library
Beitzel, S. M., Jensen, E. C., Frieder, O., Lewis, D. D., Chowdhury, A., and Kolcz, A. Improving automatic query classification via semi-supervised learning. Proceedings of the Fifth IEEE International Conference on Data Mining, (2005), 42--49. Google ScholarDigital Library
Brenes, D. J., and Gayo-Avello, D. Automatic detection of navigational queries according to Behavioural Characteristics. LWA 2008 Workshop Proceedings, (2008), 41--48.Google Scholar
Broder, A. A taxonomy of web search. ACM SIGIR Forum 36, 2 (2002), 3--10. Google ScholarDigital Library
Broder, A. Z., Fontoura, M., Gabrilovich, E., Joshi, A., Josifovski, V., and Zhang, T. Robust classification of rare queries using web knowledge. Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, (2007), 231--238. Google ScholarDigital Library
Buckley, C., Dimmick, D., Soboroff, I., and Voorhees, E. Bias and the limits of pooling for large collections. Information Retrieval 10, 6 (2007), 491--508. Google ScholarDigital Library
Buzikashvili, N. Sliding window technique for the web log analysis. Proceedings of the 16th international conference on World Wide Web, (2007), 1213--1214. Google ScholarDigital Library
Dai, H. K., Zhao, L., Nie, Z., Wen, J. R., Wang, L., and Li, Y. Detecting online commercial intention (OCI). Proceedings of the 15th international conference on World Wide Web, (2006), 829--837. Google ScholarDigital Library
Flanagan, D. MQL Reference Guide, (2008). Available at: http://mql.freebaseapps.com/ (Accessed 24 November 2008)Google Scholar
Gravano, L., Hatzivassiloglou, V., and Lichtenstein, R. Categorizing web queries according to geographical locality. Proceedings of the twelfth international conference on Information and knowledge management, (2003), 325--333. Google ScholarDigital Library
Jansen, B. J., Booth, D. L., and Spink, A. Determining the informational, navigational, and transactional intent of Web queries. Information Processing and Management 44, 3 (2008), 1251--1266. Google ScholarDigital Library
Jones, K. S. and van Rijsbergen, C. Report on the need for and provision of an "ideal" information retrieval test collection. British Library Research and Development, 1975. Cited by Voorhees, E. M. and Harman, D. The text retrieval conferences (TRECS). Proceedings of a workshop on held at Baltimore, Maryland: October 13--15, 1998, Association for Computational Linguistics Morristown, NJ, USA (1998), 241--273.Google Scholar
Kang, I. H. and Kim, G. C. Query type classification for web document retrieval. Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, (2003), 64--71. Google ScholarDigital Library
Kittur, A., Chi, E., and Suh, B. Crowdsourcing user studies with Mechanical Turk. Proceedings of the 26^th annual SIGCHI conference on Human Factors in Computing Systems, (2008), 453--456. Google ScholarDigital Library
Lee, U., Liu, Z., and Cho, J. Automatic identification of user goals in Web search. Proceedings of the 14th international conference on World Wide Web, (2005), 391--400. Google ScholarDigital Library
Li, X., Wang, Y. Y., and Acero, A. Learning query intent from regularized click graphs. Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, (2008), 339--346. Google ScholarDigital Library
Li, Y., Krishnamurthy, R., Vaithyanathan, S., and Jagadish, H. V. Getting work done on the web: supporting transactional queries. Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, (2006), 557--564. Google ScholarDigital Library
Li, Y., Zheng, Z., and Dai, H. K. KDD CUP-2005 report: facing a great challenge. ACM SIGKDD Explorations Newsletter 7, 2 (2005), 91--99. Google ScholarDigital Library
Liu, Y., Zhang, M., Ru, L., and Ma, S. Automatic Query Type Identification Based on Click Through Information. Lecture Notes in Computer Science 4182, (2006), 593--600. Google ScholarDigital Library
Microsoft. Microsoft Research Microsoft Live Labs: Accelerating Search in Academic Research 2006, Request for Proposals, (2006). Available at: http://research.microsoft.com/ur/us/fundingopps/RFPs/Search_2006_RFP.aspx (accessed 24 November 2008).Google Scholar
Nettleton, D., Calderon, L., and Baeza-Yates, R. Analysis of Web Search Engine Query Sessions. Proc. of WebKDD, (2006), 20--23. Google ScholarDigital Library
Pu, H. T., Chuang, S. L., and Yang, C. Subject categorization of query terms for exploring Web users' search interests. Journal of the American Society for Information Science and Technology 53, 8 (2002), 617--630. Google ScholarDigital Library
Rose, D. E. and Levinson, D. Understanding user goals in web search. Proceedings of the 13th international conference on World Wide Web, (2004), 13--19. Google ScholarDigital Library
Shen, D., Pan, R., Sun, J. T., et al. Query enrichment for web-query classification. ACM Transactions on Information Systems (TOIS) 24, 3 (2006), 320--352. Google ScholarDigital Library
Shen, D., Sun, J. T., Yang, Q., and Chen, Z. Building bridges for web query classification. Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, (2006), 131--138. Google ScholarDigital Library
Spärck-Jones, K. Automatic indexing, Journal of Documentation 30, (1974), 393--432.Google ScholarCross Ref
Spink, A., Wolfram, D., Jansen, M. B. J., and Saracevic, T. Searching the web: The public and their queries. Journal of the American Society for Information Science and Technology 52, 3 (2001), 226--234. Google ScholarDigital Library
Tamine, L., Daoud, M., Dinh, B. D., and Boughanem, M. Contextual query classification in web search. LWA 2008 Workshop Proceedings, (2008), 65--68.Google Scholar
Taylor, A. R., Cool, C., Belkin, N. J., and Amadio, W. J. Relationships between categories of relevance criteria and stage in task completion. Information Processing & Management 43, 4 (2007), 1071--1084. Google ScholarDigital Library
Wang, L., Wang, C., Xie, X., et al. Detecting dominant locations from search queries. Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, (2005), 424--431. Google ScholarDigital Library
Yilmaz, E. and Aslam, J. A. Estimating average precision with incomplete and imperfect judgments. Proceedings of the 15th ACM international conference on Information and knowledge management, ACM Press New York, NY, USA (2006), 102--111. Google ScholarDigital Library
Zhang, Y. and Moffat, A. Separating Human and Non-Human Web Queries. Web Information Seeking and Interaction, (2007), 13--16.Google Scholar

Index Terms

Survey and evaluation of query intent detection methods
1. Human-centered computing
  1. Collaborative and social computing
    1. Collaborative and social computing systems and tools
2. Information systems

Recommendations

Learning query intent from regularized click graphs
SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval

This work presents the use of click graphs in improving query intent classifiers, which are critical if vertical search and general-purpose search services are to be offered in a unified user interface. Previous works on query classification have ...
Read More
Characterizing commercial intent
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management

Understanding the intent underlying user's queries may help personalize search results and therefore improve user satisfaction. We develop a methodology for using the content of search engine result pages (SERPs) along with the information obtained from ...
Read More
The influence of commercial intent of search results on their perceived relevance
iConference '11: Proceedings of the 2011 iConference

We carried out a retrieval effectiveness test on the three major web search engines (i.e., Google, Microsoft and Yahoo). In addition to relevance judgments, we classified the results according to their commercial intent and whether or not they carried ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WSCD '09: Proceedings of the 2009 workshop on Web Search Click Data
February 2009
95 pages
ISBN:9781605584348
DOI:10.1145/1507509
Conference Chairs:
Nick Craswell,
Rosie Jones,
Georges Dupret,
Evelyne Viegas
Copyright © 2009 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 9 February 2009
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
MSN query log
click-through data
evaluation
query intent detection
web search behavior
Qualifiers
- research-article
Conference
Upcoming Conference
WSDM '25

Sponsor:

sigir

sigir

sigir

sigir

The Eighteenth ACM International Conference on Web Search and Data Mining

April 7 - 11, 2025

Hannover , Germany
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 39
  Total Citations
  View Citations
- 1,447
  Total Downloads
- Downloads (Last 12 months)36
- Downloads (Last 6 weeks)7
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Survey and evaluation of query intent detection methods

WSCD '09: Proceedings of the 2009 workshop on Web Search Click Data

ABSTRACT

References

Cited By

Index Terms

Recommendations

Learning query intent from regularized click graphs

Characterizing commercial intent

The influence of commercial intent of search results on their perceived relevance

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Survey and evaluation of query intent detection methods

WSCD '09: Proceedings of the 2009 workshop on Web Search Click Data

ABSTRACT

References

Cited By

Index Terms

Recommendations

Learning query intent from regularized click graphs

Characterizing commercial intent

The influence of commercial intent of search results on their perceived relevance

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media