short-paper

Quantifying Query Ambiguity with Topic Distributions

Authors:
Yuki Yano (Yahoo Japan Corporation, Tokyo, Japan)
Yukihiro Tagami (Yahoo Japan Corporation, Tokyo, Japan)
Akira Tajima (Yahoo Japan Corporation, Tokyo, Japan)

CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
October 2016
Pages 1877–1880
https://doi.org/10.1145/2983323.2983863

Published: 24 October 2016

ABSTRACT

Query ambiguity is a useful metric that helps search engines understand users' intents. Existing methods quantify query ambiguity by computing the entropy of clicks: each click is represented as a one-hot vector over a set of mutually exclusive groups. Such representations, however, cannot incorporate non-obvious structure such as similarity among documents. In this paper, we propose a new approach that quantifies query ambiguity using topic distributions. We show that it is a natural extension of an existing entropy-based method, and we use it to derive topic-based extensions of the major existing entropy-based methods. In an evaluation using e-commerce search logs combined with human judgments, our approach successfully extended existing entropy-based methods and improved the quality of query ambiguity measurements.
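
The contrast between one-hot click vectors and topic distributions can be illustrated with a small Python sketch. It is not the paper's implementation: the abstract gives no formulas, so the sketch assumes that ambiguity is the Shannon entropy of the averaged per-click vectors, which reduces to ordinary click entropy when every vector is one-hot. The function names and the toy vectors are hypothetical.

```python
import math


def entropy(dist):
    """Shannon entropy, in bits, of a probability distribution."""
    return -sum(p * math.log2(p) for p in dist if p > 0)


def mean_vector(vectors):
    """Element-wise average of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def ambiguity(click_vectors):
    """Assumed ambiguity score: entropy of the averaged per-click vectors."""
    return entropy(mean_vector(click_vectors))


# One-hot case: four clicks for one query, spread over three
# mutually exclusive groups (for example, three distinct documents).
one_hot_clicks = [
    [1, 0, 0],
    [1, 0, 0],
    [0, 1, 0],
    [0, 0, 1],
]

# Topic case: the same four clicks, each represented by a made-up
# topic distribution of the clicked document. The documents differ,
# but they share most of their topic mass.
topic_clicks = [
    [0.9, 0.1, 0.0],
    [0.8, 0.2, 0.0],
    [0.7, 0.3, 0.0],
    [0.6, 0.4, 0.0],
]

click_entropy = ambiguity(one_hot_clicks)  # averaged vector is [0.5, 0.25, 0.25]
topic_entropy = ambiguity(topic_clicks)    # averaged vector is about [0.75, 0.25, 0.0]
```

Under these assumptions the one-hot representation treats the clicked documents as unrelated and yields the higher score, while the topic representation credits their similarity and yields a lower one.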

Index Terms

1. Information systems
  1. Information retrieval
    1. Information retrieval query processing
      1. Query intent
      2. Query representation

Published in
CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
October 2016, 2566 pages
ISBN: 9781450340731
DOI: 10.1145/2983323
General Chairs: Snehasis Mukhopadhyay (Indiana University Purdue University Indianapolis, USA), ChengXiang Zhai (University of Illinois at Urbana-Champaign, USA)
Program Chairs: Elisa Bertino (Purdue University), Fabio Crestani (University of Lugano), Javed Mostafa (University of North Carolina), Jie Tang (Tsinghua University), Luo Si (Alibaba Group Inc & Purdue University), Xiaofang Zhou (University of Queensland), Yi Chang (Yahoo Research), Yunyao Li (IBM Research - Almaden), Parikshit Sondhi (WalmartLabs)
Copyright © 2016 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
query ambiguity
search intent
topic distribution
Acceptance Rates
CIKM '16 Paper Acceptance Rate: 160 of 701 submissions, 23%
Overall Acceptance Rate: 1,861 of 8,427 submissions, 22%