DOI: 10.1145/2187980.2187986
research-article

Answering math queries with search engines

Published: 16 April 2012

ABSTRACT

Conventional search engines such as Bing and Google answer some queries with a short direct answer in addition to a ranked list of documents, in order to better meet the user's information needs. In this paper we study a class of such queries that we call math queries. Calculations (e.g. "12% of 24$", "square root of 120"), unit conversions (e.g. "convert 10 meter to feet"), and symbolic computations (e.g. "plot x^2+x+1") are examples of math queries. Among the queries that should be answered directly, math queries are special because they are formed from infinitely many combinations of numbers and symbols and rather few keywords. Answering math queries must be done through real-time computation rather than keyword search or database lookup. The lack of a formal definition for the entire range of math queries makes it hard to identify them all automatically. We propose a novel approach for recognizing and classifying math queries using large-scale search logs, and investigate its accuracy through empirical experiments and statistical analysis. The approach allows us to discover classes of math queries even if we do not know their structures in advance. It also helps to identify queries that are not math queries even though they might look like them.
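To make the point about computation concrete, here is a minimal Python sketch that answers three of the example queries quoted above by computing the result. The function name `answer_math_query`, the regular expressions, and the conversion constant are all illustrative assumptions; this is a naive hand-written matcher, not the log-based recognition approach the paper proposes.

```python
import math
import re

# Assumed, approximate conversion factor (not taken from the paper).
METERS_TO_FEET = 3.28084


def answer_math_query(query: str):
    """Return a computed answer for a recognized query shape, else None.

    The three patterns below are hypothetical and cover only the example
    queries quoted in the abstract.
    """
    q = query.strip().lower()

    # "<p>% of <n>" with an optional trailing currency sign
    m = re.fullmatch(r"(\d+(?:\.\d+)?)% of (\d+(?:\.\d+)?)\$?", q)
    if m:
        return float(m.group(1)) / 100.0 * float(m.group(2))

    # "square root of <n>"
    m = re.fullmatch(r"square root of (\d+(?:\.\d+)?)", q)
    if m:
        return math.sqrt(float(m.group(1)))

    # "convert <n> meter(s) to feet"
    m = re.fullmatch(r"convert (\d+(?:\.\d+)?) meters? to feet", q)
    if m:
        return float(m.group(1)) * METERS_TO_FEET

    # Anything else is not recognized by this toy matcher.
    return None
```

Any query outside these three shapes, including "plot x^2+x+1", returns `None`. That gap is the limitation of fixed rules the abstract points to: the numbers vary without bound, and the set of query structures is not known in advance.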

We also evaluate the usefulness of math answers based on implicit feedback from users. Traditional approaches for evaluating the quality of search results mostly rely on click information and interpret a click on a link as a sign of satisfaction. Answers to math queries do not contain links, so such metrics do not apply to them. In this paper we describe two evaluation metrics that can be applied to math queries, and present results on a large collection of math queries taken from Bing's search logs.

Published in

WWW '12 Companion: Proceedings of the 21st International Conference on World Wide Web
April 2012
1250 pages
ISBN: 9781450312301
DOI: 10.1145/2187980

Copyright © 2012 ACM


Publisher: Association for Computing Machinery, New York, NY, United States

Overall acceptance rate: 1,899 of 8,196 submissions, 23%
