skip to main content
10.1145/2983323.2983909acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper
Public Access

Improving Entity Ranking for Keyword Queries

Authors Info & Claims
Published:24 October 2016Publication History

ABSTRACT

Knowledge bases about entities are an important part of modern information retrieval systems. A strong ranking of entities can be used to enhance query understanding and document retrieval or can be presented as another vertical to the user. Given a keyword query, our task is to provide a ranking of the entities present in the collection of interest. We are particularly interested in approaches to this problem that generalize to different knowledge bases and different collections. In the past, this kind of problem has been explored in the enterprise domain through Expert Search. Recently, a dataset was introduced for entity ranking from news and web queries from more general TREC collections.

Approaches from prior work leverage a wide variety of lexical resources: e.g., natural language processing and relations in the knowledge base. We address the question of whether we can achieve competitive performance with minimal linguistic resources.

We propose a set of features that do not require index-time entity linking, and demonstrate competitive performance on the new dataset. As this paper is the first non-introductory work to leverage this new dataset, we also find and correct certain aspects of the benchmark. To support a fair evaluation, we collect 38% more judgments and contribute annotator agreement information.

References

  1. S. Auer, C. Bizer, G. Kobilarov, J. Lehmann, R. Cyganiak, and Z. Ives. Dbpedia: A nucleus for a web of open data. Springer, 2007.Google ScholarGoogle Scholar
  2. K. Balog, L. Azzopardi, and M. De Rijke. Formal models for expert finding in enterprise corpora. In SIGIR'06, pages 43--50. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. K. Balog, Y. Fang, M. de Rijke, P. Serdyukov, and L. Si. Expertise retrieval. Foundations and Trends in Information Retrieval, 6(2--3):127--256, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. K. Balog, P. Serdyukov, and A. P. d. Vries. Overview of the trec 2010 entity track. Technical report, DTIC Document, 2010.Google ScholarGoogle Scholar
  5. A. Boldyrev, G. Weikum, and M. Theobald. Dictionary-Based Named Entity Recognition. PhD thesis, Universitat des Saarlandes Saarbrücken, 2013.Google ScholarGoogle Scholar
  6. D. Carmel, M.-W. Chang, E. Gabrilovich, B.-J. P. Hsu, and K. Wang. ERD'14: entity recognition and disambiguation challenge. In ACM SIGIR Forum, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. N. Craswell, A. P. de Vries, and I. Soboroff. Overview of the trec 2005 enterprise track. In TREC, pages 199--205, 2005.Google ScholarGoogle Scholar
  8. W. B. Croft, D. Metzler, and T. Strohman. Search engines: Information retrieval in practice. Addison-Wesley Reading, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Dalton, L. Dietz, and J. Allan. Entity query feature expansion using knowledge base links. In SIGIR'14, pages 365--374. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. V. Dang. Ranklib. https://sourceforge.net/p/lemur/wiki/RankLib, 2015.Google ScholarGoogle Scholar
  11. G. Demartini, T. Iofciu, and A. P. De Vries. Overview of the INEX 2009 entity ranking track. In Focused Retrieval and Evaluation, pages 254--264. Springer, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Dunietz and D. Gillick. A new entity salience task with millions of training examples. EACL'14, page 205.Google ScholarGoogle Scholar
  13. P. Ferragina and U. Scaiella. Tagme: on-the-fly annotation of short text fragments (by wikipedia entities). In CIKM'10, pages 1625--1628. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. E. Gabrilovich, M. Ringgaard, and A. Subramanya. Freebase annotation of clueweb corpora. http://lemurproject.org/clueweb09/FACC1/, June 2013.Google ScholarGoogle Scholar
  15. F. Hasibi, K. Balog, and S. E. Bratsberg. On the reproducibility of the tagme entity linking system. In ECIR'16, pages 436--449.Google ScholarGoogle Scholar
  16. J. Hoffart, D. Milchevski, and G. Weikum. Stics: searching with strings, things, and cats. In SIGIR'14, pages 1247--1248. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. Hoffart, F. M. Suchanek, K. Berberich, and G. Weikum. Yago2: A spatially and temporally enhanced knowledge base from wikipedia. Artificial Intelligence, 194:28--61, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Y. Hong, D. Lu, D. Yu, X. Pan, X. Wang, Y. Chen, L. Huang, and H. Ji. RPI Blender TAC-KBP2015 system description. In Text Analysis Conference, 2015.Google ScholarGoogle Scholar
  19. H. Ji, J. Nothman, B. Hachey, and R. Florian. Overview of TAC-KBP2015 Tri-lingual Entity Discovery and Linking. 2015.Google ScholarGoogle Scholar
  20. V. Lavrenko and W. B. Croft. Relevance based language models. In SIGIR 2001, pages 120--127, New York, NY, USA, 2001. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. X. Liu and H. Fang. Latent entity space: a novel retrieval approach for entity-bearing queries. Information Retrieval Journal, 18(6):473--503, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. C. D. Manning, M. Surdeanu, J. Bauer, J. Finkel, S. J. Bethard, and D. McClosky. The Stanford CoreNLP natural language processing toolkit. In ACL'14 Demo.Google ScholarGoogle Scholar
  23. P. McNamee and H. T. Dang. Overview of the TAC 2009 knowledge base population track. In TAC'09, volume 17, pages 111--113, 2009.Google ScholarGoogle Scholar
  24. D. Petkova and W. B. Croft. Proximity-based document representation for named entity retrieval. In CIKM'07, pages 731--740. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. M. Schuhmacher, L. Dietz, and S. Ponzetto. Ranking entities for web queries through text and knowledge. In CIKM'15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. C. Xiong and J. Callan. EsdRank: Connecting Query and Documents through External Semi-Structured Data. In CIKM'15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. N. Zhiltsov, A. Kotov, and F. Nikolaev. Fielded sequential dependence model for ad-hoc entity retrieval in the web of data. In SIGIR'15, pages 253--262. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. G. Zuccon, B. Koopman, and P. Bruza. Exploiting inference from semantic annotations for information retrieval: Reflections from medical IR. In ESAIR'14, pages 43--45, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Improving Entity Ranking for Keyword Queries

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
      October 2016
      2566 pages
      ISBN:9781450340731
      DOI:10.1145/2983323

      Copyright © 2016 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 October 2016

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      CIKM '16 Paper Acceptance Rate160of701submissions,23%Overall Acceptance Rate1,861of8,427submissions,22%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader