ABSTRACT
The DBpedia-entity collection has been used as a standard test collection for entity search in recent years. We develop and release a new version of this test collection, DBpedia-Entity v2, which uses a more recent DBpedia dump and a unified candidate result pool from the same set of retrieval models. Relevance judgments are also collected in a uniform way, using the same group of crowdsourcing workers, following the same assessment guidelines. The result is an up-to-date and consistent test collection.To facilitate further research, we also provide details about the pre-processing and indexing steps, and include baseline results from both classical and recently developed entity search methods.
- Krisztian Balog and Robert Neumayer 2012. Hierarchical target type identification for entity-oriented queries Proc. of CIKM'12. 2391--2394.Google Scholar
- Krisztian Balog and Robert Neumayer 2013. A Test Collection for Entity Search in DBpedia. Proc. of SIGIR'13. 737--740. Google ScholarDigital Library
- Krisztian Balog, Pavel Serdyukov, Arjen De Vries, Paul Thomas, and Thijs Westerveld 2010. Overview of the TREC 2009 Entity Track. In Proc. of TREC'09.Google Scholar
- Roi Blanco, Harry Halpin, Daniel M Herzig, Peter Mika, Jeffrey Pound, and Henry S Thompson. 2011. Entity Search Evaluation over Structured Web Data. Proc. of the 1st International Workshop on Entity-Oriented Search. 65--71.Google Scholar
- Jing Chen, Chenyan Xiong, and Jamie Callan 2016. An Empirical Study of Learning to Rank for Entity Search Proc. of SIGIR'16. 737--740.Google Scholar
- Gianluca Demartini, Tereza Iofciu, and Arjen P De Vries. 2009. Overview of the INEX 2009 Entity Ranking Track. In INEX. 254--264.Google ScholarDigital Library
- Paolo Ferragina and Ugo Scaiella 2010. TAGME: On-the-fly Annotation of Short Text Fragments (by Wikipedia Entities) Proc. of CIKM'10. 1625--1628. Google ScholarDigital Library
- Dario Garigliotti, Faegheh Hasibi, and Krisztian Balog. 2017. Target Type Identification for Entity-Bearing Queries Proc. of SIGIR'17.Google Scholar
- Jiafeng Guo, Gu Xu, Xueqi Cheng, and Hang Li. 2009. Named Entity Recognition in Query. In Proc. of SIGIR'09. 267--274. Google ScholarDigital Library
- Harry Halpin, Daniel M Herzig, Peter Mika, Roi Blanco, Jeffrey Pound, Henry S Thompson, and Duc Thanh Tran 2010. Evaluating Ad-hoc Object Retrieval. In Proc. of the International Workshop on Evaluation of Semantic Technologies.Google Scholar
- Faegheh Hasibi, Krisztian Balog, and Svein Erik Bratsberg. 2016. Exploiting Entity Linking in Queries for Entity Retrieval Proc. of ICTIR'16. 171--180.Google Scholar
- Faegheh Hasibi, Krisztian Balog, and Svein Erik Bratsberg. 2017natexlaba. Dynamic Factual Summaries for Entity Cards. In Proc. of SIGIR'17.Google ScholarDigital Library
- Faegheh Hasibi, Krisztian Balog, and Svein Erik Bratsberg. 2017natexlabb. Entity Linking in Queries: Efficiency vs. Effectiveness Proc. of ECIR'17. 40--53.Google Scholar
- Jinyoung Kim, Xiaobing Xue, and W Bruce Croft. 2009. A Probabilistic Retrieval Model for Semistructured Data Proc. of ECIR'09. 228--239.Google Scholar
- Vanessa Lopez, Christina Unger, Philipp Cimiano, and Enrico Motta 2013. Evaluating Question Answering over Linked Data. Web Semantics: Science, Services and Agents on the World Wide Web, Vol. 21, 0 (2013), 3--13. Google ScholarDigital Library
- Chunliang Lu, Wai Lam, and Yi Liao 2015. Entity Retrieval via Entity Factoid Hierarchy. In Proc. of ACL'15. 514--523. Google ScholarCross Ref
- Edgar Meij, Krisztian Balog, and Daan Odijk 2014. Entity Linking and Retrieval for Semantic Search. Proc of. WSDM'14. 683--684. Google ScholarDigital Library
- Edgar Meij, Marc Bron, Laura Hollink, Bouke Huurnink, and Maarten de Rijke 2011. Mapping Queries to the Linking Open Data Cloud: A Case Study Using DBpedia. Web Semant., Vol. 9, 4 (Dec. 2011), 418--433. Google ScholarDigital Library
- Donald Metzler and W Bruce Croft 2005. A Markov Random Field Model for Term Dependencies. Proc. of SIGIR'05. 472--479. Google ScholarDigital Library
- Fedor Nikolaev, Alexander Kotov, and Nikita Zhiltsov. 2016. Parameterized Fielded Term Dependence Models for Ad-hoc Entity Retrieval from Knowledge Graph Proc. of SIGIR'16. 435--444.Google Scholar
- Paul Ogilvie and Jamie Callan 2003. Combining Document Representations for Known-item Search Proc. of SIGIR'03. 143--150.Google Scholar
- Jay M Ponte and W Bruce Croft 1998. A Language Modeling Approach to Information Retrieval Proc. of SIGIR'98. 275--281.Google Scholar
- Stephen E. Robertson and Hugo Zaragoza 2009. The Probabilistic Relevance Framework: BM25 and Beyond. Found. and Trends in IR Vol. 3, 4 (2009), 333--389.Google ScholarDigital Library
- Fabian M Suchanek, Gjergji Kasneci, and Gerhard Weikum. 2007. Yago: A Core of Semantic Knowledge. In Proc. of WWW'07. 697--706.Google ScholarDigital Library
- Qiuyue Wang, Jaap Kamps, Georgina Ramírez Camps, Maarten Marx, Anne Schuth, Martin Theobald, Sairam Gurajada, and Arunav Mishra. 2012. Overview of the INEX 2012 Linked Data Track. In CLEF Online Working Notes.Google Scholar
- Chenyan Xiong, Russell Power, and Jamie Callan. 2017. Explicit Semantic Ranking for Academic Search via Knowledge Graph Embedding Proc. of WWW'17. 1271--1279.Google ScholarDigital Library
- Nikita Zhiltsov, Alexander Kotov, and Fedor Nikolaev. 2015. Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of Data Proc. of SIGIR'15. 253--262.Google Scholar
Index Terms
- DBpedia-Entity v2: A Test Collection for Entity Search
Recommendations
A test collection for entity search in DBpedia
SIGIR '13: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrievalWe develop and make publicly available an entity search test collection based on the DBpedia knowledge base. This includes a large number of queries and corresponding relevance judgments from previous benchmarking campaigns, covering a broad range of ...
Exploiting paths for entity search in RDF graphs
SIGIR '12: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrievalThe field of entity search using Semantic Web (RDF) data has gained more interest recently. In this paper, we propose a probabilistic entity retrieval model for RDF graphs using paths in the graph. Unlike previous work which assumes that all ...
Two-stage approach to named entity recognition using Wikipedia and DBpedia
IMCOM '17: Proceedings of the 11th International Conference on Ubiquitous Information Management and CommunicationIn natural language understanding, extraction of named entity (NE) mentions in given text and classification of the mentions into pre-defined NE types are important processes. Most NE recognition (NER) relies on resources such as a training corpus or NE ...
Comments