ABSTRACT
Given a knowledge base, annotating any text with entities in the knowledge base enhances automated understanding of the text. Entities provide extra contextual information for the automated system to understand and interpret the text better. In the special case when the text is in the form of short text queries, automated understanding can be critical in improving the quality of search results and recommendations. Annotation of queries helps semantic retrieval, ensuring diversity of search results including retrieval of relevant news stories. In this paper, we present SIEL@ERD, a system for automated stamping of entity information in short query text. Our system builds from the state-of-the-art TAGME system and is optimized for time and performance efficiency. Our system achieved an F1 measure of 0.53 and the latency of 0.31 seconds on a dataset of 500 queries and a Freebase snapshot provided for the short track in the Entity Recognition and Disambiguation Challenge at SIGIR 2014.
- A. E. Cano, G. Rizzo, A. Varga, M. Rowe, M. Stankovic, and A.-S. Dadzie. Microposts2014 NEEL Challenge: Measuring the Performance of Entity Linking Systems in Social Streams. In Proc. of the Microposts2014 NEEL Challenge, 2014.Google Scholar
- D. Carmel, M.-W. Chang, E. Gabrilovich, B.-J. P. Hsu, and K. Wang. ERD 2014: Entity Recognition and Disambiguation Challenge. SIGIR Forum, 2014 (forthcoming). Google ScholarDigital Library
- S. Cucerzan. Large-Scale Named Entity Disambiguation Based on Wikipedia Data. In Proc. of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pages 708--716. Association for Computational Linguistics, Jun 2007.Google Scholar
- P. Ferragina and U. Scaiella. TAGME: On-the-fly Annotation of Short Text Fragments (by Wikipedia Entities). In Proc. of the 19th ACM Intl. Conf. on Information and Knowledge Management (CIKM), pages 1625--1628, 2010. Google ScholarDigital Library
- S. Guo, M.-W. Chang, and E. Kıcıman. To Link or Not to Link? A Study on End-to-End Tweet Entity Linking. In Proc. of the Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT), pages 1020--1030, 2013.Google Scholar
- S. Kulkarni, A. Singh, G. Ramakrishnan, and S. Chakrabarti. Collective Annotation of Wikipedia Entities in Web Text. In Proc. of the 15th ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining (KDD), pages 457--466. ACM, 2009. Google ScholarDigital Library
- X. Liu, Y. Li, H. Wu, M. Zhou, F. Wei, and Y. Lu. Entity Linking for Tweets. In Proc. of the 51th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1304--1311, 2013.Google Scholar
- E. Meij, W. Weerkamp, and M. de Rijke. Adding Semantics to Microblog Posts. In Proc. of the 5th ACM Intl. Conf. on Web Search and Data Mining (WSDM), pages 563--572. ACM, 2012. Google ScholarDigital Library
- R. Mihalcea and A. Csomai. Wikify!: Linking Documents to Encyclopedic Knowledge. In Proc. of the 16th ACM Conf. on Information and Knowledge Management (CIKM), pages 233--242. ACM, 2007. Google ScholarDigital Library
- D. Milne and I. H. Witten. An Effective, Low-Cost Measure of Semantic Relatedness Obtained from Wikipedia Links. In Proc. of the AAAI Workshop on Wikipedia and Artificial Intelligence: An Evolving Synergy, 2008.Google Scholar
- D. Milne and I. H. Witten. Learning to Link with Wikipedia. In Proc. of the 17th ACM Conf. on Information and Knowledge Management (CIKM), pages 509--518. ACM, 2008. Google ScholarDigital Library
- A. Ritter, S. Clark, Mausam, and O. Etzioni. Named Entity Recognition in Tweets: An Experimental Study. In Proc. of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2011. Google ScholarDigital Library
Index Terms
Exploiting Wikipedia inlinks for linking entities in queries
Recommendations
Re-ranking for joint named-entity recognition and linking
CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge ManagementRecognizing names and linking them to structured data is a fundamental task in text analysis. Existing approaches typically perform these two steps using a pipeline architecture: they use a Named-Entity Recognition (NER) system to find the boundaries of ...
Exploiting Relevant Hyperlinks in Knowledge Base for Entity Linking
Advances in Knowledge Discovery and Data MiningAbstractIn this study, we propose a new model aiming to enhance the quality of entity linking by exploiting highly relevant hyperlinks in knowledge base for entity disambiguation. We find that most existing studies do not filter the corresponding ...
DAWT: Densely Annotated Wikipedia Texts Across Multiple Languages
WWW '17 Companion: Proceedings of the 26th International Conference on World Wide Web CompanionIn this work, we open up the DAWT dataset - Densely Annotated Wikipedia Texts across multiple languages. The annotations include labeled text mentions mapping to entities (represented by their Freebase machine ids) as well as the type of the entity. The ...
Comments