ABSTRACT
We demonstrate YaLi, a browser plug-in that discovers named entities in Web pages and provides background knowledge about them. The plug-in serves two purposes. From the user's perspective, it enriches the browsing experience with entity information, helping users satisfy their information needs. From the research perspective, we aim to improve methods for named entity recognition and disambiguation (NERD) by using the plug-in as an implicit crowdsourcing platform. YaLi tracks the system's errors and the users' corrections, and also gathers implicit training data for improving NERD accuracy.
Index Terms
- YaLi: a crowdsourcing plug-in for NERD