demonstration

YaLi: a crowdsourcing plug-in for NERD

Authors:
Yafang Wang, Max Planck Institute for Informatics, Saarbrücken, Germany
Lili Jiang, Max Planck Institute for Informatics, Saarbrücken, Germany
Johannes Hoffart, Max Planck Institute for Informatics, Saarbrücken, Germany
Gerhard Weikum, Max Planck Institute for Informatics, Saarbrücken, Germany

SIGIR '13: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval, July 2013, Pages 1111–1112
https://doi.org/10.1145/2484028.2484206

Published: 28 July 2013

ABSTRACT

We demonstrate the YaLi browser plug-in, which discovers named entities in Web pages and provides background knowledge about them. The plug-in serves two purposes. From the user's perspective, it enriches the browsing experience with entities, helping users with their information needs. From the research perspective, we aim to improve the methods used for named entity recognition and disambiguation (NERD) by leveraging the plug-in as an implicit crowdsourcing platform. YaLi tracks the system's errors and the users' corrections, and also gathers implicit training data for improving NERD accuracy.
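The feedback loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Annotation` and `FeedbackLog` classes, their fields, and the rule that an uncorrected annotation counts as implicitly accepted are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Annotation:
    """One entity annotation shown to the user (hypothetical structure)."""
    mention: str           # surface text found on the page, e.g. "Paris"
    predicted_entity: str  # entity the NERD system chose
    context: str           # surrounding text, kept as a training feature


@dataclass
class FeedbackLog:
    """Collects system errors and training examples from user feedback."""
    errors: List[Tuple[str, str, str]] = field(default_factory=list)
    training: List[Tuple[str, str, str]] = field(default_factory=list)

    def record(self, ann: Annotation, correction: Optional[str] = None) -> None:
        # Assumption: no correction means the user accepted the prediction.
        gold = ann.predicted_entity if correction is None else correction
        if gold != ann.predicted_entity:
            # Stored as (mention, wrong prediction, corrected entity).
            self.errors.append((ann.mention, ann.predicted_entity, gold))
        # Stored as (mention, context, gold entity).
        self.training.append((ann.mention, ann.context, gold))

    def error_rate(self) -> float:
        """Fraction of recorded annotations that the user corrected."""
        return len(self.errors) / len(self.training) if self.training else 0.0


log = FeedbackLog()
log.record(Annotation("Paris", "Paris_Hilton", "cheap flights to Paris"), correction="Paris")
log.record(Annotation("Berlin", "Berlin", "the Berlin city council"))
print(log.errors)
print(log.error_rate())
```

Keeping errors and training examples in separate lists mirrors the two roles named in the abstract: the error list supports analysis of where the system fails, while the training list could feed a later retraining step.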


Index Terms

1. Information systems
  1. Information systems applications

Published in
SIGIR '13: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
July 2013
1188 pages
ISBN: 9781450320344
DOI: 10.1145/2484028
General Chairs:
Gareth J.F. Jones, Dublin City University, Ireland
Páraic Sheridan, Dublin City University, Ireland
Program Chairs:
Diane Kelly, University of North Carolina, Chapel Hill, USA
Maarten de Rijke, University of Amsterdam, The Netherlands
Tetsuya Sakai, Microsoft Research Asia, China
Copyright © 2013 held by the owner/author(s)
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
browser plug-in
crowdsourcing
named entity disambiguation
named entity recognition
Qualifiers
- demonstration
Acceptance Rates
SIGIR '13 paper acceptance rate: 73 of 366 submissions, 20%
Overall acceptance rate: 792 of 3,983 submissions, 20%