DOI: 10.1145/1835449.1835664
poster

Incorporating global information into named entity recognition systems using relational context

Published: 19 July 2010 Publication History

Abstract

The state of the art in Named Entity Recognition relies on a combination of local features of the text and global knowledge to determine the types of the recognized entities. This is problematic in some cases, resulting in entities being assigned the wrong type. We show that using global information about the corpus improves the accuracy of type identification. We explore the notion of a global domain frequency, which relates relation-identifying terms to the pairs of entity types used in that relation. We use this to identify entities whose types are not compatible with the terms they co-occur with in the text. Our results on a large corpus of social media content allow the identification of mistyped entities with 70% accuracy.
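
The abstract does not give a formula for domain frequency, so the following is only a minimal sketch of one plausible reading: the relative frequency with which a relational term co-occurs with each pair of entity types, with rare pairs flagged as possibly mistyped. The function names, the 0.3 threshold, and the toy triples are all assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter, defaultdict


def domain_frequencies(triples):
    """Map each relational term to the relative frequency of each entity-type pair.

    `triples` is an iterable of (term, type_of_first_entity, type_of_second_entity).
    This relative-frequency definition is an assumption, not the paper's formula.
    """
    counts = defaultdict(Counter)
    for term, type1, type2 in triples:
        counts[term][(type1, type2)] += 1
    return {
        term: {pair: n / sum(pairs.values()) for pair, n in pairs.items()}
        for term, pairs in counts.items()
    }


def flag_suspects(triples, df, threshold=0.3):
    """Return the triples whose type pair is rare for their relational term.

    The threshold is an arbitrary illustrative value.
    """
    return [
        (term, t1, t2)
        for term, t1, t2 in triples
        if df[term].get((t1, t2), 0.0) < threshold
    ]


# Toy, hand-made data: (relational term, type of entity 1, type of entity 2).
corpus = [
    ("married", "PER", "PER"),
    ("married", "PER", "PER"),
    ("married", "PER", "PER"),
    ("married", "PER", "PER"),
    ("married", "PER", "ORG"),   # unusual pair for "married"
    ("acquired", "ORG", "ORG"),
    ("acquired", "ORG", "ORG"),
    ("acquired", "ORG", "ORG"),
    ("acquired", "ORG", "PER"),  # unusual pair for "acquired"
]

df = domain_frequencies(corpus)
suspects = flag_suspects(corpus, df)
print(suspects)
```

In this sketch a flagged triple only signals that one of its entity types may be wrong; deciding which entity is mistyped, and what its correct type is, would need further evidence that the abstract does not describe.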

    Published In

    SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
    July 2010
    944 pages
    ISBN: 9781450301534
    DOI: 10.1145/1835449
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. domain frequency
    2. named entity recognition

    Qualifiers

    • Poster

    Conference

    SIGIR '10
    Acceptance Rates

    SIGIR '10 Paper Acceptance Rate 87 of 520 submissions, 17%;
    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Cited By

    • (2017) A Novel Word Clustering and Cluster Merging Technique for Named Entity Recognition. Journal of Intelligent Systems 28:1 (15-30). DOI: 10.1515/jisys-2016-0074. Online publication date: 7-Jun-2017.
    • (2014) Micro-Blogs Entity Recognition Based on DSTCRF. Chinese Journal of Electronics 23:1 (147-150). DOI: 10.23919/CJE.2014.10848025. Online publication date: Jan-2014.
    • (2012) A weighting scheme for open information extraction. Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop (60-65). DOI: 10.5555/2385736.2385749. Online publication date: 3-Jun-2012.
    • (2012) Clustering techniques for open relation extraction. Proceedings of the SIGMOD/PODS 2012 PhD Symposium (27-32). DOI: 10.1145/2213598.2213607. Online publication date: 20-May-2012.
