skip to main content
10.1145/2492517.2500251acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Biomedical text disambiguation using UMLS

Published:25 August 2013Publication History

ABSTRACT

Interest in extracting information from biomedical documents has increased significantly in recent years but has always been challenged by the ambiguity of natural language. An important source of ambiguity is the usage of polysemous words: words with multiple meanings. Word sense disambiguation algorithms attempt to solve this problem by finding the correct meaning of a polysemous word in a given context, but very few algorithms were designed to disambiguate biomedical text. In this study we propose a word sense disambiguation algorithm focused on biomedical text. The proposed algorithm does not need to be trained and uses a relatively small knowledge base.

References

  1. Ide, Nancy, and Jean Véronis.: Introduction to the special issue on word sense disambiguation: the state of the art. Computational linguistics 24.1, 1998, pp. 2--40. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Navigli, Roberto.: Word sense disambiguation: A survey. ACM Computing Surveys (CSUR) 41.2, 2009 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Kilgarri, Adam.: Senseval: An exercise in evaluating word sense disambiguation programs. Proc. of the First International Conference on Language Resources and Evaluation, 1998Google ScholarGoogle Scholar
  4. Edmonds, Philip, and Scott Cotton.: senseval-2: Overview. Proceedings of the 2nd International Workshop on Evaluating Word Sense Disambiguation Systems. Toulouse, France, 2001, pp. 1--6 Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Mihalcea, Rada, and Ehsanul Faruque.: Senselearner: Minimally supervised word sense disambiguation for all words in open text. In Proceedings of ACL/SIGLEX Senseval, vol. 3, 2004, pp. 155--158Google ScholarGoogle Scholar
  6. Agirre, Eneko, Oier Lopez de Lacalle, Bernardo Magnini, Arantxa Otegi, German Rigau, and Piek Vossen.: SemEval-2007 task 01: evaluating WSD on cross-language information retrieval. Advances in Multilingual and Multimodal Information Retrieval, 2008, pp. 908--917 Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Schuemie, Martijn J., Jan A. Kors, and Barend Mons.: Word sense disambiguation in the biomedical domain: an overview. Journal of Computational Biology 12, no. 5, 2005, pp. 554--565.Google ScholarGoogle ScholarCross RefCross Ref
  8. Humphreys, Betsy L., Donald AB Lindberg, Harold M. Schoolman, and G. Octo Barnett.: The Unified Medical Language System An Informatics Research Collaboration. Journal of the American Medical Informatics Association 5, no. 1, 1998, pp. 1--11Google ScholarGoogle ScholarCross RefCross Ref
  9. Agirre, Eneko, Aitor Soroa, and Mark Stevenson.: Graph-based Word Sense Disambiguation of biomedical documents. Bioinformatics 26, no. 22, 2010, pp. 2889--2896 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Agirre, Eneko, and Aitor Soroa.: Personalizing pagerank for word sense disambiguation. In Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, 2009, pp. 33--41 Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Sinha, Ravi, and Rada Mihalcea.: Unsupervised graph-basedword sense disambiguation using measures of word semantic similarity, In Proceedings of the IEEE International Conference on Semantic Computing (ICSC), 2007, pp. 363--369 Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Navigli, Roberto, and Mirella Lapata.: Graph connectivity measures for unsupervised word sense disambiguation. In Proceedings of the 20th international joint conference on Artifical intelligence (2007) 1683--1688 Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Tsatsaronis, George, Michalis Vazirgiannis, and Ion Androutsopoulos.: Word sense disambiguation with spreading activation networks generated from thesauri. In Proceedings of the 20th international joint conference on Artifical intelligence, 2007, pp. 1725--1730 Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Antonio, Jimeno-Yepes, McInnes Bridget, and Aronson Alan.: Exploiting MeSH indexing in MEDLINE to generate a data set for word sense disambiguation. BMC Bioinformatics, 2011Google ScholarGoogle Scholar
  15. Stark, Michael M., and Richard F. Riesenfeld, Wordnet: An electronic lexical database. In Proceedings of 11th Eurographics Workshop on Rendering, 1998Google ScholarGoogle Scholar
  16. Brandes, Ulrik, A faster algorithm for betweenness centrality. Journal of Mathematical Sociology 25, no. 2, 2001, pp. 163--177Google ScholarGoogle ScholarCross RefCross Ref
  17. McInnes, Bridget T., Ted Pedersen, Ying Liu, Genevieve B. Melton, and Serguei V. Pakhomov, Knowledge-based Method for Determining the Meaning of Ambiguous Biomedical Terms Using Information Content Measures of Similarity." In AMIA Annual Symposium Proceedings, 2011, pp. 895Google ScholarGoogle Scholar

Index Terms

  1. Biomedical text disambiguation using UMLS

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ASONAM '13: Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
          August 2013
          1558 pages
          ISBN:9781450322409
          DOI:10.1145/2492517

          Copyright © 2013 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 25 August 2013

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate116of549submissions,21%

          Upcoming Conference

          KDD '24

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader