ABSTRACT
Interest in extracting information from biomedical documents has increased significantly in recent years but has always been challenged by the ambiguity of natural language. An important source of ambiguity is the usage of polysemous words: words with multiple meanings. Word sense disambiguation algorithms attempt to solve this problem by finding the correct meaning of a polysemous word in a given context, but very few algorithms were designed to disambiguate biomedical text. In this study we propose a word sense disambiguation algorithm focused on biomedical text. The proposed algorithm does not need to be trained and uses a relatively small knowledge base.
- Ide, Nancy, and Jean Véronis.: Introduction to the special issue on word sense disambiguation: the state of the art. Computational linguistics 24.1, 1998, pp. 2--40. Google ScholarDigital Library
- Navigli, Roberto.: Word sense disambiguation: A survey. ACM Computing Surveys (CSUR) 41.2, 2009 Google ScholarDigital Library
- Kilgarri, Adam.: Senseval: An exercise in evaluating word sense disambiguation programs. Proc. of the First International Conference on Language Resources and Evaluation, 1998Google Scholar
- Edmonds, Philip, and Scott Cotton.: senseval-2: Overview. Proceedings of the 2nd International Workshop on Evaluating Word Sense Disambiguation Systems. Toulouse, France, 2001, pp. 1--6 Google ScholarDigital Library
- Mihalcea, Rada, and Ehsanul Faruque.: Senselearner: Minimally supervised word sense disambiguation for all words in open text. In Proceedings of ACL/SIGLEX Senseval, vol. 3, 2004, pp. 155--158Google Scholar
- Agirre, Eneko, Oier Lopez de Lacalle, Bernardo Magnini, Arantxa Otegi, German Rigau, and Piek Vossen.: SemEval-2007 task 01: evaluating WSD on cross-language information retrieval. Advances in Multilingual and Multimodal Information Retrieval, 2008, pp. 908--917 Google ScholarDigital Library
- Schuemie, Martijn J., Jan A. Kors, and Barend Mons.: Word sense disambiguation in the biomedical domain: an overview. Journal of Computational Biology 12, no. 5, 2005, pp. 554--565.Google ScholarCross Ref
- Humphreys, Betsy L., Donald AB Lindberg, Harold M. Schoolman, and G. Octo Barnett.: The Unified Medical Language System An Informatics Research Collaboration. Journal of the American Medical Informatics Association 5, no. 1, 1998, pp. 1--11Google ScholarCross Ref
- Agirre, Eneko, Aitor Soroa, and Mark Stevenson.: Graph-based Word Sense Disambiguation of biomedical documents. Bioinformatics 26, no. 22, 2010, pp. 2889--2896 Google ScholarDigital Library
- Agirre, Eneko, and Aitor Soroa.: Personalizing pagerank for word sense disambiguation. In Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, 2009, pp. 33--41 Google ScholarDigital Library
- Sinha, Ravi, and Rada Mihalcea.: Unsupervised graph-basedword sense disambiguation using measures of word semantic similarity, In Proceedings of the IEEE International Conference on Semantic Computing (ICSC), 2007, pp. 363--369 Google ScholarDigital Library
- Navigli, Roberto, and Mirella Lapata.: Graph connectivity measures for unsupervised word sense disambiguation. In Proceedings of the 20th international joint conference on Artifical intelligence (2007) 1683--1688 Google ScholarDigital Library
- Tsatsaronis, George, Michalis Vazirgiannis, and Ion Androutsopoulos.: Word sense disambiguation with spreading activation networks generated from thesauri. In Proceedings of the 20th international joint conference on Artifical intelligence, 2007, pp. 1725--1730 Google ScholarDigital Library
- Antonio, Jimeno-Yepes, McInnes Bridget, and Aronson Alan.: Exploiting MeSH indexing in MEDLINE to generate a data set for word sense disambiguation. BMC Bioinformatics, 2011Google Scholar
- Stark, Michael M., and Richard F. Riesenfeld, Wordnet: An electronic lexical database. In Proceedings of 11th Eurographics Workshop on Rendering, 1998Google Scholar
- Brandes, Ulrik, A faster algorithm for betweenness centrality. Journal of Mathematical Sociology 25, no. 2, 2001, pp. 163--177Google ScholarCross Ref
- McInnes, Bridget T., Ted Pedersen, Ying Liu, Genevieve B. Melton, and Serguei V. Pakhomov, Knowledge-based Method for Determining the Meaning of Ambiguous Biomedical Terms Using Information Content Measures of Similarity." In AMIA Annual Symposium Proceedings, 2011, pp. 895Google Scholar
Index Terms
- Biomedical text disambiguation using UMLS
Recommendations
Disambiguation of ambiguous biomedical terms using examples generated from the UMLS Metathesaurus
Researchers have access to a vast amount of information stored in textual documents and there is a pressing need for the development of automated methods to enable and improve access to this resource. Lexical ambiguity, the phenomena in which a word or ...
Resolving ambiguity in biomedical text to improve summarization
Access to the vast body of research literature that is now available on biomedicine and related fields can be improved with automatic summarization. This paper describes a summarization system for the biomedical domain that represents documents as ...
Query Expansion for UMLS Metathesaurus Disambiguation Based on Automatic Corpus Extraction
ICMLA '10: Proceedings of the 2010 Ninth International Conference on Machine Learning and ApplicationsWord sense disambiguation (WSD) is an intermediate task within information retrieval and information extraction, which attempts selecting the proper sense of ambiguous terms. In the biomedical domain, general WSD has not received much attention compared ...
Comments