ACM Home Page
Please provide us with feedback. Feedback
ME-based biomedical named entity recognition using lexical knowledge
Full text PdfPdf (317 KB)
Source ACM Transactions on Asian Language Information Processing (TALIP) archive
Volume 5 ,  Issue 1  (March 2006) table of contents
Pages: 4 - 21  
Year of Publication: 2006
ISSN:1530-0226
Authors
Kyung-Mi Park  Korea University, Seoul, Korea
Seon-Ho Kim  Korea University, Seoul, Korea
Hae-Chang Rim  Korea University, Seoul, Korea
Young-Sook Hwang  Advanced Telecommunications Research Institute (ATR), Kyoto, Japan
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 61,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1131348.1131350
What is a DOI?

ABSTRACT

In this paper, we present a two-phase biomedical NE-recognition method based on a ME model: we first recognize biomedical terms and then assign appropriate semantic classes to the recognized terms. In the two-phase NE-recognition method, the performance of the term-recognition phase is very important, because the semantic classification is performed on the region identified at the recognition phase. In this study, in order to improve the performance of term recognition, we try to incorporate lexical knowledge into pre- and postprocessing of the term-recognition phase. In the preprocessing step, we use domain-salient words as lexical knowledge obtained by corpus comparison. In the postprocessing step, we utilize χ2-based collocations gained from Medline corpus. In addition, we use morphological patterns extracted from the training data as features for learning the ME-based classifiers. Experimental results show that the performance of NE-recognition can be improved by utilizing such lexical knowledge.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
Chen, S. and Rosenfeld, R. 1999. A Gaussian prior for smoothing maximum entropy models. Technical Report CMUCS-99-108, Carnegie Mellon University.
 
4
Finkel, J., Dingare, S., Nguyen, H., Nissim, M., Sinclair, G., and Manning, C. 2004. Exploiting context for biomedical entity recognition: from syntax to the web. Proceedings of the Joint Workshop on Natural Language Processing in Biomedicine and its Applications (JNLPBA-2004). 88--91.
 
5
Kim, J. D. and Tsujii, J. 2004. Word folding: taking the snapshot of words instead of the whole. In Proceedings of the 1st International Joint Conference on Natural Language Processing (IJCNLP-2004).
 
6
Kim, J. D., Ohta, T., Tsuruoka, Y., Tateisi, Y., and Collier, N. 2004. Introduction to the bio-entity recognition task at JNLPBA. In Proceedings of the Joint Workshop on Natural Language Processing in Biomedicine and its Applications (JNLPBA-2004). 70--75.
 
7
 
8
 
9
Settles, B. 2004. Biomedical named entity recognition using conditional random fields and novel feature sets. Proceedings of the Joint Workshop on Natural Language Processing in Biomedicine and its Applications (JNLPBA-2004). 104--107.
 
10
Song, Y., Kim, E. J., Lee, G. B., and Yi, B. K. 2004. Biomedical named entity recognition using conditional random fields and novel feature sets. Proceedings of the Joint Workshop on Natural Language Processing in Biomedicine and its Applications (JNLPBA-2004). 104--107.
 
11
Zhou, G. and Su, J. 2004. Exploring deep knowledge resources in biomedical name recognition. Proceedings of the Joint Workshop on Natural Language Processing in Biomedicine and its Applications (JNLPBA-2004). 96--99.
 
12

Collaborative Colleagues:
Kyung-Mi Park: colleagues
Seon-Ho Kim: colleagues
Hae-Chang Rim: colleagues
Young-Sook Hwang: colleagues