ACM Home Page
Please provide us with feedback. Feedback
Terminology-based knowledge mining for new knowledge discovery
Full text PdfPdf (1.36 MB)
Source ACM Transactions on Asian Language Information Processing (TALIP) archive
Volume 5 ,  Issue 1  (March 2006) table of contents
Pages: 74 - 88  
Year of Publication: 2006
ISSN:1530-0226
Authors
Hideki Mima  School of Engineering, University of Tokyo, Tokyo, Japan
Sophia Ananiadou  School of Informatics, University of Manchester, Manchester, UK
Katsumori Matsushima  School of Engineering, University of Tokyo, Tokyo, Japan
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 122,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1131348.1131354
What is a DOI?

ABSTRACT

In this article we present an integrated knowledge-mining system for the domain of biomedicine, in which automatic term recognition, term clustering, information retrieval, and visualization are combined. The primary objective of this system is to facilitate knowledge acquisition from documents and aid knowledge discovery through terminology-based similarity calculation and visualization of automatically structured knowledge. This system also supports the integration of different types of databases and simultaneous retrieval of different types of knowledge. In order to accelerate knowledge discovery, we also propose a visualization method for generating similarity-based knowledge maps. The method is based on real-time terminology-based knowledge clustering and categorization and allows users to observe real-time generated knowledge maps, graphically. Lastly, we discuss experiments using the GENIA corpus to assess the practicality and applicability of the system.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Ananiadou, S. and Nenadic, G. 2006. Automatic terminology management in biomedicine. In Text Mining for Biology and Biomedicine, S. Ananiadou and J. McNaught (eds), Artech House, Norwood, MA, Ch.4, 67--98.
 
2
 
3
Berners-Lee, T. 1998. The Semantic Web as a language of logic. Available at: www.w3.org/DesignIssues/Logic.html.
 
4
Brickle, D. and Guha, R. 2000. Resource description framework (RDF) schema specification 1.0, W3C Candidate Recommendation. Available at: http://www.w3.org/TR/rdf-schema.
 
5
 
6
Frantzi, K., Ananiadou, S., and Mima, H. 2000. Automatic recognition of multi-word terms. Int. J. Digital Libraries 3, 2, 117--132. Special issue.
 
7
Fukuda, K., Tsunoda, T., Tamura, A., and Takagi, T. 1998. Toward information extraction: Identifying protein names from biological papers. In Proceedings of the PSB-98 (Hawaii), 705--716.
 
8
Gaizauskas, R., Demetriou, G., and Humphreys, K. 2000. Term recognition and classification in biological science journal articles. In Proceedings of the Workshop on Computational Terminology for Medical and Biological Applications (NLP-2000, Patras, Greece), 37--44.
 
9
Gamper, J., Nejdl, W., and Wolpers, M. 1999. Combining ontologies and terminologies in information systems. In Proceedings of the 5th International Congress on Terminology and Knowledge Engineering, (Innsbruck, Austria), 152--168.
 
10
Genia Project. 2002. Genia project home page. www-tsujii.is.s.u-tokyo.ac.jp/GENIA/.
 
11
Hatzivassiloglou, V., Duboue, P., and Rzhetsky, A. 2001. Disambiguating proteins, genes, and RNA in text: A machine learning approach. Bioinformatics 17, 1, S97--S106.
 
12
Jacquemin, C. 2001. Spotting and Discovering Terms through NLP. MIT Press, Cambridge, MA, 378.
 
13
Krauthammer, M., Rzhetsky, A., Morozov, P., and Friedman, C. 2000. Using BLAST for identifying gene and protein names in journal articles. Gene 259, 245--252.
 
14
 
15
Medline (National Library of Medicine). 2002. http://www.ncbi.nlm.nih.gov/PubMed/.
 
16
 
17
Mima, H. and Ananiadou, S. 2001b. An application and evaluation of the C/NC-value approach for the automatic term recognition of multi-word units in Japanese. Int. J. Terminology 6/2, 175--194.
 
18
Nenadic, G., Ananiadou, S., and McNaught, J. 2004. Enhancing automatic term recognition through term variation, In Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004, Geneva, Switzerland).
 
19
 
20
Spasic, I., Ananiadou, S., McNaught, J., and Kumar, A. 2005b. Text mining and ontologies in biomedicine: Making sense of raw text. Briefings in Bioinformatics 6, 3, 239--251.
 
21
TinySVM. 2004. http://chasen.org/~taku/software/TinySVM/.
 
22
UMLS. 2004. http://www.nlm.nih.gov/research/umls/.
 
23
 
24
Visser, P. R. S., Jones, D. M., Bench-Capon, T. J. M., and Shave, M. J. R. 1997. An analysis of ontology mismatches---Heterogeneity versus interoperability. In Proceedings of the AAAI 1997 Spring Symposium on Ontological Engineering (Stanford University, Stanford, CA), 164--172.
 
25
Voutilainen, A. and Heikkila, J. 1993. An English constraint grammar (ENGCG), a surface-syntactic parser of English. In Creating and Using English Language Corpora, U. Fries et al. (eds.), Rodopi, Amsterdam, 189--199.

Collaborative Colleagues:
Hideki Mima: colleagues
Sophia Ananiadou: colleagues
Katsumori Matsushima: colleagues