ABSTRACT
Automatic taxonomy construction aims to build a categorization system without human efforts. Traditional textual pattern based methods extract hyponymy relation in raw texts. However, these methods usually yield low precision and recall. In this paper, we propose a method to automatically find diffusing attributes to a category from Wikipedia infoboxes. We use the diffusing attribute to diffuse a coarse-grained category into several fine-grained subcategories and generate a finer-grained taxonomy. Experiments show our method can find proper diffusing attributes to categories across various domains.
- M. A. Hearst. Automatic acquisition of hyponyms from large text corpora. In COLING 1992. Google ScholarDigital Library
- J. R. Quinlan. C4. 5: programs for machine learning. Elsevier, 2014.Google ScholarDigital Library
Index Terms
- Coarse to Fine: Diffusing Categories in Wikipedia
Recommendations
Testing the validity of Wikipedia categories for subject matter labelling of open-domain corpus data
The Wikipedia category system was designed to enable browsing and navigation of Wikipedia. It is also a useful resource for knowledge organisation and document indexing, especially using automatic approaches. However, it has received little attention as ...
Classifying image galleries into a taxonomy using metadata and wikipedia
NLDB'12: Proceedings of the 17th international conference on Applications of Natural Language Processing and Information SystemsThis paper presents a method for the hierarchical classification of image galleries into a taxonomy. The proposed method links textual gallery metadata to Wikipedia pages and categories. Entity extraction from metadata, entity ranking, and selection of ...
MENTA: inducing multilingual taxonomies from wikipedia
CIKM '10: Proceedings of the 19th ACM international conference on Information and knowledge managementIn recent years, a number of projects have turned to Wikipedia to establish large-scale taxonomies that describe orders of magnitude more entities than traditional manually built knowledge bases. So far, however, the multilingual nature of Wikipedia has ...
Comments