ABSTRACT
Besides being difficult to scale between different domains and to handle knowledge fluctuations, the results of terms clustering presented by existing ontology engineering systems are far from desirable. In this paper, we propose a new version of ant-based method for clustering terms known as Tree-Traversing Ants (TTA). With the help of the Normalized Google Distance (NGD) and n° of Wikipedia (n°W) as measures for similarity and distance between terms, we attempt to achieve an adaptable clustering method that is highly scalable across domains. Initial experiments with two datasets show promising results and demonstrated several advantages that are not simultaneously present in standard ant-based and other conventional clustering methods.
- Steinbach, M., Karypis, G., Kumar, V.: A comparison of document clustering techniques. Technical Report 00--034, University of Minnesota (2000)Google Scholar
- Choi, B., Yao, Z.: Web page classification. In Chu, W., Lin, T., eds.: Foundations and Advances in Data Mining. Springer-Verlag (2005)Google Scholar
- Jain, A., Murty, M., Flynn, P.: Data clustering: A review. ACM Computing Survey 31(3) (1999) 264--323 Google ScholarDigital Library
- Wong, W., Liu, W., Bennamoun, M.: Progress and open problems in ontology engineering from text. Submitted to the Journal of Web Semantics (2006)Google Scholar
- Lagus, K., Honkela, T., Kaski, S., Kohonen, T.: Self-organizing maps of document collections: A new approach to interactive exploration. In: Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining. (1996)Google Scholar
- Handl, J., Knowles, J., Dorigo, M.: Ant-based clustering: A comparative study of its relative performance with respect to k-means, average link and 1d-som. Technical Report TR/IRIDIA/2003-24, Universite Libre de Bruxelles (2003)Google Scholar
- Faure, D., Nedellec, C.: A corpus-based conceptual clustering method for verb frames and ontology acquisition. In: Proceedings of the 1st International Conference on Language Resources and Evaluation (LREC), Granada, Spain (1998)Google Scholar
- Maedche, A., Volz, R.: The ontology extraction & maintenance framework: Text-to-onto. In: Proceedings of the IEEE International Conference on Data Mining, California, USA (2001)Google Scholar
- Shamsfard, M., Barforoush, A.: Learning ontologies from natural language texts. International Journal of Human-Computer Studies 60(1) (2004) Page 17--63 Google ScholarDigital Library
- Deneubourg, J., Goss, S., Franks, N., Sendova-Franks, A., Detrain, C., Chretien, L.: The dynamics of collective sorting: Robot-like ants and ant-like robots. In: Proceedings of the 1st International Conference on Simulation of Adaptive Behavior: From Animals to Animats, France (1991) Google ScholarDigital Library
- Gutowitz, H.: Complexity-seeking ants. In: Proceedings of the 3rd European Conference on Artificial Life. (1993)Google Scholar
- Lumer, E., Faieta, B.: Diversity and adaptation in populations of clustering ants. In: Proceedings of the 3rd International Conference on Simulation of Adaptive Behavior: From Animals to Animats 3. (1994) Google ScholarDigital Library
- Vizine, A., deCastro, L., Hruschka, E., Gudwin, R.: Towards improving clustering ants: An adaptive ant clustering algorithm. Informatica 29(2) (2005) 143--154Google Scholar
- Handl, J., Meyer, B.: Improved ant-based clustering and sorting. In: Proceedings of the 7th International Conference on Parallel Problem Solving from Nature. (2002) Google ScholarDigital Library
- Cilibrasi, R., Vitanyi, P.: Automatic meaning discovery using google. http://xxx.lanl.gov/abs/cs.CL/0412098 (2005)Google Scholar
- Ritter, H., Kohonen, T.: Self-organizing semantic maps. Biological Cybernetics 61(1) (1989) 241--254Google ScholarDigital Library
- Handl, J., Knowles, J., Dorigo, M.: Ant-based clustering and topographic mapping. Artificial Life 12(1) (2006) 35--61 Google ScholarDigital Library
- Berkhin, P.: Survey of clustering data mining techniques. Technical report;, Accrue Software (2002)Google Scholar
Index Terms
- Featureless similarities for terms clustering using tree-traversing ants
Recommendations
Tree-Traversing Ant Algorithm for term clustering based on featureless similarities
Many conventional methods for concepts formation in ontology learning have relied on the use of predefined templates and rules, and static resources such as WordNet. Such approaches are not scalable, difficult to port between different domains and ...
Agglomerative clustering using asymmetric similarities
MDAI'11: Proceedings of the 8th international conference on Modeling decisions for artificial intelligenceAlgorithms of agglomerative hierarchical clustering using asymmetric similarity measures are studied. Two different measures between two clusters are proposed, one of which generalizes the average linkage for symmetric similarity measures. Asymmetric ...
On cluster tree for nested and multi-density data clustering
Clustering is one of the important data mining tasks. Nested clusters or clusters of multi-density are very prevalent in data sets. In this paper, we develop a hierarchical clustering approach-a cluster tree to determine such cluster structure and ...
Comments