ABSTRACT
Cluster tracing algorithms are used to mine temporal evolutions of clusters. Generally, clusters represent groups of objects with similar values. In a temporal context like tracing, similar values correspond to similar behavior in one snapshot in time. Recently, tracing based on object-value-similarity was introduced. In this new paradigm, the decision whether two clusters are considered similar is based on the similarity of the clusters' object values. Existing approaches of this paradigm, however, have a severe limitation. The mapping of clusters between snapshots in time is performed pairwise, i.e. global connections between a temporal snapshot's clusters are ignored; thus, impacts of other clusters that may affect the mapping are not considered and incorrect cluster tracings may be obtained.
In this vision paper, we present our ongoing work on a novel approach for cluster tracing that applies the object-value-similarity paradigm and is based on the well-known Earth Mover's Distance (EMD). The EMD enables a cluster tracing that uses global mapping: in the mapping process, all clusters of compared snapshots are considered simultaneously. A special property of our approach is that we nest the EMD: we use it as a ground distance for itself to achieve most effective value-based cluster tracing.
- M. Böttcher, F. Höppner, and M. Spiliopoulou. On exploiting the power of time in data mining. SIGKDD Explorations, 10(2):3--11, 2008. Google ScholarDigital Library
- S. Günnemann, H. Kremer, C. Laufkötter, and T. Seidl. Tracing evolving clusters by subspace and value similarity. In PAKDD (2), pages 444--456, 2011. Google ScholarDigital Library
- S. Günnemann, H. Kremer, C. Laufkötter, and T. Seidl. Tracing evolving subspace clusters in temporal climate data. DMKD, 24(2):387--410, 2012. Google ScholarDigital Library
- F. S. Hillier and G. J. Lieberman. Introduction to Operations Research. McGraw-Hill, 2001.Google ScholarDigital Library
- P. Kalnis, N. Mamoulis, and S. Bakiras. On discovering moving clusters in spatio-temporal data. In SSTD, pages 364--381, 2005. Google ScholarDigital Library
- J. Rosswog and K. Ghose. Detecting and tracking spatio-temporal clusters with adaptive history filtering. In ICDM Workshops, pages 448--457, 2008. Google ScholarDigital Library
- Y. Rubner, C. Tomasi, and L. J. Guibas. The earth mover's distance as a metric for image retrieval. Int. J. Comput. Vision, 40:99--121, November 2000. Google ScholarDigital Library
- M. Spiliopoulou, I. Ntoutsi, Y. Theodoridis, and R. Schult. MONIC: modeling and monitoring cluster transitions. In KDD, pages 706--711, 2006. Google ScholarDigital Library
Index Terms
- Nesting the earth mover's distance for effective cluster tracing
Recommendations
Detecting Climate Change in Multivariate Time Series Data by Novel Clustering and Cluster Tracing Techniques
ICDMW '10: Proceedings of the 2010 IEEE International Conference on Data Mining WorkshopsClimate change can be detected in several scientific domains including hydrology, meteorology, and oceanography. In this paper we describe our on-going work for detecting change in multivariate time series data from these domains. For the detection, we ...
Discovering cluster evolution patterns with the Cluster Association-aware matrix factorization
AbstractTracking of document collections over time (or across domains) is helpful in several applications such as finding dynamics of terminologies, identifying emerging and evolving trends, and concept drift detection. We propose a novel ‘Cluster ...
Inter cluster distance management model with optimal centroid estimation for K-means clustering algorithm
Clustering techniques are used to group up the transactions based on the relevancy. Cluster analysis is one of the primary data analysis method. The clustering process can be done in two ways such that Hierarchical clusters and partition clustering. ...
Comments