skip to main content
10.1145/1995412.1995432acmotherconferencesArticle/Chapter ViewAbstractPublication PagessisapConference Proceedingsconference-collections
demonstration

DBStrata: a system for density-based clustering and outlier detection based on stratification

Published:30 June 2011Publication History

ABSTRACT

Clustering is a widely used unsupervised data mining technique. In density-based clustering, a cluster is defined as a connected dense component and grows in the direction set by the density. In this paper we present a software system called DBStrata that implements the density-based clustering architecture together with several extensions able to boost the clustering performances and to efficiently identify outliers.

References

  1. Ester, M.; Kriegel, H. P.; Sander, J.; Xu, X. A density-based algorithm for discovering clusters in large spatial databases with noise. Proc. KDD. 1996. 226--231. Vol. 96.Google ScholarGoogle Scholar
  2. Jin, W.; Tung, A.; Han, J.; Wang, W. Ranking outliers using symmetric neighborhood relationship. Advances in Knowledge Discovery and Data Mining. Springer, 2006. p. 577--593. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Ankerst, M., Breunig, M. M., Kriegel, H. P. and J. Sander. Optics: Ordering points to identify the clustering structure. ACM SIGMOD Record, 28(2):60, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Sander, J.; Qin, X.; Lu, Z.; Niu, N.; Kovarsky, A. Automatic extraction of clusters from hierarchical clustering representations. Advances in Knowledge Discovery and Data Mining. Springer, 2003. p. 567. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Daszykowski, B. Walczak, D. L. Massart, Looking for natural patterns in analytical data. Part 2. Tracing local density with OPTICS, Journal of Chemical Information and Computer Sciences 42 (2002) 500--507.Google ScholarGoogle ScholarCross RefCross Ref
  6. Berkhin, P. A survey of clustering data mining techniques. Grouping Multidimensional Data. Springer, 2006. p. 25--71.Google ScholarGoogle ScholarCross RefCross Ref
  7. C. Cassisi, A. Ferro, R. Giugno, G. Pigola, A. Pulvirenti. Higher Space Embedding to Enhance Density-Based Clustering: Parameter Reduction and Outlier Detection. Submitted 2011.Google ScholarGoogle Scholar
  8. Bentley, J. Multidimensional binary search trees used for associative searching. Commun. ACM, 1975. 18 (9), pp. 509--517. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Hinneburg, D. A. Keim: An Efficient Approach to Clustering in Large Multimedia Databases with Noise. Proceedings of the 4 th International Conference on Knowledge Discovery and Data Mining, New York, 1998.Google ScholarGoogle Scholar

Index Terms

  1. DBStrata: a system for density-based clustering and outlier detection based on stratification

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader