skip to main content
10.1145/1835804.1835877acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Flexible constrained spectral clustering

Published:25 July 2010Publication History

ABSTRACT

Constrained clustering has been well-studied for algorithms like K-means and hierarchical agglomerative clustering. However, how to encode constraints into spectral clustering remains a developing area. In this paper, we propose a flexible and generalized framework for constrained spectral clustering. In contrast to some previous efforts that implicitly encode Must-Link and Cannot-Link constraints by modifying the graph Laplacian or the resultant eigenspace, we present a more natural and principled formulation, which preserves the original graph Laplacian and explicitly encodes the constraints. Our method offers several practical advantages: it can encode the degree of belief (weight) in Must-Link and Cannot-Link constraints; it guarantees to lower-bound how well the given constraints are satisfied using a user-specified threshold; and it can be solved deterministically in polynomial time through generalized eigendecomposition. Furthermore, by inheriting the objective function from spectral clustering and explicitly encoding the constraints, much of the existing analysis of spectral clustering techniques is still valid. Consequently our work can be posed as a natural extension to unconstrained spectral clustering and be interpreted as finding the normalized min-cut of a labeled graph. We validate the effectiveness of our approach by empirical results on real-world data sets, with applications to constrained image segmentation and clustering benchmark data sets with both binary and degree-of-belief constraints.

Skip Supplemental Material Section

Supplemental Material

kdd2010_wang_fcsc_01.mov

mov

119.6 MB

References

  1. A. Asuncion and D. Newman. UCI machine learning repository, 2007.Google ScholarGoogle Scholar
  2. S. Basu, I. Davidson, and K. Wagstaff, editors. Constrained Clustering: Advances in Algorithms, Theory, and Applications. Chapman & Hall/CRC, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. T. Coleman, J. Saunderson, and A. Wirth. Spectral clustering with inconsistent advice. In ICML, pages 152--159, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. I. Davidson and S. S. Ravi. Intractability and clustering with constraints. In ICML, pages 201--208, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. T. De Bie, J. A. K. Suykens, and B. De Moor. Learning from general label constraints. In SSPR/SPR, pages 671--679, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  6. P. Drineas, A. M. Frieze, R. Kannan, S. Vempala, and V. Vinay. Clustering large graphs via the singular value decomposition. Machine Learning, 56(1--3):9--33, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. X. Ji and W. Xu. Document clustering with prior knowledge. In SIGIR, pages 405--412, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. D. Kamvar, D. Klein, and C. D. Manning. Spectral learning. In IJCAI, pages 561--566, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. H. Kuhn and A. Tucker. Nonlinear programming. ACM SIGMAP Bulletin, pages 6--18, 1982. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Z. Li, J. Liu, and X. Tang. Constrained clustering via spectral regularization. In CVPR, pages 421--428, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  11. Z. Lu and M. A. Carreira-Perpinan. Constrained spectral clustering through affinity propagation. In CVPR, 2008.Google ScholarGoogle Scholar
  12. D. Martin, C. Fowlkes, D. Tal, and J. Malik. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In Proc. 8th Int'l Conf. Computer Vision, volume 2, pages 416--423, July 2001.Google ScholarGoogle ScholarCross RefCross Ref
  13. A. Y. Ng, M. I. Jordan, and Y. Weiss. On spectral clustering: Analysis and an algorithm. In NIPS, pages 849--856, 2001.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Shi and J. Malik. Normalized cuts and image segmentation. IEEE Trans. Pattern Anal. Mach. Intell., 22(8):888--905, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. U. von Luxburg. A tutorial on spectral clustering. Statistics and Computing, 17(4):395--416, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. K. Wagstaff and C. Cardie. Clustering with instance-level constraints. In ICML, pages 1103--1110, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. F. Wang, C. H. Q. Ding, and T. Li. Integrated KL (K-means - Laplacian) clustering: A new clustering approach by combining attribute data and pairwise relations. In SDM, pages 38--48, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  18. S. White and P. Smyth. A spectral clustering approach to finding communities in graph. In SDM, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  19. Q. Xu, M. desJardins, and K. Wagstaff. Constrained spectral clustering under a local proximity structure assumption. In FLAIRS Conference, pages 866--867, 2005.Google ScholarGoogle Scholar
  20. S. X. Yu and J. Shi. Grouping with bias. In NIPS, pages 1327--1334, 2001.Google ScholarGoogle Scholar
  21. S. X. Yu and J. Shi. Segmentation given partial grouping constraints. IEEE Trans. Pattern Anal. Mach. Intell., 26(2):173--183, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Flexible constrained spectral clustering

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
      July 2010
      1240 pages
      ISBN:9781450300551
      DOI:10.1145/1835804

      Copyright © 2010 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 25 July 2010

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader