skip to main content
10.1145/1143844.1143892acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

Practical solutions to the problem of diagonal dominance in kernel document clustering

Published:25 June 2006Publication History

ABSTRACT

In supervised kernel methods, it has been observed that the performance of the SVM classifier is poor in cases where the diagonal entries of the Gram matrix are large relative to the off-diagonal entries. This problem, referred to as diagonal dominance, often occurs when certain kernel functions are applied to sparse high-dimensional data, such as text corpora. In this paper we investigate the implications of diagonal dominance for unsupervised kernel methods, specifically in the task of document clustering. We propose a selection of strategies for addressing this issue, and evaluate their effectiveness in producing more accurate and stable clusterings.

References

  1. Cancedda, N., Gaussier, E., Goutte, C., & Renders, J. M. (2003). Word sequence kernels. J. Mach. Learn. Res., 3, 1059--1082. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Dhillon, I., Guan, Y., & Kulis, B. (2004). A unified view of kernel k-means, spectral clustering and graph cuts (Technical Report TR-04-25). UTCS.Google ScholarGoogle Scholar
  3. Dhillon, I. S., Guan, Y., & Kogan, J. (2002). Iterative clustering of high dimensional text data augmented by local search. Proceedings of The 2002 IEEE International Conference on Data Mining. Maebashi City, Japan. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Greene, D., & Cunningham, P. (2005). Producing accurate interpretable clusters from high-dimensional data (Technical Report TCD-CS-2005-42). Department of Computer Science, Trinity College Dublin.Google ScholarGoogle Scholar
  5. Roth, V., Braun, M., Lange, T., & Buhmann, J. (2002). A resampling approach to cluster validation. Proceedings of the 15th Symposium in Computational Statistics.Google ScholarGoogle ScholarCross RefCross Ref
  6. Saigo, H., Vert, J.-P., Ueda, N., & Akutsu, T. (2004). Protein homology detection using string alignment kernels. Bioinformatics, 20, 1682--1689. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Schölkopf, B., Smola, A., & Müüller, K.-R. (1998). Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput., 10, 1299--1319. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Schölkopf, B., & Smola, A. J. (2001). Learning with kernels: Support vector machines, regularization, optimization, and beyond. Cambridge, MA, USA: MIT Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Schölkopf, B., Weston, J., Eskin, E., Leslie, C., & Noble, W. S. (2002). A kernel approach for learning from almost orthogonal patterns. ECML '02: Proceedings of the 13th European Conference on Machine Learning (pp. 511--528). London, UK: Springer-Verlag. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Smola, A. J., & Bartlett, P. J. (Eds.). (2000). Advances in large margin classifiers. Cambridge, MA, USA: MIT Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Strehl, A., & Ghosh, J. (2002). Cluster ensembles - a knowledge reuse framework for combining multiple partitions. JMLR, 3, 583--617. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Tao, Q., Scott, S., Vinodchandran, N. V., Osugi, T. T., & Mueller, B. (2004). An extended kernel for generalized multiple-instance learning. 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2004) (pp. 272--277). Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Wu, G., Chang, E., & Zhang, Z. (2005). An analysis of transformation on non-positive semidefinite similarity matrix for kernel machines (Technical Report). UCSB.Google ScholarGoogle Scholar

Index Terms

  1. Practical solutions to the problem of diagonal dominance in kernel document clustering

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICML '06: Proceedings of the 23rd international conference on Machine learning
      June 2006
      1154 pages
      ISBN:1595933832
      DOI:10.1145/1143844

      Copyright © 2006 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 25 June 2006

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      ICML '06 Paper Acceptance Rate140of548submissions,26%Overall Acceptance Rate140of548submissions,26%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader