ACM Home Page
Please provide us with feedback. Feedback
An investigation of computational and informational limits in Gaussian mixture clustering
Full text PdfPdf (335 KB)
Source ACM International Conference Proceeding Series; Vol. 148 archive
Proceedings of the 23rd international conference on Machine learning table of contents
Pittsburgh, Pennsylvania
Pages: 865 - 872  
Year of Publication: 2006
ISBN:1-59593-383-2
Authors
Nathan Srebro  University of Toronto, Toronto, Ontario, Canada
Gregory Shakhnarovich  Brown University, Providence, Rhode Island
Sam Roweis  University of Toronto, Toronto, Ontario, Canada
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 1,   Downloads (12 Months): 25,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1143844.1143953
What is a DOI?

ABSTRACT

We investigate under what conditions clustering by learning a mixture of spherical Gaussians is (a) computationally tractable; and (b) statistically possible. We show that using principal component projection greatly aids in recovering the clustering using EM; present empirical evidence that even using such a projection, there is still a large gap between the number of samples needed to recover the clustering using EM, and the number of samples needed without computational restrictions; and characterize the regime in which such a gap exists.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Achlioptas, D., & McSherry, F. (2005). On spectral learning of mixtures of distributions. 18th Annual Conference on Learning Theory (COLT).
2
 
3
Barkai, N., & Sompolinksy, H. (1994). Statistical mechanics of maximum-likelihood desnsity estimation. Physical Review E, 50, 1766--1769.
 
4
 
5
 
6
Kannan, R., Salmasian, H., & Vempala, S. (2005). The spectral method for general mixture models. 18th Annual Conference on Learning Theory (COLT).
 
7
 
8
Redner, R. A., & Walker, H. F. (1984). Mixture densities, maximum likelihood and the EM algorithm. SIAM Review, 26, 195--239.
 
9
 
10
Watkins, T., & Nadal, J. (1994). Optimal unsupervised learning. J. Phys. A, 27, 1899--1915.

Collaborative Colleagues:
Nathan Srebro: colleagues
Gregory Shakhnarovich: colleagues
Sam Roweis: colleagues