| An investigation of computational and informational limits in Gaussian mixture clustering |
| Full text |
Pdf
(335 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 148
archive
Proceedings of the 23rd international conference on Machine learning
table of contents
Pittsburgh, Pennsylvania
Pages: 865 - 872
Year of Publication: 2006
ISBN:1-59593-383-2
|
|
Authors
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 1, Downloads (12 Months): 25, Citation Count: 0
|
|
|
ABSTRACT
We investigate under what conditions clustering by learning a mixture of spherical Gaussians is (a) computationally tractable; and (b) statistically possible. We show that using principal component projection greatly aids in recovering the clustering using EM; present empirical evidence that even using such a projection, there is still a large gap between the number of samples needed to recover the clustering using EM, and the number of samples needed without computational restrictions; and characterize the regime in which such a gap exists.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Achlioptas, D., & McSherry, F. (2005). On spectral learning of mixtures of distributions. 18th Annual Conference on Learning Theory (COLT).
|
 |
2
|
|
| |
3
|
Barkai, N., & Sompolinksy, H. (1994). Statistical mechanics of maximum-likelihood desnsity estimation. Physical Review E, 50, 1766--1769.
|
| |
4
|
|
| |
5
|
|
| |
6
|
Kannan, R., Salmasian, H., & Vempala, S. (2005). The spectral method for general mixture models. 18th Annual Conference on Learning Theory (COLT).
|
| |
7
|
|
| |
8
|
Redner, R. A., & Walker, H. F. (1984). Mixture densities, maximum likelihood and the EM algorithm. SIAM Review, 26, 195--239.
|
| |
9
|
|
| |
10
|
Watkins, T., & Nadal, J. (1994). Optimal unsupervised learning. J. Phys. A, 27, 1899--1915.
|
|