ACM Home Page
Please provide us with feedback. Feedback
Robust probabilistic projections
Full text PdfPdf (270 KB)
Source ACM International Conference Proceeding Series; Vol. 148 archive
Proceedings of the 23rd international conference on Machine learning table of contents
Pittsburgh, Pennsylvania
Pages: 33 - 40  
Year of Publication: 2006
ISBN:1-59593-383-2
Authors
Cédric Archambeau  Université catholique de Louvain, Belgium
Nicolas Delannay  Université catholique de Louvain, Belgium
Michel Verleysen  Université catholique de Louvain, Belgium
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 0,   Downloads (12 Months): 49,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1143844.1143849
What is a DOI?

ABSTRACT

Principal components and canonical correlations are at the root of many exploratory data mining techniques and provide standard pre-processing tools in machine learning. Lately, probabilistic reformulations of these methods have been proposed (Roweis, 1998; Tipping & Bishop, 1999b; Bach & Jordan, 2005). They are based on a Gaussian density model and are therefore, like their non-probabilistic counterpart, very sensitive to atypical observations. In this paper, we introduce robust probabilistic principal component analysis and robust probabilistic canonical correlation analysis. Both are based on a Student-t density model. The resulting probabilistic reformulations are more suitable in practice as they handle outliers in a natural way. We compute maximum likelihood estimates of the parameters by means of the EM algorithm.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Archambeau, C. (2005). Probabilistic models in noisy environments and their application to a visual prosthesis for the blind. Doctoral dissertation, Université catholique de Louvain, Belgium.
 
2
Bach, F. R., & Jordan, M. I. (2005). A probabilistic interpretation of canoncial correlation analysis (Technical Report 688). Department of Statistics, University of California, Berkeley.
 
3
de la Torre, F., & Black, M. J. (2001). Robust principal component analysis for computer vision. Int. Conf. on Computer Vision (pp. 362--369).
 
4
Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion). Journal of the Royal Statistical Society B, 39, 1--38.
 
5
Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology, 24, 417--441.
 
6
Hotelling, H. (1936). Relations between two sets of variates. Biometrika, 28, 321--377.
 
7
Jolliffe, I. T. (1986). Principal component analysis. New York: Springer-Verlag.
 
8
Liu, C., & Rubin, D. B. (1995). ML estimation of the t distribution using EM and its extensions, ECM and ECME. Statistica Sinica, 5, 19--39.
 
9
 
10
 
11
 
12
 
13
Tipping, M. E., & Bishop, C. M. (1999b). Probabilistic principal component analysis. Journal of the Royal Statistical Society B, 61, 611--622.
 
14
Verbeek, J., Roweis, S., & Vlassis, N. (2004). Nonlinear CCA and PCA by alignment of local models. Advances in Neural Information Processing Systems 16.
 
15
Xu, L., & Yuille, A. L. (1995). Robust principal component analysis by self-organizing rules based on statistical physics approach. IEEE Transactions on Neural Networks, 6, 131--143.


Collaborative Colleagues:
Cédric Archambeau: colleagues
Nicolas Delannay: colleagues
Michel Verleysen: colleagues