|
ABSTRACT
Principal components and canonical correlations are at the root of many exploratory data mining techniques and provide standard pre-processing tools in machine learning. Lately, probabilistic reformulations of these methods have been proposed (Roweis, 1998; Tipping & Bishop, 1999b; Bach & Jordan, 2005). They are based on a Gaussian density model and are therefore, like their non-probabilistic counterpart, very sensitive to atypical observations. In this paper, we introduce robust probabilistic principal component analysis and robust probabilistic canonical correlation analysis. Both are based on a Student-t density model. The resulting probabilistic reformulations are more suitable in practice as they handle outliers in a natural way. We compute maximum likelihood estimates of the parameters by means of the EM algorithm.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Archambeau, C. (2005). Probabilistic models in noisy environments and their application to a visual prosthesis for the blind. Doctoral dissertation, Université catholique de Louvain, Belgium.
|
| |
2
|
Bach, F. R., & Jordan, M. I. (2005). A probabilistic interpretation of canoncial correlation analysis (Technical Report 688). Department of Statistics, University of California, Berkeley.
|
| |
3
|
de la Torre, F., & Black, M. J. (2001). Robust principal component analysis for computer vision. Int. Conf. on Computer Vision (pp. 362--369).
|
| |
4
|
Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion). Journal of the Royal Statistical Society B, 39, 1--38.
|
| |
5
|
Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology, 24, 417--441.
|
| |
6
|
Hotelling, H. (1936). Relations between two sets of variates. Biometrika, 28, 321--377.
|
| |
7
|
Jolliffe, I. T. (1986). Principal component analysis. New York: Springer-Verlag.
|
| |
8
|
Liu, C., & Rubin, D. B. (1995). ML estimation of the t distribution using EM and its extensions, ECM and ECME. Statistica Sinica, 5, 19--39.
|
| |
9
|
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
Tipping, M. E., & Bishop, C. M. (1999b). Probabilistic principal component analysis. Journal of the Royal Statistical Society B, 61, 611--622.
|
| |
14
|
Verbeek, J., Roweis, S., & Vlassis, N. (2004). Nonlinear CCA and PCA by alignment of local models. Advances in Neural Information Processing Systems 16.
|
| |
15
|
Xu, L., & Yuille, A. L. (1995). Robust principal component analysis by self-organizing rules based on statistical physics approach. IEEE Transactions on Neural Networks, 6, 131--143.
|
|