Abstract
Matrix factorization is often used for data representation in many data mining and machine-learning problems. In particular, for a dataset without any negative entries, nonnegative matrix factorization (NMF) is often used to find a low-rank approximation by the product of two nonnegative matrices. With reduced dimensions, these matrices can be effectively used for many applications such as clustering. The existing methods of NMF are often afflicted with their sensitivity to outliers and noise in the data. To mitigate this drawback, in this paper, we consider integrating NMF into a robust principal component model, and design a robust formulation that effectively captures noise and outliers in the approximation while incorporating essential nonlinear structures. A set of comprehensive empirical evaluations in clustering applications demonstrates that the proposed method has strong robustness to gross errors and superior performance to current state-of-the-art methods.
| Original language | English |
|---|---|
| Article number | 33 |
| Journal | ACM Transactions on Knowledge Discovery from Data |
| Volume | 11 |
| Issue number | 3 |
| DOIs | |
| State | Published - Mar 2017 |
Bibliographical note
Publisher Copyright:© 2017 ACM.
Funding
This work is supported by the National Science Foundation, under grant IIS-1218712, National Natural Science Foundation of China, under grant 11241005, and Shanxi Scholarship Council of China, under grant 2015-093.
| Funders | Funder number |
|---|---|
| National Science Foundation (NSF) | 1218712, IIS-1218712 |
| National Natural Science Foundation of China (NSFC) | 11241005 |
| Shanxi Scholarship Council of China | 2015-093 |
Keywords
- Clustering
- Manifold
- Nonnegative factorization
- Robust principal component analysis
ASJC Scopus subject areas
- General Computer Science