Robust principal component analysis: A factorization-based approach with linear complexity

Chong Peng, Yongyong Chen, Zhao Kang, Chenglizhao Chen, Qiang Cheng

Research output: Contribution to journalArticlepeer-review

68 Scopus citations

Abstract

Low-rankness has been widely observed in real world data and there is often a need to recover low-rank matrices in many machine learning and data mining problems. Robust principal component analysis (RPCA) has been used for such problems by separating the data into a low-rank and a sparse part. The convex approach to RPCA has been well studied due to its elegant properties in theory and many extensions have been developed. However, the state-of-the-art algorithms for the convex approach and their extensions are usually expensive in complexity due to the need for solving singular value decomposition (SVD) of large matrices. In this paper, we propose a novel RPCA model based on matrix tri-factorization, which only needs the computation of SVDs for very small matrices. Thus, this approach reduces the complexity of RPCA to be linear and makes it fully scalable. It also overcomes the drawback of the state-of-the-art scalable approach such as AltProj, which requires the precise knowledge of the true rank of the low-rank component. As a result, our method is about 4 times faster than AltProj. Our method can be used as a light-weight, scalable tool for RPCA in the absence of the precise value of the true rank.

Original languageEnglish
Pages (from-to)581-599
Number of pages19
JournalInformation Sciences
Volume513
DOIs
StatePublished - Mar 2020

Bibliographical note

Publisher Copyright:
© 2019 Elsevier Inc.

Funding

This work is supported by National Natural Science Foundation of China (NSFC) under grants 61806106, 61802215, and 61806045, Shandong Provincial Natural Science Foundation, China under grants ZR2019QF009, ZR2019BF028, and ZR2019BF011; Q.C. is partially supported by NIH UH3 NS100606-03 and a grant from the University of Kentucky. This work is supported by National Natural Science Foundation of China ( NSFC ) under grants 61806106 , 61802215 , and 61806045 , Shandong Provincial Natural Science Foundation , China under grants ZR2019QF009 , ZR2019BF028 , and ZR2019BF011 ; Q.C. is partially supported by NIH UH3 NS100606-03 and a grant from the University of Kentucky.

FundersFunder number
NIH UH3 NS100606-03UH3 NS100606-03
Shandong Provincial Natural Science Foundation , ChinaZR2019BF011, ZR2019QF009, ZR2019BF028
Shandong Provincial Natural Science Foundation, China
University of Kentucky
National Natural Science Foundation of China (NSFC)61806045, 61806106, 61802215

    Keywords

    • Factorization
    • Linear complexity
    • Robust principal component analysis

    ASJC Scopus subject areas

    • Software
    • Information Systems and Management
    • Artificial Intelligence
    • Theoretical Computer Science
    • Control and Systems Engineering
    • Computer Science Applications

    Fingerprint

    Dive into the research topics of 'Robust principal component analysis: A factorization-based approach with linear complexity'. Together they form a unique fingerprint.

    Cite this