TY - JOUR
T1 - Fast Extended Inductive Robust Principal Component Analysis With Optimal Mean
AU - Yi, Shuangyan
AU - Nie, Feiping
AU - Liang, Yongsheng
AU - Liu, Wei
AU - He, Zhenyu
AU - Liao, Qingmin
N1 - Publisher Copyright:
© 1989-2012 IEEE.
PY - 2022/10/1
Y1 - 2022/10/1
N2 - Inspired by the mean calculation of RPCA_OM and inductiveness of IRPCA, we first propose an inductive robust principal component analysis method with removing the optimal mean automatically, which is shorted as IRPCA_OM. Furthermore, IRPCA_OM is extended to Schatten-p norm and a more general framework (i.e., EIRPCA_OM) is presented. The objective function of EIRPCA_OM includes two terms, the first term is a robust reconstruction error term constrained by an ℓ2,1-norm and the second term is a regularization term constrained by a Schatten-p norm. The proposed EIRPCA_OM method is robust, inductive and accurate. However, on the high-dimensional data, it would spend a large computation cost in training stage. To this end, a fast version of EIRPCA_OM called as FEIRPCA_OM is proposed, and its basic idea is to eliminate the zero eigenvalues of data matrix. More importantly, an effective theoretical proof is presented to ensure that FEIRPCA_OM has faster processing speed than EIRPCA_OM when processing high-dimensional data, but without any performance loss. Based on it, we also can exchange the less performance loss for the higher computation efficiency by removing the small eigenvalues of data matrix. Experimental results on the public datasets demonstrate that FEIRPCA_OM works efficiently on the high-dimensional data.
AB - Inspired by the mean calculation of RPCA_OM and inductiveness of IRPCA, we first propose an inductive robust principal component analysis method with removing the optimal mean automatically, which is shorted as IRPCA_OM. Furthermore, IRPCA_OM is extended to Schatten-p norm and a more general framework (i.e., EIRPCA_OM) is presented. The objective function of EIRPCA_OM includes two terms, the first term is a robust reconstruction error term constrained by an ℓ2,1-norm and the second term is a regularization term constrained by a Schatten-p norm. The proposed EIRPCA_OM method is robust, inductive and accurate. However, on the high-dimensional data, it would spend a large computation cost in training stage. To this end, a fast version of EIRPCA_OM called as FEIRPCA_OM is proposed, and its basic idea is to eliminate the zero eigenvalues of data matrix. More importantly, an effective theoretical proof is presented to ensure that FEIRPCA_OM has faster processing speed than EIRPCA_OM when processing high-dimensional data, but without any performance loss. Based on it, we also can exchange the less performance loss for the higher computation efficiency by removing the small eigenvalues of data matrix. Experimental results on the public datasets demonstrate that FEIRPCA_OM works efficiently on the high-dimensional data.
KW - Principal component analysis (PCA)
KW - fast computation
KW - inductiveness
KW - robustness
UR - http://www.scopus.com/inward/record.url?scp=85098776133&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2020.3047405
DO - 10.1109/TKDE.2020.3047405
M3 - 文章
AN - SCOPUS:85098776133
SN - 1041-4347
VL - 34
SP - 4812
EP - 4825
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 10
ER -