Sparse PCA via ℓ2,p-Norm Regularization for Unsupervised Feature Selection

Zhengxin Li, Feiping Nie, Jintang Bian, Danyang Wu, Xuelong Li

Research output: Contribution to journal › Article › Peer-reviewed

50 Citations (Scopus)

Abstract

In data mining, handling high-dimensional data is unavoidable. Because it does not rely on labels, unsupervised feature selection has attracted considerable attention. The performance of spectral-based unsupervised methods depends on the quality of the constructed similarity matrix, which is used to depict the intrinsic structure of the data. However, real-world data often contain many noisy features, so a similarity matrix constructed from the original data cannot be completely reliable. Worse still, the size of the similarity matrix grows rapidly as the number of samples rises, which significantly increases the computational cost. To address these problems, a simple and efficient unsupervised model is proposed to perform feature selection. We formulate PCA as a reconstruction error minimization problem and incorporate an ℓ2,p-norm regularization term to make the projection matrix sparse. The learned row-sparse and orthogonal projection matrix is used to select discriminative features. We then present an efficient optimization algorithm to solve the proposed unsupervised model, and analyse the convergence and computational complexity of the algorithm theoretically. Finally, experiments on both synthetic and real-world data sets demonstrate the effectiveness of the proposed method.
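
The abstract does not state the objective explicitly, so the following is only a minimal sketch of one plausible reading: a reconstruction error for a projection matrix W (one row per feature), a penalty that sums the p-th powers of the row norms of W, and feature ranking by row norm. The function names, the toy matrices, and the exact form of the penalty are assumptions made for illustration; the paper's optimization algorithm is not reproduced here.

```python
import math

def row_norms(W):
    """l2 norm of each row of W (one row per original feature)."""
    return [math.sqrt(sum(v * v for v in row)) for row in W]

def l2p_penalty(W, p):
    """Sum of row norms raised to the power p (assumed form of the l2,p term)."""
    return sum(r ** p for r in row_norms(W))

def reconstruction_error(X, W):
    """Sum over samples x of ||x - W W^T x||^2; X is assumed to be centered."""
    d, m = len(W), len(W[0])
    total = 0.0
    for x in X:
        # Project onto the m-dimensional subspace, then map back to d dimensions.
        z = [sum(W[i][j] * x[i] for i in range(d)) for j in range(m)]
        x_hat = [sum(W[i][j] * z[j] for j in range(m)) for i in range(d)]
        total += sum((a - b) ** 2 for a, b in zip(x, x_hat))
    return total

def select_features(W, k):
    """Indices of the k features whose rows in W have the largest l2 norm."""
    norms = row_norms(W)
    return sorted(range(len(W)), key=lambda i: -norms[i])[:k]

# Hypothetical row-sparse W with orthonormal columns: only features 0 and 2 are used.
W = [[1.0, 0.0], [0.0, 0.0], [0.0, 1.0], [0.0, 0.0]]
# Toy samples that live entirely in features 0 and 2.
X = [[2.0, 0.0, -1.0, 0.0], [0.5, 0.0, 3.0, 0.0]]
```

With this toy W, rows 1 and 3 are entirely zero, so the corresponding features contribute nothing to the projection and are ranked last; that is the sense in which a row-sparse projection matrix performs feature selection.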

Original language: English
Pages (from-to): 5322-5328
Number of pages: 7
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume: 45
Issue number: 4
DOI
Publication status: Published - 1 Apr 2023
