TY - JOUR
T1 - Learning Feature-Sparse Principal Subspace
AU - Nie, Feiping
AU - Tian, Lai
AU - Wang, Rong
AU - Li, Xuelong
PY - 2023/4/1
Y1 - 2023/4/1
N2 - Principal subspace estimation is directly connected to dimension reduction and is important when there is more than one principal component of interest. In this article, we introduce two new algorithms to solve the feature-sparsity-constrained PCA problem (FSPCA) for the principal subspace estimation task, which performs feature selection and PCA simultaneously. Existing optimization methods for FSPCA require data distribution assumptions and lack a global convergence guarantee. Although the general FSPCA problem is NP-hard, we show that, for a low-rank covariance, FSPCA can be solved globally (Algorithm 1). We then propose another strategy (Algorithm 2) to solve FSPCA for a general covariance by iteratively building a carefully designed proxy. We prove (data-dependent) approximation bounds and regular stationary convergence guarantees for the new algorithms. For covariance spectra with exponential or Zipf distributions, we provide exponential and posynomial approximation bounds, respectively. Constructive examples and numerical results demonstrate the tightness of our results. Experimental results show the promising performance and efficiency of the new algorithms compared with state-of-the-art methods on both synthetic and real-world datasets.
AB - Principal subspace estimation is directly connected to dimension reduction and is important when there is more than one principal component of interest. In this article, we introduce two new algorithms to solve the feature-sparsity-constrained PCA problem (FSPCA) for the principal subspace estimation task, which performs feature selection and PCA simultaneously. Existing optimization methods for FSPCA require data distribution assumptions and lack a global convergence guarantee. Although the general FSPCA problem is NP-hard, we show that, for a low-rank covariance, FSPCA can be solved globally (Algorithm 1). We then propose another strategy (Algorithm 2) to solve FSPCA for a general covariance by iteratively building a carefully designed proxy. We prove (data-dependent) approximation bounds and regular stationary convergence guarantees for the new algorithms. For covariance spectra with exponential or Zipf distributions, we provide exponential and posynomial approximation bounds, respectively. Constructive examples and numerical results demonstrate the tightness of our results. Experimental results show the promising performance and efficiency of the new algorithms compared with state-of-the-art methods on both synthetic and real-world datasets.
KW - approximation algorithms
KW - feature selection
KW - nonconvex optimization
KW - nonsmooth optimization
KW - sparse PCA
UR - http://www.scopus.com/inward/record.url?scp=85141607586&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2022.3212646
DO - 10.1109/TPAMI.2022.3212646
M3 - Article
C2 - 36342996
AN - SCOPUS:85141607586
SN - 0162-8828
VL - 45
SP - 4858
EP - 4869
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 4
ER -