Joint Anchor Graph Embedding and Discrete Feature Scoring for Unsupervised Feature Selection

Zheng Wang; Dongming Wu; Rong Wang; Feiping Nie; Fei Wang

doi:10.1109/TNNLS.2022.3222466

Joint Anchor Graph Embedding and Discrete Feature Scoring for Unsupervised Feature Selection

Zheng Wang, Dongming Wu, Rong Wang, Feiping Nie, Fei Wang

光电与智能研究院

科研成果: 期刊稿件 › 文章 › 同行评审

23 引用（Scopus）

摘要

The success of existing unsupervised feature selection (UFS) methods heavily relies on the assumption that the intrinsic relationships among original high-dimensional (HD) data samples exist in the discriminative low-dimension (LD) subspace. However, previous UFS methods commonly construct pairwise graphs and employ ℓ_2,1-norm regularization to severally preserve the local structure and calculate the score of features, which is computationally complex and easy to get stuck into local optimum, so that those approaches cannot be applied in dealing with large-scale datasets in practice. To overcome this challenge, we propose a novel UFS method, in which a novel anchor graph embedding paradigm is designed to extract the local neighborhood relationships among data samples by reducing the computational complexity of graph construction to be linear in the number of data. Moreover, to improve the optimality of selected features as well as the performance of downstream tasks, we propose a discrete feature scoring mechanism, which imposes orthogonal ℓ_2,0-norm constraints on learned projections, in order to enhance the distinction of feature scores as well as reduce the probability of falling into local optimum. In addition, solving the proposed nonconvex and nonsmooth NP-hard problem is challenging, and we present an efficient optimization algorithm to address it and acquire a closed-form solution of the transformation matrix. Extensive experiments demonstrate the effectiveness and efficiency of the proposed UFS by comparison with several state-of-the-art approaches to clustering and image segmentation tasks.

源语言	英语
页（从-至）	7974-7987
页数	14
期刊	IEEE Transactions on Neural Networks and Learning Systems
卷	35
期	6
DOI	https://doi.org/10.1109/TNNLS.2022.3222466
出版状态	已出版 - 1 6月 2024

访问文件

10.1109/TNNLS.2022.3222466

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{fa159aff659d43c5910bbac10f6b33c5,

title = "Joint Anchor Graph Embedding and Discrete Feature Scoring for Unsupervised Feature Selection",

abstract = "The success of existing unsupervised feature selection (UFS) methods heavily relies on the assumption that the intrinsic relationships among original high-dimensional (HD) data samples exist in the discriminative low-dimension (LD) subspace. However, previous UFS methods commonly construct pairwise graphs and employ ℓ2,1-norm regularization to severally preserve the local structure and calculate the score of features, which is computationally complex and easy to get stuck into local optimum, so that those approaches cannot be applied in dealing with large-scale datasets in practice. To overcome this challenge, we propose a novel UFS method, in which a novel anchor graph embedding paradigm is designed to extract the local neighborhood relationships among data samples by reducing the computational complexity of graph construction to be linear in the number of data. Moreover, to improve the optimality of selected features as well as the performance of downstream tasks, we propose a discrete feature scoring mechanism, which imposes orthogonal ℓ2,0-norm constraints on learned projections, in order to enhance the distinction of feature scores as well as reduce the probability of falling into local optimum. In addition, solving the proposed nonconvex and nonsmooth NP-hard problem is challenging, and we present an efficient optimization algorithm to address it and acquire a closed-form solution of the transformation matrix. Extensive experiments demonstrate the effectiveness and efficiency of the proposed UFS by comparison with several state-of-the-art approaches to clustering and image segmentation tasks.",

keywords = "anchor graph embedding, image segmentation, nonconvex optimization, pattern clustering, unsupervised feature selection, ℓ-norm constraint",

author = "Zheng Wang and Dongming Wu and Rong Wang and Feiping Nie and Fei Wang",

note = "Publisher Copyright: {\textcopyright} 2012 IEEE.",

year = "2024",

month = jun,

day = "1",

doi = "10.1109/TNNLS.2022.3222466",

language = "英语",

volume = "35",

pages = "7974--7987",

journal = "IEEE Transactions on Neural Networks and Learning Systems",

issn = "2162-237X",

publisher = "IEEE Computational Intelligence Society",

number = "6",

}

TY - JOUR

T1 - Joint Anchor Graph Embedding and Discrete Feature Scoring for Unsupervised Feature Selection

AU - Wang, Zheng

AU - Wu, Dongming

AU - Wang, Rong

AU - Nie, Feiping

AU - Wang, Fei

PY - 2024/6/1

Y1 - 2024/6/1

N2 - The success of existing unsupervised feature selection (UFS) methods heavily relies on the assumption that the intrinsic relationships among original high-dimensional (HD) data samples exist in the discriminative low-dimension (LD) subspace. However, previous UFS methods commonly construct pairwise graphs and employ ℓ2,1-norm regularization to severally preserve the local structure and calculate the score of features, which is computationally complex and easy to get stuck into local optimum, so that those approaches cannot be applied in dealing with large-scale datasets in practice. To overcome this challenge, we propose a novel UFS method, in which a novel anchor graph embedding paradigm is designed to extract the local neighborhood relationships among data samples by reducing the computational complexity of graph construction to be linear in the number of data. Moreover, to improve the optimality of selected features as well as the performance of downstream tasks, we propose a discrete feature scoring mechanism, which imposes orthogonal ℓ2,0-norm constraints on learned projections, in order to enhance the distinction of feature scores as well as reduce the probability of falling into local optimum. In addition, solving the proposed nonconvex and nonsmooth NP-hard problem is challenging, and we present an efficient optimization algorithm to address it and acquire a closed-form solution of the transformation matrix. Extensive experiments demonstrate the effectiveness and efficiency of the proposed UFS by comparison with several state-of-the-art approaches to clustering and image segmentation tasks.

AB - The success of existing unsupervised feature selection (UFS) methods heavily relies on the assumption that the intrinsic relationships among original high-dimensional (HD) data samples exist in the discriminative low-dimension (LD) subspace. However, previous UFS methods commonly construct pairwise graphs and employ ℓ2,1-norm regularization to severally preserve the local structure and calculate the score of features, which is computationally complex and easy to get stuck into local optimum, so that those approaches cannot be applied in dealing with large-scale datasets in practice. To overcome this challenge, we propose a novel UFS method, in which a novel anchor graph embedding paradigm is designed to extract the local neighborhood relationships among data samples by reducing the computational complexity of graph construction to be linear in the number of data. Moreover, to improve the optimality of selected features as well as the performance of downstream tasks, we propose a discrete feature scoring mechanism, which imposes orthogonal ℓ2,0-norm constraints on learned projections, in order to enhance the distinction of feature scores as well as reduce the probability of falling into local optimum. In addition, solving the proposed nonconvex and nonsmooth NP-hard problem is challenging, and we present an efficient optimization algorithm to address it and acquire a closed-form solution of the transformation matrix. Extensive experiments demonstrate the effectiveness and efficiency of the proposed UFS by comparison with several state-of-the-art approaches to clustering and image segmentation tasks.

KW - anchor graph embedding

KW - image segmentation

KW - nonconvex optimization

KW - pattern clustering

KW - unsupervised feature selection

KW - ℓ-norm constraint

UR - http://www.scopus.com/inward/record.url?scp=85144030880&partnerID=8YFLogxK

U2 - 10.1109/TNNLS.2022.3222466

DO - 10.1109/TNNLS.2022.3222466

M3 - 文章

C2 - 36417731

AN - SCOPUS:85144030880

SN - 2162-237X

VL - 35

SP - 7974

EP - 7987

JO - IEEE Transactions on Neural Networks and Learning Systems

JF - IEEE Transactions on Neural Networks and Learning Systems

IS - 6

ER -

Joint Anchor Graph Embedding and Discrete Feature Scoring for Unsupervised Feature Selection

摘要

访问文件

其它文件与链接

指纹

引用此