Unsupervised Feature Selection With Weighted and Projected Adaptive Neighbors

Zhengxin Li; Feiping Nie; Danyang Wu; Zhanxuan Hu; Xuelong Li

doi:10.1109/TCYB.2021.3087632

Unsupervised Feature Selection With Weighted and Projected Adaptive Neighbors

Zhengxin Li, Feiping Nie, Danyang Wu, Zhanxuan Hu, Xuelong Li

School of Artificial Intelligence, OPtics and Electronics

Research output: Contribution to journal › Article › peer-review

41 Scopus citations

Abstract

In the field of data mining, how to deal with high-dimensional data is a fundamental problem. If they are used directly, it is not only computationally expensive but also difficult to obtain satisfactory results. Unsupervised feature selection is designed to reduce the dimension of data by finding a subset of features in the absence of labels. Many unsupervised methods perform feature selection by exploring spectral analysis and manifold learning, such that the intrinsic structure of data can be preserved. However, most of these methods ignore a fact: due to the existence of noise features, the intrinsic structure directly built from original data may be unreliable. To solve this problem, a new unsupervised feature selection model is proposed. The graph structure, feature weights, and projection matrix are learned simultaneously, such that the intrinsic structure is constructed by the data that have been feature weighted and projected. For each data point, its nearest neighbors are acquired in the process of graph construction. Therefore, we call them adaptive neighbors. Besides, an additional constraint is added to the proposed model. It requires that a graph, corresponding to a similarity matrix, should contain exactly c connected components. Then, we present an optimization algorithm to solve the proposed model. Next, we discuss the method of determining the regularization parameter γ in our proposed method and analyze the computational complexity of the optimization algorithm. Finally, experiments are implemented on both synthetic and real-world datasets to demonstrate the effectiveness of the proposed method.

Original language	English
Pages (from-to)	1260-1271
Number of pages	12
Journal	IEEE Transactions on Cybernetics
Volume	53
Issue number	2
DOIs	https://doi.org/10.1109/TCYB.2021.3087632
State	Published - 1 Feb 2023

Keywords

Adaptive neighbor
dimensionality reduction
projection
unsupervised feature selection
weighted features

Access to Document

10.1109/TCYB.2021.3087632

Cite this

@article{773e6907661b4fe89df2e68dc7222bf4,

title = "Unsupervised Feature Selection With Weighted and Projected Adaptive Neighbors",

abstract = "In the field of data mining, how to deal with high-dimensional data is a fundamental problem. If they are used directly, it is not only computationally expensive but also difficult to obtain satisfactory results. Unsupervised feature selection is designed to reduce the dimension of data by finding a subset of features in the absence of labels. Many unsupervised methods perform feature selection by exploring spectral analysis and manifold learning, such that the intrinsic structure of data can be preserved. However, most of these methods ignore a fact: due to the existence of noise features, the intrinsic structure directly built from original data may be unreliable. To solve this problem, a new unsupervised feature selection model is proposed. The graph structure, feature weights, and projection matrix are learned simultaneously, such that the intrinsic structure is constructed by the data that have been feature weighted and projected. For each data point, its nearest neighbors are acquired in the process of graph construction. Therefore, we call them adaptive neighbors. Besides, an additional constraint is added to the proposed model. It requires that a graph, corresponding to a similarity matrix, should contain exactly c connected components. Then, we present an optimization algorithm to solve the proposed model. Next, we discuss the method of determining the regularization parameter γ in our proposed method and analyze the computational complexity of the optimization algorithm. Finally, experiments are implemented on both synthetic and real-world datasets to demonstrate the effectiveness of the proposed method.",

keywords = "Adaptive neighbor, dimensionality reduction, projection, unsupervised feature selection, weighted features",

author = "Zhengxin Li and Feiping Nie and Danyang Wu and Zhanxuan Hu and Xuelong Li",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.",

year = "2023",

month = feb,

day = "1",

doi = "10.1109/TCYB.2021.3087632",

language = "英语",

volume = "53",

pages = "1260--1271",

journal = "IEEE Transactions on Cybernetics",

issn = "2168-2267",

publisher = "IEEE Advancing Technology for Humanity",

number = "2",

}

TY - JOUR

T1 - Unsupervised Feature Selection With Weighted and Projected Adaptive Neighbors

AU - Li, Zhengxin

AU - Nie, Feiping

AU - Wu, Danyang

AU - Hu, Zhanxuan

AU - Li, Xuelong

PY - 2023/2/1

Y1 - 2023/2/1

N2 - In the field of data mining, how to deal with high-dimensional data is a fundamental problem. If they are used directly, it is not only computationally expensive but also difficult to obtain satisfactory results. Unsupervised feature selection is designed to reduce the dimension of data by finding a subset of features in the absence of labels. Many unsupervised methods perform feature selection by exploring spectral analysis and manifold learning, such that the intrinsic structure of data can be preserved. However, most of these methods ignore a fact: due to the existence of noise features, the intrinsic structure directly built from original data may be unreliable. To solve this problem, a new unsupervised feature selection model is proposed. The graph structure, feature weights, and projection matrix are learned simultaneously, such that the intrinsic structure is constructed by the data that have been feature weighted and projected. For each data point, its nearest neighbors are acquired in the process of graph construction. Therefore, we call them adaptive neighbors. Besides, an additional constraint is added to the proposed model. It requires that a graph, corresponding to a similarity matrix, should contain exactly c connected components. Then, we present an optimization algorithm to solve the proposed model. Next, we discuss the method of determining the regularization parameter γ in our proposed method and analyze the computational complexity of the optimization algorithm. Finally, experiments are implemented on both synthetic and real-world datasets to demonstrate the effectiveness of the proposed method.

AB - In the field of data mining, how to deal with high-dimensional data is a fundamental problem. If they are used directly, it is not only computationally expensive but also difficult to obtain satisfactory results. Unsupervised feature selection is designed to reduce the dimension of data by finding a subset of features in the absence of labels. Many unsupervised methods perform feature selection by exploring spectral analysis and manifold learning, such that the intrinsic structure of data can be preserved. However, most of these methods ignore a fact: due to the existence of noise features, the intrinsic structure directly built from original data may be unreliable. To solve this problem, a new unsupervised feature selection model is proposed. The graph structure, feature weights, and projection matrix are learned simultaneously, such that the intrinsic structure is constructed by the data that have been feature weighted and projected. For each data point, its nearest neighbors are acquired in the process of graph construction. Therefore, we call them adaptive neighbors. Besides, an additional constraint is added to the proposed model. It requires that a graph, corresponding to a similarity matrix, should contain exactly c connected components. Then, we present an optimization algorithm to solve the proposed model. Next, we discuss the method of determining the regularization parameter γ in our proposed method and analyze the computational complexity of the optimization algorithm. Finally, experiments are implemented on both synthetic and real-world datasets to demonstrate the effectiveness of the proposed method.

KW - Adaptive neighbor

KW - dimensionality reduction

KW - projection

KW - unsupervised feature selection

KW - weighted features

UR - http://www.scopus.com/inward/record.url?scp=85112650516&partnerID=8YFLogxK

U2 - 10.1109/TCYB.2021.3087632

DO - 10.1109/TCYB.2021.3087632

M3 - 文章

C2 - 34343100

AN - SCOPUS:85112650516

SN - 2168-2267

VL - 53

SP - 1260

EP - 1271

JO - IEEE Transactions on Cybernetics

JF - IEEE Transactions on Cybernetics

IS - 2

ER -

Unsupervised Feature Selection With Weighted and Projected Adaptive Neighbors

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this