TY - JOUR
T1 - An efficient framework for unsupervised feature selection
AU - Zhang, Han
AU - Zhang, Rui
AU - Nie, Feiping
AU - Li, Xuelong
N1 - Publisher Copyright:
© 2019
PY - 2019/11/13
Y1 - 2019/11/13
N2 - In these years, the task of fast unsupervised feature selection attracts much attentions with the increasing number of data collected from the physical world. To speed up the running time of algorithms, the bipartite graph theory has been applied in many large-scale tasks, including fast clustering, fast feature extraction, etc. Inspired by this, we present a novel bipartite graph based fast feature selection approach named Efficient Unsupervised Feature Selection (EUFS). Compared to the existing methods focusing on the same topic, EUFS is advanced in two aspects: (1) we learn a high-quality discrete indicator matrix for these unlabelled data by virtue of bipartite graph based spectral clustering, instead of obtaining an implicit cluster structure matrix; (2) we learn a row-sparse matrix for evaluating features via a generalized uncorrelated regression model supervised by the achieved indicator matrix, which succeeds in exploring the discriminative and uncorrelated features. Correspondingly, the features selected by our model could achieve an excellent clustering or classification performance while maintaining a low computational complexity. Experimentally, the results of EUFS compared to five state-of-the-art algorithms and one baseline on ten benchmark datasets verifies its efficiency and superiority.
AB - In these years, the task of fast unsupervised feature selection attracts much attentions with the increasing number of data collected from the physical world. To speed up the running time of algorithms, the bipartite graph theory has been applied in many large-scale tasks, including fast clustering, fast feature extraction, etc. Inspired by this, we present a novel bipartite graph based fast feature selection approach named Efficient Unsupervised Feature Selection (EUFS). Compared to the existing methods focusing on the same topic, EUFS is advanced in two aspects: (1) we learn a high-quality discrete indicator matrix for these unlabelled data by virtue of bipartite graph based spectral clustering, instead of obtaining an implicit cluster structure matrix; (2) we learn a row-sparse matrix for evaluating features via a generalized uncorrelated regression model supervised by the achieved indicator matrix, which succeeds in exploring the discriminative and uncorrelated features. Correspondingly, the features selected by our model could achieve an excellent clustering or classification performance while maintaining a low computational complexity. Experimentally, the results of EUFS compared to five state-of-the-art algorithms and one baseline on ten benchmark datasets verifies its efficiency and superiority.
KW - Bipartite graph
KW - Discrete indicator matrix
KW - Efficient unsupervised feature selection
KW - Uncorrelated regression model
UR - http://www.scopus.com/inward/record.url?scp=85070196973&partnerID=8YFLogxK
U2 - 10.1016/j.neucom.2019.07.020
DO - 10.1016/j.neucom.2019.07.020
M3 - 文章
AN - SCOPUS:85070196973
SN - 0925-2312
VL - 366
SP - 194
EP - 207
JO - Neurocomputing
JF - Neurocomputing
ER -