Joint Structured Bipartite Graph and Row-Sparse Projection for Large-Scale Feature Selection

Xia Dong, Feiping Nie, Danyang Wu, Rong Wang, Xuelong Li

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Feature selection plays an important role in data analysis, yet traditional graph-based methods often produce suboptimal results. These methods typically follow a two-stage process: constructing a graph with data-to-data affinities or a bipartite graph with data-to-anchor affinities and independently selecting features based on their scores. In this article, a large-scale feature selection approach based on structured bipartite graph and row-sparse projection (RS2BLFS) is proposed to overcome this limitation. RS2BLFS integrates the construction of a structured bipartite graph consisting of c connected components into row-sparse projection learning with k nonzero rows. This integration allows for the joint selection of an optimal feature subset in an unsupervised manner. Notably, the c connected components of the structured bipartite graph correspond to c clusters, each with multiple subcluster centers. This feature makes RS2BLFS particularly effective for feature selection and clustering on nonspherical large-scale data. An algorithm with theoretical analysis is developed to solve the optimization problem involved in RS2BLFS. Experimental results on synthetic and real-world datasets confirm its effectiveness in feature selection tasks.

Original languageEnglish
Pages (from-to)6911-6924
Number of pages14
JournalIEEE Transactions on Neural Networks and Learning Systems
Volume36
Issue number4
DOIs
StatePublished - 2025

Keywords

  • Large-scale feature selection
  • multiple subcluster centers
  • row-sparse projection
  • structured bipartite graph
  • unsupervised learning

Cite this