Semi-supervised feature selection based on label propagation and subset selection

Yun Liu, Feiping Nie, Jigang Wu, Lihui Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

27 Scopus citations

Abstract

In practice, the data to be handled are often high dimensional, and labeled data are often very limited while a large numbers of unlabeled data can be easily collected. Feature selection is an important method to deal with high dimensional data. In this paper, we propose a novel semi-supervised feature selection algorithm to select relevant features using both labeled and unlabeled data. Specifically, the algorithm explores the distribution of the labeled and unlabeled data with a special label propagation method to obtain the soft labels of unlabeled data, then an efficient algorithm to optimize the trace ratio criterion is used to directly select the optimal feature subset. Experimental results verify the effectiveness of the proposed algorithm, and show significant improvement over traditional supervised feature selection algorithms.

Original languageEnglish
Title of host publicationProceedings of ICCIA 2010 - 2010 International Conference on Computer and Information Application
Pages293-296
Number of pages4
DOIs
StatePublished - 2010
Externally publishedYes
Event2010 International Conference on Computer and Information Application, ICCIA 2010 - Tianjin, China
Duration: 2 Nov 20104 Nov 2010

Publication series

NameProceedings of ICCIA 2010 - 2010 International Conference on Computer and Information Application

Conference

Conference2010 International Conference on Computer and Information Application, ICCIA 2010
Country/TerritoryChina
CityTianjin
Period2/11/104/11/10

Fingerprint

Dive into the research topics of 'Semi-supervised feature selection based on label propagation and subset selection'. Together they form a unique fingerprint.

Cite this