Unsupervised feature selection via unified trace ratio formulation and K-means clustering (TRACK)

De Wang, Feiping Nie, Heng Huang

科研成果: 书/报告/会议事项章节会议稿件同行评审

95 引用 (Scopus)

摘要

Feature selection plays a crucial role in scientific research and practical applications. In the real world applications, labeling data is time and labor consuming. Thus, unsupervised feature selection methods are desired for many practical applications. Linear discriminant analysis (LDA) with trace ratio criterion is a supervised dimensionality reduction method that has shown good performance to improve classifications. In this paper, we first propose a unified objective to seamlessly accommodate trace ratio formulation and K-means clustering procedure, such that the trace ratio criterion is extended to unsupervised model. After that, we propose a novel unsupervised feature selection method by integrating unsupervised trace ratio formulation and structured sparsity-inducing norms regularization. The proposed method can harness the discriminant power of trace ratio criterion, thus it tends to select discriminative features. Meanwhile, we also provide two important theorems to guarantee the unsupervised feature selection process. Empirical results on four benchmark data sets show that the proposed method outperforms other sate-of-the-art unsupervised feature selection algorithms in all three clustering evaluation metrics.

源语言英语
主期刊名Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2014, Proceedings
出版商Springer Verlag
306-321
页数16
版本PART 3
ISBN(印刷版)9783662448441
DOI
出版状态已出版 - 2014
已对外发布
活动European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2014 - Nancy, 法国
期限: 15 9月 201419 9月 2014

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
编号PART 3
8726 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2014
国家/地区法国
Nancy
时期15/09/1419/09/14

指纹

探究 'Unsupervised feature selection via unified trace ratio formulation and K-means clustering (TRACK)' 的科研主题。它们共同构成独一无二的指纹。

引用此