TY - GEN
T1 - Unsupervised feature analysis with class margin optimization
AU - Wang, Sen
AU - Nie, Feiping
AU - Chang, Xiaojun
AU - Yao, Lina
AU - Li, Xue
AU - Sheng, Quan Z.
N1 - Publisher Copyright:
© Springer International Publishing Switzerland 2015.
PY - 2015
Y1 - 2015
N2 - Unsupervised feature selection has been attracting research attention in the machine learning and data mining communities for decades. In this paper, we propose an unsupervised feature selection method that seeks a feature coefficient matrix to select the most distinctive features. Specifically, the proposed algorithm integrates the Maximum Margin Criterion with a sparsity-based model in a joint framework, where the class margin and feature correlation are taken into account simultaneously. To maximize total data separability while keeping the within-class scatter minimized, we embed K-means into the framework to generate pseudo class label information in the unsupervised feature selection setting. Meanwhile, a sparsity-based model, the ℓ2,p-norm, is imposed on the regularization term to effectively discover the sparse structure of the feature coefficient matrix. In this way, noisy and irrelevant features are removed by ruling out those features whose corresponding coefficients are zero. To alleviate the local optimum problem caused by random initializations of K-means, an algorithm with guaranteed convergence and an updating strategy for the clustering indicator matrix is proposed to iteratively approach the optimal solution. Performance is extensively evaluated on six benchmark data sets. The comprehensive experimental results demonstrate that our method outperforms all other compared approaches.
AB - Unsupervised feature selection has been attracting research attention in the machine learning and data mining communities for decades. In this paper, we propose an unsupervised feature selection method that seeks a feature coefficient matrix to select the most distinctive features. Specifically, the proposed algorithm integrates the Maximum Margin Criterion with a sparsity-based model in a joint framework, where the class margin and feature correlation are taken into account simultaneously. To maximize total data separability while keeping the within-class scatter minimized, we embed K-means into the framework to generate pseudo class label information in the unsupervised feature selection setting. Meanwhile, a sparsity-based model, the ℓ2,p-norm, is imposed on the regularization term to effectively discover the sparse structure of the feature coefficient matrix. In this way, noisy and irrelevant features are removed by ruling out those features whose corresponding coefficients are zero. To alleviate the local optimum problem caused by random initializations of K-means, an algorithm with guaranteed convergence and an updating strategy for the clustering indicator matrix is proposed to iteratively approach the optimal solution. Performance is extensively evaluated on six benchmark data sets. The comprehensive experimental results demonstrate that our method outperforms all other compared approaches.
KW - Embedded K-means clustering
KW - Maximum margin criterion
KW - Sparse structure learning
KW - Unsupervised feature selection
UR - http://www.scopus.com/inward/record.url?scp=84984650296&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-23528-8_24
DO - 10.1007/978-3-319-23528-8_24
M3 - Conference contribution
AN - SCOPUS:84984650296
SN - 9783319235271
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 383
EP - 398
BT - Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2015, Proceedings
A2 - Appice, Annalisa
A2 - Rodrigues, Pedro Pereira
A2 - Costa, Vitor Santos
A2 - Soares, Carlos
A2 - Gama, João
A2 - Jorge, Alípio
PB - Springer Verlag
T2 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2015
Y2 - 7 September 2015 through 11 September 2015
ER -