Robust feature selection via simultaneous sapped norm and sparse regularizer minimization

Gongmin Lan; Chenping Hou; Feiping Nie; Tingjin Luo; Dongyun Yi

doi:10.1016/j.neucom.2017.12.055

Institute of Optoelectronics and Intelligence

National University of Defense Technology

Research output: Contribution to journal › Article › peer-review

29 Citations (Scopus)

Abstract

High dimensionality is one of the key characteristics of big data. Feature selection, a framework for identifying a small subset of illustrative and discriminative features, has proved to be a basic solution for dealing with high-dimensional data. In the previous literature, ℓ_{2,p}-norm regularization has been studied by many researchers as an effective approach to selecting features across data sets with sparsity. However, the ℓ_{2,p}-norm loss function is robust only to noise and does not account for the influence of outliers. In this paper, we propose a new robust and efficient feature selection method based on Simultaneous Capped ℓ₂-norm loss and ℓ_{2,p}-norm regularizer Minimization (SCM). The capped ℓ₂-norm based loss function can effectively eliminate the influence of noise and outliers in regression, and the ℓ_{2,p}-norm regularization is used to select features across data sets with joint sparsity. An efficient approach with proven convergence is then introduced. Extensive experimental studies on synthetic and real-world datasets demonstrate the effectiveness of our method in comparison with other popular feature selection methods.
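The two quantities named in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: the function names, the threshold `eps`, the weight `gamma`, and the exact way the two terms are combined are assumptions, chosen to match the usual definitions of a capped ℓ₂-norm loss (each sample's residual norm is clipped at a threshold, so an outlier contributes at most `eps`) and an ℓ_{2,p}-norm regularizer (row norms of the weight matrix raised to the power `p` and summed, which drives whole rows, that is whole features, to zero).

```python
import math


def row_norms(matrix):
    """Euclidean (l2) norm of each row of a list-of-lists matrix."""
    return [math.sqrt(sum(v * v for v in row)) for row in matrix]


def capped_l2_loss(residuals, eps):
    """Sum of per-sample residual norms, each capped at eps (assumed form)."""
    return sum(min(n, eps) for n in row_norms(residuals))


def l2p_regularizer(weights, p):
    """||W||_{2,p}^p: sum over rows of (row l2 norm) ** p, with p > 0."""
    return sum(n ** p for n in row_norms(weights))


def objective(residuals, weights, eps, gamma, p):
    """Capped loss plus gamma times the regularizer (hypothetical combination)."""
    return capped_l2_loss(residuals, eps) + gamma * l2p_regularizer(weights, p)
```

With `eps = 2.0`, a residual row of norm 5 contributes only 2 to the loss, whereas an uncapped loss would let that one sample dominate. A zero row in `weights` contributes nothing to the regularizer, which is why minimizing it favors dropping features entirely.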

Original language	English
Pages (from-to)	228-240
Number of pages	13
Journal	Neurocomputing
Volume	283
DOI	https://doi.org/10.1016/j.neucom.2017.12.055
Publication status	Published - 29 Mar 2018

Cite this

@article{2cd47266de2f42ff82a349551dcaedf4,

title = "Robust feature selection via simultaneous sapped norm and sparse regularizer minimization",

abstract = "High dimensionality is one of the key characteristics of big data. Feature selection, a framework for identifying a small subset of illustrative and discriminative features, has proved to be a basic solution for dealing with high-dimensional data. In the previous literature, $\ell_{2,p}$-norm regularization has been studied by many researchers as an effective approach to selecting features across data sets with sparsity. However, the $\ell_{2,p}$-norm loss function is robust only to noise and does not account for the influence of outliers. In this paper, we propose a new robust and efficient feature selection method based on Simultaneous Capped $\ell_2$-norm loss and $\ell_{2,p}$-norm regularizer Minimization (SCM). The capped $\ell_2$-norm based loss function can effectively eliminate the influence of noise and outliers in regression, and the $\ell_{2,p}$-norm regularization is used to select features across data sets with joint sparsity. An efficient approach with proven convergence is then introduced. Extensive experimental studies on synthetic and real-world datasets demonstrate the effectiveness of our method in comparison with other popular feature selection methods.",

keywords = "Capped $\ell_2$-norm loss, Feature selection, $\ell_{2,p}$-norm regularization",

author = "Gongmin Lan and Chenping Hou and Feiping Nie and Tingjin Luo and Dongyun Yi",

note = "Publisher Copyright: {\textcopyright} 2017 Elsevier B.V.",

year = "2018",

month = mar,

day = "29",

doi = "10.1016/j.neucom.2017.12.055",

language = "English",

volume = "283",

pages = "228--240",

journal = "Neurocomputing",

issn = "0925-2312",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Robust feature selection via simultaneous sapped norm and sparse regularizer minimization

AU - Lan, Gongmin

AU - Hou, Chenping

AU - Nie, Feiping

AU - Luo, Tingjin

AU - Yi, Dongyun

PY - 2018/3/29

Y1 - 2018/3/29

N2 - High dimensionality is one of the key characteristics of big data. Feature selection, a framework for identifying a small subset of illustrative and discriminative features, has proved to be a basic solution for dealing with high-dimensional data. In the previous literature, ℓ_{2,p}-norm regularization has been studied by many researchers as an effective approach to selecting features across data sets with sparsity. However, the ℓ_{2,p}-norm loss function is robust only to noise and does not account for the influence of outliers. In this paper, we propose a new robust and efficient feature selection method based on Simultaneous Capped ℓ₂-norm loss and ℓ_{2,p}-norm regularizer Minimization (SCM). The capped ℓ₂-norm based loss function can effectively eliminate the influence of noise and outliers in regression, and the ℓ_{2,p}-norm regularization is used to select features across data sets with joint sparsity. An efficient approach with proven convergence is then introduced. Extensive experimental studies on synthetic and real-world datasets demonstrate the effectiveness of our method in comparison with other popular feature selection methods.

KW - Capped ℓ₂-norm loss

KW - Feature selection

KW - ℓ_{2,p}-norm regularization

UR - http://www.scopus.com/inward/record.url?scp=85040593768&partnerID=8YFLogxK

U2 - 10.1016/j.neucom.2017.12.055

DO - 10.1016/j.neucom.2017.12.055

M3 - Article

AN - SCOPUS:85040593768

SN - 0925-2312

VL - 283

SP - 228

EP - 240

JO - Neurocomputing

JF - Neurocomputing

ER -
