Robust feature selection via simultaneous sapped norm and sparse regularizer minimization

Gongmin Lan; Chenping Hou; Feiping Nie; Tingjin Luo; Dongyun Yi

doi:10.1016/j.neucom.2017.12.055

Robust feature selection via simultaneous sapped norm and sparse regularizer minimization

Gongmin Lan, Chenping Hou, Feiping Nie, Tingjin Luo, Dongyun Yi

School of Artificial Intelligence, OPtics and Electronics

National University of Defense Technology

Research output: Contribution to journal › Article › peer-review

29 Scopus citations

Abstract

High dimension is one of the key characters of big data. Feature selection, as a framework to identify a small subset of illustrative and discriminative features, has been proved as a basic solution in dealing with high-dimensional data. In previous literatures, ℓ_{2, p}-norm regularization was studied by many researches as an effective approach to select features across data sets with sparsity. However, ℓ_{2, p}-norm loss function is just robust to noise but not considering the influence of outliers. In this paper, we propose a new robust and efficient feature selection method with emphasizing Simultaneous Capped ℓ₂-norm loss and ℓ_{2, p}-norm regularizer Minimization (SCM). The capped ℓ₂-norm based loss function can effectively eliminate the influence of noise and outliers in regression and the ℓ_{2, p}-norm regularization is used to select features across data sets with joint sparsity. An efficient approach is then introduced with proved convergence. Extensive experimental studies on synthetic and real-world datasets demonstrate the effectiveness of our method in comparison with other popular feature selection methods.

Original language	English
Pages (from-to)	228-240
Number of pages	13
Journal	Neurocomputing
Volume	283
DOIs	https://doi.org/10.1016/j.neucom.2017.12.055
State	Published - 29 Mar 2018

Keywords

Capped ℓ-norm loss
Feature selection
ℓ-norm regularization

Access to Document

10.1016/j.neucom.2017.12.055

Cite this

@article{2cd47266de2f42ff82a349551dcaedf4,

title = "Robust feature selection via simultaneous sapped norm and sparse regularizer minimization",

abstract = "High dimension is one of the key characters of big data. Feature selection, as a framework to identify a small subset of illustrative and discriminative features, has been proved as a basic solution in dealing with high-dimensional data. In previous literatures, ℓ2, p-norm regularization was studied by many researches as an effective approach to select features across data sets with sparsity. However, ℓ2, p-norm loss function is just robust to noise but not considering the influence of outliers. In this paper, we propose a new robust and efficient feature selection method with emphasizing Simultaneous Capped ℓ2-norm loss and ℓ2, p-norm regularizer Minimization (SCM). The capped ℓ2-norm based loss function can effectively eliminate the influence of noise and outliers in regression and the ℓ2, p-norm regularization is used to select features across data sets with joint sparsity. An efficient approach is then introduced with proved convergence. Extensive experimental studies on synthetic and real-world datasets demonstrate the effectiveness of our method in comparison with other popular feature selection methods.",

keywords = "Capped ℓ-norm loss, Feature selection, ℓ-norm regularization",

author = "Gongmin Lan and Chenping Hou and Feiping Nie and Tingjin Luo and Dongyun Yi",

note = "Publisher Copyright: {\textcopyright} 2017 Elsevier B.V.",

year = "2018",

month = mar,

day = "29",

doi = "10.1016/j.neucom.2017.12.055",

language = "英语",

volume = "283",

pages = "228--240",

journal = "Neurocomputing",

issn = "0925-2312",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Robust feature selection via simultaneous sapped norm and sparse regularizer minimization

AU - Lan, Gongmin

AU - Hou, Chenping

AU - Nie, Feiping

AU - Luo, Tingjin

AU - Yi, Dongyun

PY - 2018/3/29

Y1 - 2018/3/29

N2 - High dimension is one of the key characters of big data. Feature selection, as a framework to identify a small subset of illustrative and discriminative features, has been proved as a basic solution in dealing with high-dimensional data. In previous literatures, ℓ2, p-norm regularization was studied by many researches as an effective approach to select features across data sets with sparsity. However, ℓ2, p-norm loss function is just robust to noise but not considering the influence of outliers. In this paper, we propose a new robust and efficient feature selection method with emphasizing Simultaneous Capped ℓ2-norm loss and ℓ2, p-norm regularizer Minimization (SCM). The capped ℓ2-norm based loss function can effectively eliminate the influence of noise and outliers in regression and the ℓ2, p-norm regularization is used to select features across data sets with joint sparsity. An efficient approach is then introduced with proved convergence. Extensive experimental studies on synthetic and real-world datasets demonstrate the effectiveness of our method in comparison with other popular feature selection methods.

AB - High dimension is one of the key characters of big data. Feature selection, as a framework to identify a small subset of illustrative and discriminative features, has been proved as a basic solution in dealing with high-dimensional data. In previous literatures, ℓ2, p-norm regularization was studied by many researches as an effective approach to select features across data sets with sparsity. However, ℓ2, p-norm loss function is just robust to noise but not considering the influence of outliers. In this paper, we propose a new robust and efficient feature selection method with emphasizing Simultaneous Capped ℓ2-norm loss and ℓ2, p-norm regularizer Minimization (SCM). The capped ℓ2-norm based loss function can effectively eliminate the influence of noise and outliers in regression and the ℓ2, p-norm regularization is used to select features across data sets with joint sparsity. An efficient approach is then introduced with proved convergence. Extensive experimental studies on synthetic and real-world datasets demonstrate the effectiveness of our method in comparison with other popular feature selection methods.

KW - Capped ℓ-norm loss

KW - Feature selection

KW - ℓ-norm regularization

UR - http://www.scopus.com/inward/record.url?scp=85040593768&partnerID=8YFLogxK

U2 - 10.1016/j.neucom.2017.12.055

DO - 10.1016/j.neucom.2017.12.055

M3 - 文章

AN - SCOPUS:85040593768

SN - 0925-2312

VL - 283

SP - 228

EP - 240

JO - Neurocomputing

JF - Neurocomputing

ER -

Robust feature selection via simultaneous sapped norm and sparse regularizer minimization

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this