TY - JOUR
T1 - Discriminative and Robust Autoencoders for Unsupervised Feature Selection
AU - Ling, Yunzhi
AU - Nie, Feiping
AU - Yu, Weizhong
AU - Li, Xuelong
N1 - Publisher Copyright:
© 2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Many recent works on unsupervised feature selection (UFS) have focused on exploiting autoencoders (AEs) to seek informative features. However, existing methods typically employ the squared error to measure data reconstruction, which amplifies the negative effect of outliers and can lead to performance degradation. Moreover, traditional AEs aim to extract latent features that capture intrinsic information of the data for accurate data recovery. Without explicit cluster structure-detecting objectives in the training criterion, AEs fail to capture the latent cluster structure of the data, which is essential for identifying discriminative features; thus, the selected features lack strong discriminative power. To address these issues, we propose to jointly perform robust feature selection and k-means clustering in a unified framework. Concretely, we exploit an AE with an l2,1-norm as the basic model for seeking informative features. To improve robustness against outliers, we introduce an adaptive weight vector for the data reconstruction terms of the AE, which assigns smaller weights to data with larger errors to automatically reduce the influence of outliers, and larger weights to data with smaller errors to strengthen the influence of clean data. To enhance the discriminative power of the selected features, we incorporate k-means clustering into the representation learning of the AE, which allows the AE to continually explore cluster structure information and thereby discover more discriminative features. We also present an efficient approach to solve the resulting objective. Extensive experiments on various benchmark datasets clearly demonstrate that the proposed method outperforms state-of-the-art methods.
KW - Autoencoders (AEs)
KW - clustering
KW - feature selection
KW - neural networks
KW - robustness
KW - unsupervised learning
UR - http://www.scopus.com/inward/record.url?scp=85180317726&partnerID=8YFLogxK
U2 - 10.1109/TNNLS.2023.3333737
DO - 10.1109/TNNLS.2023.3333737
M3 - Article
AN - SCOPUS:85180317726
SN - 2162-237X
VL - 36
SP - 1622
EP - 1636
JO - IEEE Transactions on Neural Networks and Learning Systems
JF - IEEE Transactions on Neural Networks and Learning Systems
IS - 1
ER -
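
Note: the following is a minimal, hypothetical sketch (not the authors' released code) of the objective described in the abstract: a reconstruction term weighted by an adaptive per-sample weight vector that down-weights outliers, an l2,1-norm penalty on the encoder's input-side weights for feature selection, and a k-means-style clustering term on the latent codes. The names RobustSelectAE and adaptive_weights, the reciprocal reweighting rule, and the trade-off parameters alpha and beta are illustrative assumptions, not the paper's exact formulation.

```python
# Hypothetical sketch of a robust, feature-selecting autoencoder loss
# combining (i) adaptively weighted reconstruction, (ii) an l2,1-norm
# penalty on the first-layer weights, and (iii) a k-means term on the
# latent codes. Details (reweighting rule, hyperparameters) are assumed.
import torch
import torch.nn as nn


class RobustSelectAE(nn.Module):
    def __init__(self, d_in, d_hidden):
        super().__init__()
        self.encoder = nn.Linear(d_in, d_hidden)
        self.decoder = nn.Linear(d_hidden, d_in)

    def forward(self, x):
        z = torch.sigmoid(self.encoder(x))
        return z, self.decoder(z)


def l21_norm(W):
    # Sum of row-wise l2 norms; with rows indexing input features,
    # rows driven to zero mark features that can be discarded.
    return W.norm(dim=1).sum()


def adaptive_weights(errors, eps=1e-8):
    # Larger reconstruction error -> smaller weight (outlier suppression).
    # A reciprocal-style reweighting is one common choice; the paper's
    # exact update rule may differ. detach() keeps the weights fixed
    # during backpropagation, as in an alternating-optimization scheme.
    w = 1.0 / (errors.detach() + eps)
    return w / w.sum()


def loss_fn(model, x, centroids, assign, alpha=1e-3, beta=1e-2):
    z, x_hat = model(x)
    per_sample_err = (x_hat - x).norm(dim=1)              # ||x_i - x_hat_i||_2
    s = adaptive_weights(per_sample_err)                   # adaptive weight vector
    recon = (s * per_sample_err).sum()                     # robust reconstruction
    sparsity = alpha * l21_norm(model.encoder.weight.t())  # l2,1 feature selection
    cluster = beta * ((z - centroids[assign]) ** 2).sum()  # k-means term on codes
    return recon + sparsity + cluster
```

In this sketch the per-sample weights are recomputed from the current reconstruction errors but treated as constants by the optimizer, so clean samples dominate the gradient while outliers are progressively suppressed; the cluster centroids and assignments would be refreshed by a standard k-means step between gradient updates.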