Principal Component Analysis with Fuzzy Elastic Net for Feature Selection

Yunlong Gao, Qinting Wu, Zhenghong Xu, Chao Cao, Jinyan Pan, Guifang Shao, Feiping Nie, Qingyuan Zhu

Research output: Contribution to journal, Article, peer-reviewed

4 citations (Scopus)

Abstract

Feature selection is a fundamental technique in machine learning and data analysis, playing a crucial role in extracting valuable features from large-scale, high-dimensional datasets that may contain irrelevant features. To enhance the performance of feature selection, regularizers such as the ℓ1-norm or ℓ2,1-norm are commonly used to encourage sparsity. Nonetheless, these traditional regularization techniques encounter certain challenges. When correlations exist among features, sparsity-driven regularization can unfairly shrink the weights of correlated features to zero, thus ignoring feature correlations and lacking group sparsity properties. While a straightforward combination of the ℓ1-norm and ℓ2,1-norm can uncover feature correlations, it lacks adaptability and cannot effectively balance sparsity and correlation. To address these challenges, we introduce a novel matrix-based regularization term, called a fuzzy elastic net, into the unsupervised feature selection model. Our model is founded on principal component analysis, a well-established dimensionality reduction technique adept at finding subspaces that retain most of the information in the raw data. The model is enhanced by a fuzzy elastic net, which promotes group or sparsity properties through adaptive parameter tuning. The new regularization term introduces a flexible fuzzy weighted scheme combining the ℓ2,2-norm and the ℓ2,p-norm (0 < p ≤ 1). This approach allows adaptive adjustment based on data characteristics, offering a tunable balance between selecting discriminative features and identifying correlated ones. Consequently, this regularization term equips the model to handle diverse data analysis tasks flexibly, thereby enhancing adaptability and generalization performance. Furthermore, we propose an efficient optimization strategy to solve this model. Extensive experiments conducted on UCI datasets and real-world datasets demonstrate the effectiveness and efficiency of our proposed method.
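To make the two norms named in the abstract concrete, the following is a minimal sketch in plain Python. It assumes the usual definition of the ℓ2,p-norm of a projection matrix W (one row per feature) as the p-th root of the sum of p-th powers of the row ℓ2-norms, so that ℓ2,2 is the Frobenius norm. The function `mixed_penalty` and its fixed weight `alpha` are hypothetical stand-ins for illustration only; the abstract does not specify how the fuzzy weights are defined or learned, and this is not the authors' formulation. Ranking features by row norm is a common convention in projection-based feature selection and is likewise an assumption here.

```python
import math


def row_norms(W):
    """Euclidean (l2) norm of each row of W, given as a list of lists."""
    return [math.sqrt(sum(x * x for x in row)) for row in W]


def l2p_norm(W, p):
    """l_{2,p} norm: (sum_i ||w_i||_2^p)^(1/p), defined here for p > 0."""
    if p <= 0:
        raise ValueError("p must be positive")
    return sum(n ** p for n in row_norms(W)) ** (1.0 / p)


def mixed_penalty(W, p, alpha):
    """Hypothetical blend: alpha * ||W||_{2,2}^2 + (1 - alpha) * ||W||_{2,p}^p.

    alpha is a fixed illustrative weight in [0, 1], not the paper's
    adaptive fuzzy weighting, which the abstract does not define.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    norms = row_norms(W)
    ridge_part = sum(n ** 2 for n in norms)   # encourages grouping, no sparsity
    sparse_part = sum(n ** p for n in norms)  # encourages row sparsity for p <= 1
    return alpha * ridge_part + (1.0 - alpha) * sparse_part


def rank_features(W):
    """Feature indices sorted by decreasing row norm (assumed scoring rule)."""
    norms = row_norms(W)
    return sorted(range(len(W)), key=lambda i: norms[i], reverse=True)


# Toy projection matrix: 3 features (rows) mapped to 2 components (columns).
W = [[3.0, 4.0], [0.0, 0.0], [1.0, 0.0]]
```

With this toy matrix, a row of zeros contributes nothing to either term, which is why row-sparse solutions correspond to discarded features, while the squared ℓ2,2 term penalizes large rows without forcing any of them to zero.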

Original language: English
Pages (from-to): 6878-6890
Number of pages: 13
Journal: IEEE Transactions on Fuzzy Systems
Volume: 32
Issue number: 12
DOI
Publication status: Published - 2024
