Self-Weighted Euler k-Means Clustering

Haonan Xin; Yihang Lu; Haoliang Tang; Rong Wang; Feiping Nie

doi:10.1109/LSP.2023.3305909

School of Artificial Intelligence, Optics and Electronics (光电与智能研究院)

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › Peer-reviewed

5 citations (Scopus)

Abstract

Clustering is widely used in signal processing tasks, where k-means is popular among researchers for its efficiency and simplicity. Nevertheless, it fails on non-spherical clusters, which are a common data distribution. As a variant of k-means, kernel k-means uses the kernel trick to map the raw data into a feature space that describes the data better and improves clustering performance. But these algorithms still have shortcomings in signal processing applications: 1) Not all features carry useful information, so feature dimensions should not all be treated equally; 2) Clustering on high-dimensional features greatly increases computational complexity with negligible improvement in clustering performance. To solve these problems, we propose a self-weighted Euler k-means (SWEKM) model, which adaptively identifies the importance of different features and integrates clustering and feature selection into a joint framework. Moreover, SWEKM adopts the Euler kernel, which suppresses the interference of noisy points and outliers at comparable computational complexity. Extensive experiments on datasets from the UCI database show that SWEKM outperforms state-of-the-art kernel k-means methods on clustering-based signal processing tasks.
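
The abstract does not give the model's equations, so the following is only an illustrative sketch and not the authors' SWEKM formulation. It assumes the explicit Euler feature map commonly used in the Euler-kernel literature, z = exp(i*alpha*pi*x) / sqrt(2), applied to features scaled to [0, 1]. The names euler_map and weighted_euler_sqdist, the parameter alpha, and the weight vector w are hypothetical and stand in for learned feature weights.

```python
import numpy as np

def euler_map(X, alpha=1.9):
    # Assumed explicit Euler feature map: each real feature x becomes
    # the complex number exp(i * alpha * pi * x) / sqrt(2).
    X = np.asarray(X, dtype=float)
    return np.exp(1j * alpha * np.pi * X) / np.sqrt(2.0)

def weighted_euler_sqdist(x, y, w, alpha=1.9):
    # Feature-weighted squared distance in the mapped space.
    # Per feature, |z_x - z_y|^2 = 1 - cos(alpha * pi * (x - y)),
    # which is bounded by 2, so a single outlying feature value
    # cannot dominate the distance.
    zx, zy = euler_map(x, alpha), euler_map(y, alpha)
    w = np.asarray(w, dtype=float)  # hypothetical per-feature weights
    return float(np.sum(w * np.abs(zx - zy) ** 2))
```

Because the map is explicit and keeps the original dimensionality, a k-means loop can run directly on the mapped data without forming an n-by-n kernel matrix. This is one plausible reading of the "comparable computational complexity" claim.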

Original language	English
Pages (from-to)	1127-1131
Number of pages	5
Journal	IEEE Signal Processing Letters
Volume	30
DOI	https://doi.org/10.1109/LSP.2023.3305909
Publication status	Published - 2023


Cite this

@article{25c01f5c512f424291c0e02899edde8d,

title = "Self-Weighted Euler k-Means Clustering",

abstract = "Clustering is used widely in various kinds of signal processing tasks, in which k-means is warmly welcomed by the researchers due to its efficiency and simplicity. Nevertheless, it fails to process non-spherical clusters which are common data distribution. As a variant of k-means, kernel k-means uses a kernel trick to map the raw data into a feature space to better describe the data with improved clustering performance. But the algorithms still have a lot of shortcomings in the application of signal processing: 1) Not all features contain a wealth of useful information, so all dimensions of features cannot be treated equally; 2) The use of high dimensional features for clustering exceedingly increases computational complexity with negligible improvement of clustering performance. To solve the problems, we propose a self-weighted Euler k-means (SWEKM) model, which can adaptively identify the importance of different features, perfectly integrating clustering and feature selection into a joint framework. Moreover, Euler kernel is adopted in SWEKM, which is capable of suppressing the interference of noisy points and outliers with comparable computational complexity. Extensive experiments on datasets from the UCI database show that the SWEKM outperforms the state-of-the-art kernel k-means for clustering-based signal processing tasks.",

keywords = "Euler kernel, feature selection, kernel clustering",

author = "Haonan Xin and Yihang Lu and Haoliang Tang and Rong Wang and Feiping Nie",

note = "Publisher Copyright: {\textcopyright} 1994-2012 IEEE.",

year = "2023",

doi = "10.1109/LSP.2023.3305909",

language = "English",

volume = "30",

pages = "1127--1131",

journal = "IEEE Signal Processing Letters",

issn = "1070-9908",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Self-Weighted Euler k-Means Clustering

AU - Xin, Haonan

AU - Lu, Yihang

AU - Tang, Haoliang

AU - Wang, Rong

AU - Nie, Feiping

PY - 2023

Y1 - 2023

N2 - Clustering is used widely in various kinds of signal processing tasks, in which k-means is warmly welcomed by the researchers due to its efficiency and simplicity. Nevertheless, it fails to process non-spherical clusters which are common data distribution. As a variant of k-means, kernel k-means uses a kernel trick to map the raw data into a feature space to better describe the data with improved clustering performance. But the algorithms still have a lot of shortcomings in the application of signal processing: 1) Not all features contain a wealth of useful information, so all dimensions of features cannot be treated equally; 2) The use of high dimensional features for clustering exceedingly increases computational complexity with negligible improvement of clustering performance. To solve the problems, we propose a self-weighted Euler k-means (SWEKM) model, which can adaptively identify the importance of different features, perfectly integrating clustering and feature selection into a joint framework. Moreover, Euler kernel is adopted in SWEKM, which is capable of suppressing the interference of noisy points and outliers with comparable computational complexity. Extensive experiments on datasets from the UCI database show that the SWEKM outperforms the state-of-the-art kernel k-means for clustering-based signal processing tasks.

AB - Clustering is used widely in various kinds of signal processing tasks, in which k-means is warmly welcomed by the researchers due to its efficiency and simplicity. Nevertheless, it fails to process non-spherical clusters which are common data distribution. As a variant of k-means, kernel k-means uses a kernel trick to map the raw data into a feature space to better describe the data with improved clustering performance. But the algorithms still have a lot of shortcomings in the application of signal processing: 1) Not all features contain a wealth of useful information, so all dimensions of features cannot be treated equally; 2) The use of high dimensional features for clustering exceedingly increases computational complexity with negligible improvement of clustering performance. To solve the problems, we propose a self-weighted Euler k-means (SWEKM) model, which can adaptively identify the importance of different features, perfectly integrating clustering and feature selection into a joint framework. Moreover, Euler kernel is adopted in SWEKM, which is capable of suppressing the interference of noisy points and outliers with comparable computational complexity. Extensive experiments on datasets from the UCI database show that the SWEKM outperforms the state-of-the-art kernel k-means for clustering-based signal processing tasks.

KW - Euler kernel

KW - feature selection

KW - kernel clustering

UR - http://www.scopus.com/inward/record.url?scp=85168679139&partnerID=8YFLogxK

U2 - 10.1109/LSP.2023.3305909

DO - 10.1109/LSP.2023.3305909

M3 - Article

AN - SCOPUS:85168679139

SN - 1070-9908

VL - 30

SP - 1127

EP - 1131

JO - IEEE Signal Processing Letters

JF - IEEE Signal Processing Letters

ER -
