Sparse K-means clustering algorithm with anchor graph regularization

Xiaojun Yang; Weihao Zhao; Yuxiong Xu; Chang Dong Wang; Bin Li; Feiping Nie

doi:10.1016/j.ins.2024.120504

Sparse K-means clustering algorithm with anchor graph regularization

Xiaojun Yang, Weihao Zhao, Yuxiong Xu, Chang Dong Wang, Bin Li, Feiping Nie

光电与智能研究院

科研成果: 期刊稿件 › 文章 › 同行评审

12 引用（Scopus）

摘要

As a classical unsupervised learning method, the K-means algorithm selects the cluster centers randomly and calculates the mean values of the cluster's data points to generate clusters. However, its performance is susceptible to the initial cluster centers and the sparsity of the membership matrix. To overcome these limitations, in this paper, we propose a sparse K-means clustering algorithm with anchor graph regularization (SKM-AGR) for optimizing initial cluster center sensitivity and improving membership matrix sparsity. The main idea is to use the anchor graph regularization (AGR) constrained K-means models, which effectively learn the membership matrix of data points and the membership matrix of anchors. In particular, by constructing an anchor graph, the AGR term not only discovers the internal structure information of data, but also covers the data distribution. Furthermore, an alternating optimization algorithm with fast-converging is adopted to solve the optimization problems of SKM-AGR, and the computational complexity is analyzed. Extensive clustering experiments on several synthetic and benchmark datasets show that the proposed SKM-AGR method performs better than several previous methods in most cases.

源语言	英语
文章编号	120504
期刊	Information Sciences
卷	667
DOI	https://doi.org/10.1016/j.ins.2024.120504
出版状态	已出版 - 5月 2024

访问文件

10.1016/j.ins.2024.120504

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{3bb556e6af67431884671e8a60572c3a,

title = "Sparse K-means clustering algorithm with anchor graph regularization",

abstract = "As a classical unsupervised learning method, the K-means algorithm selects the cluster centers randomly and calculates the mean values of the cluster's data points to generate clusters. However, its performance is susceptible to the initial cluster centers and the sparsity of the membership matrix. To overcome these limitations, in this paper, we propose a sparse K-means clustering algorithm with anchor graph regularization (SKM-AGR) for optimizing initial cluster center sensitivity and improving membership matrix sparsity. The main idea is to use the anchor graph regularization (AGR) constrained K-means models, which effectively learn the membership matrix of data points and the membership matrix of anchors. In particular, by constructing an anchor graph, the AGR term not only discovers the internal structure information of data, but also covers the data distribution. Furthermore, an alternating optimization algorithm with fast-converging is adopted to solve the optimization problems of SKM-AGR, and the computational complexity is analyzed. Extensive clustering experiments on several synthetic and benchmark datasets show that the proposed SKM-AGR method performs better than several previous methods in most cases.",

keywords = "Anchor graph regularization, Cluster center, Membership matrix, Sparse K-means clustering",

author = "Xiaojun Yang and Weihao Zhao and Yuxiong Xu and Wang, {Chang Dong} and Bin Li and Feiping Nie",

note = "Publisher Copyright: {\textcopyright} 2024 Elsevier Inc.",

year = "2024",

month = may,

doi = "10.1016/j.ins.2024.120504",

language = "英语",

volume = "667",

journal = "Information Sciences",

issn = "0020-0255",

publisher = "Elsevier Inc.",

}

TY - JOUR

T1 - Sparse K-means clustering algorithm with anchor graph regularization

AU - Yang, Xiaojun

AU - Zhao, Weihao

AU - Xu, Yuxiong

AU - Wang, Chang Dong

AU - Li, Bin

AU - Nie, Feiping

PY - 2024/5

Y1 - 2024/5

N2 - As a classical unsupervised learning method, the K-means algorithm selects the cluster centers randomly and calculates the mean values of the cluster's data points to generate clusters. However, its performance is susceptible to the initial cluster centers and the sparsity of the membership matrix. To overcome these limitations, in this paper, we propose a sparse K-means clustering algorithm with anchor graph regularization (SKM-AGR) for optimizing initial cluster center sensitivity and improving membership matrix sparsity. The main idea is to use the anchor graph regularization (AGR) constrained K-means models, which effectively learn the membership matrix of data points and the membership matrix of anchors. In particular, by constructing an anchor graph, the AGR term not only discovers the internal structure information of data, but also covers the data distribution. Furthermore, an alternating optimization algorithm with fast-converging is adopted to solve the optimization problems of SKM-AGR, and the computational complexity is analyzed. Extensive clustering experiments on several synthetic and benchmark datasets show that the proposed SKM-AGR method performs better than several previous methods in most cases.

AB - As a classical unsupervised learning method, the K-means algorithm selects the cluster centers randomly and calculates the mean values of the cluster's data points to generate clusters. However, its performance is susceptible to the initial cluster centers and the sparsity of the membership matrix. To overcome these limitations, in this paper, we propose a sparse K-means clustering algorithm with anchor graph regularization (SKM-AGR) for optimizing initial cluster center sensitivity and improving membership matrix sparsity. The main idea is to use the anchor graph regularization (AGR) constrained K-means models, which effectively learn the membership matrix of data points and the membership matrix of anchors. In particular, by constructing an anchor graph, the AGR term not only discovers the internal structure information of data, but also covers the data distribution. Furthermore, an alternating optimization algorithm with fast-converging is adopted to solve the optimization problems of SKM-AGR, and the computational complexity is analyzed. Extensive clustering experiments on several synthetic and benchmark datasets show that the proposed SKM-AGR method performs better than several previous methods in most cases.

KW - Anchor graph regularization

KW - Cluster center

KW - Membership matrix

KW - Sparse K-means clustering

UR - http://www.scopus.com/inward/record.url?scp=85188939173&partnerID=8YFLogxK

U2 - 10.1016/j.ins.2024.120504

DO - 10.1016/j.ins.2024.120504

M3 - 文章

AN - SCOPUS:85188939173

SN - 0020-0255

VL - 667

JO - Information Sciences

JF - Information Sciences

M1 - 120504

ER -

Sparse K-means clustering algorithm with anchor graph regularization

摘要

访问文件

其它文件与链接

指纹

引用此