Spectral clustering with linear embedding: A discrete clustering method for large-scale data

Chenhui Gao; Wenzhi Chen; Feiping Nie; Weizhong Yu; Zonghui Wang

doi:10.1016/j.patcog.2024.110396

Spectral clustering with linear embedding: A discrete clustering method for large-scale data

Chenhui Gao, Wenzhi Chen, Feiping Nie, Weizhong Yu, Zonghui Wang

School of Artificial Intelligence, OPtics and Electronics

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

In recent decades, spectral clustering has found widespread applications in various real-world scenarios, showcasing its effectiveness. Traditional spectral clustering typically follows a two-step procedure to address the optimization problem. However, this approach may result in substantial information loss and performance decline. Furthermore, the eigenvalue decomposition, a key step in spectral clustering, entails cubic computational complexity. This paper incorporates linear embedding into the objective function of spectral clustering and proposes a direct method to solve the indicator matrix. Moreover, our method achieves a linear time complexity with respect to the input data size. Our method, referred to as Spectral Clustering with Linear Embedding (SCLE), achieves a direct and efficient solution and naturally handles out-of-sample data. SCLE initiates the process with balanced and hierarchical K-means, effectively partitioning the input data into balanced clusters. After generating anchors, we compute a similarity matrix based on the distances between the input data points and the generated anchors. In contrast to the conventional two-step spectral clustering approach, we directly solve the cluster indicator matrix at a linear time complexity. Extensive experiments across multiple datasets underscore the efficiency and effectiveness of our proposed SCLE method.

Original language	English
Article number	110396
Journal	Pattern Recognition
Volume	151
DOIs	https://doi.org/10.1016/j.patcog.2024.110396
State	Published - Jul 2024

Keywords

Graph embedding
Spectral clustering
Unsupervised learning

Access to Document

10.1016/j.patcog.2024.110396

Cite this

@article{e5af9f4904184eed978b571bc23253d2,

title = "Spectral clustering with linear embedding: A discrete clustering method for large-scale data",

abstract = "In recent decades, spectral clustering has found widespread applications in various real-world scenarios, showcasing its effectiveness. Traditional spectral clustering typically follows a two-step procedure to address the optimization problem. However, this approach may result in substantial information loss and performance decline. Furthermore, the eigenvalue decomposition, a key step in spectral clustering, entails cubic computational complexity. This paper incorporates linear embedding into the objective function of spectral clustering and proposes a direct method to solve the indicator matrix. Moreover, our method achieves a linear time complexity with respect to the input data size. Our method, referred to as Spectral Clustering with Linear Embedding (SCLE), achieves a direct and efficient solution and naturally handles out-of-sample data. SCLE initiates the process with balanced and hierarchical K-means, effectively partitioning the input data into balanced clusters. After generating anchors, we compute a similarity matrix based on the distances between the input data points and the generated anchors. In contrast to the conventional two-step spectral clustering approach, we directly solve the cluster indicator matrix at a linear time complexity. Extensive experiments across multiple datasets underscore the efficiency and effectiveness of our proposed SCLE method.",

keywords = "Graph embedding, Spectral clustering, Unsupervised learning",

author = "Chenhui Gao and Wenzhi Chen and Feiping Nie and Weizhong Yu and Zonghui Wang",

note = "Publisher Copyright: {\textcopyright} 2024 Elsevier Ltd",

year = "2024",

month = jul,

doi = "10.1016/j.patcog.2024.110396",

language = "英语",

volume = "151",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier Ltd",

}

TY - JOUR

T1 - Spectral clustering with linear embedding

T2 - A discrete clustering method for large-scale data

AU - Gao, Chenhui

AU - Chen, Wenzhi

AU - Nie, Feiping

AU - Yu, Weizhong

AU - Wang, Zonghui

PY - 2024/7

Y1 - 2024/7

N2 - In recent decades, spectral clustering has found widespread applications in various real-world scenarios, showcasing its effectiveness. Traditional spectral clustering typically follows a two-step procedure to address the optimization problem. However, this approach may result in substantial information loss and performance decline. Furthermore, the eigenvalue decomposition, a key step in spectral clustering, entails cubic computational complexity. This paper incorporates linear embedding into the objective function of spectral clustering and proposes a direct method to solve the indicator matrix. Moreover, our method achieves a linear time complexity with respect to the input data size. Our method, referred to as Spectral Clustering with Linear Embedding (SCLE), achieves a direct and efficient solution and naturally handles out-of-sample data. SCLE initiates the process with balanced and hierarchical K-means, effectively partitioning the input data into balanced clusters. After generating anchors, we compute a similarity matrix based on the distances between the input data points and the generated anchors. In contrast to the conventional two-step spectral clustering approach, we directly solve the cluster indicator matrix at a linear time complexity. Extensive experiments across multiple datasets underscore the efficiency and effectiveness of our proposed SCLE method.

AB - In recent decades, spectral clustering has found widespread applications in various real-world scenarios, showcasing its effectiveness. Traditional spectral clustering typically follows a two-step procedure to address the optimization problem. However, this approach may result in substantial information loss and performance decline. Furthermore, the eigenvalue decomposition, a key step in spectral clustering, entails cubic computational complexity. This paper incorporates linear embedding into the objective function of spectral clustering and proposes a direct method to solve the indicator matrix. Moreover, our method achieves a linear time complexity with respect to the input data size. Our method, referred to as Spectral Clustering with Linear Embedding (SCLE), achieves a direct and efficient solution and naturally handles out-of-sample data. SCLE initiates the process with balanced and hierarchical K-means, effectively partitioning the input data into balanced clusters. After generating anchors, we compute a similarity matrix based on the distances between the input data points and the generated anchors. In contrast to the conventional two-step spectral clustering approach, we directly solve the cluster indicator matrix at a linear time complexity. Extensive experiments across multiple datasets underscore the efficiency and effectiveness of our proposed SCLE method.

KW - Graph embedding

KW - Spectral clustering

KW - Unsupervised learning

UR - http://www.scopus.com/inward/record.url?scp=85186727733&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2024.110396

DO - 10.1016/j.patcog.2024.110396

M3 - 文章

AN - SCOPUS:85186727733

SN - 0031-3203

VL - 151

JO - Pattern Recognition

JF - Pattern Recognition

M1 - 110396

ER -

Spectral clustering with linear embedding: A discrete clustering method for large-scale data

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this