Fast Self-Supervised Clustering With Anchor Graph

Jingyu Wang; Zhenyu Ma; Feiping Nie; Xuelong Li

doi:10.1109/TNNLS.2021.3056080

Fast Self-Supervised Clustering With Anchor Graph

Jingyu Wang, Zhenyu Ma, Feiping Nie, Xuelong Li

School of Artificial Intelligence, OPtics and Electronics

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

41 Scopus citations

Abstract

Benefit from avoiding the utilization of labeled samples, which are usually insufficient in the real world, unsupervised learning has been regarded as a speedy and powerful strategy on clustering tasks. However, clustering directly from primal data sets leads to high computational cost, which limits its application on large-scale and high-dimensional problems. Recently, anchor-based theories are proposed to partly mitigate this problem and field naturally sparse affinity matrix, while it is still a challenge to get excellent performance along with high efficiency. To dispose of this issue, we first presented a fast semisupervised framework (FSSF) combined with a balanced $K$ -means-based hierarchical $K$ -means (BKHK) method and the bipartite graph theory. Thereafter, we proposed a fast self-supervised clustering method involved in this crucial semisupervised framework, in which all labels are inferred from a constructed bipartite graph with exactly $k$ connected components. The proposed method remarkably accelerates the general semisupervised learning through the anchor and consists of four significant parts: 1) obtaining the anchor set as interim through BKHK algorithm; 2) constructing the bipartite graph; 3) solving the self-supervised problem to construct a typical probability model with FSSF; and 4) selecting the most representative points regarding anchors from BKHK as an interim and conducting label propagation. The experimental results on toy examples and benchmark data sets have demonstrated that the proposed method outperforms other approaches.

Original language	English
Pages (from-to)	4199-4212
Number of pages	14
Journal	IEEE Transactions on Neural Networks and Learning Systems
Volume	33
Issue number	9
DOIs	https://doi.org/10.1109/TNNLS.2021.3056080
State	Published - 1 Sep 2022

Keywords

Bipartite graph
label propagation
self-supervised learning
semisupervised framework
special selection

Access to Document

10.1109/TNNLS.2021.3056080

Cite this

@article{92d092aa1e91416397b1fc5e963093b9,

title = "Fast Self-Supervised Clustering With Anchor Graph",

abstract = "Benefit from avoiding the utilization of labeled samples, which are usually insufficient in the real world, unsupervised learning has been regarded as a speedy and powerful strategy on clustering tasks. However, clustering directly from primal data sets leads to high computational cost, which limits its application on large-scale and high-dimensional problems. Recently, anchor-based theories are proposed to partly mitigate this problem and field naturally sparse affinity matrix, while it is still a challenge to get excellent performance along with high efficiency. To dispose of this issue, we first presented a fast semisupervised framework (FSSF) combined with a balanced $K$ -means-based hierarchical $K$ -means (BKHK) method and the bipartite graph theory. Thereafter, we proposed a fast self-supervised clustering method involved in this crucial semisupervised framework, in which all labels are inferred from a constructed bipartite graph with exactly $k$ connected components. The proposed method remarkably accelerates the general semisupervised learning through the anchor and consists of four significant parts: 1) obtaining the anchor set as interim through BKHK algorithm; 2) constructing the bipartite graph; 3) solving the self-supervised problem to construct a typical probability model with FSSF; and 4) selecting the most representative points regarding anchors from BKHK as an interim and conducting label propagation. The experimental results on toy examples and benchmark data sets have demonstrated that the proposed method outperforms other approaches.",

keywords = "Bipartite graph, label propagation, self-supervised learning, semisupervised framework, special selection",

author = "Jingyu Wang and Zhenyu Ma and Feiping Nie and Xuelong Li",

note = "Publisher Copyright: {\textcopyright} 2012 IEEE.",

year = "2022",

month = sep,

day = "1",

doi = "10.1109/TNNLS.2021.3056080",

language = "英语",

volume = "33",

pages = "4199--4212",

journal = "IEEE Transactions on Neural Networks and Learning Systems",

issn = "2162-237X",

publisher = "IEEE Computational Intelligence Society",

number = "9",

}

TY - JOUR

T1 - Fast Self-Supervised Clustering With Anchor Graph

AU - Wang, Jingyu

AU - Ma, Zhenyu

AU - Nie, Feiping

AU - Li, Xuelong

PY - 2022/9/1

Y1 - 2022/9/1

N2 - Benefit from avoiding the utilization of labeled samples, which are usually insufficient in the real world, unsupervised learning has been regarded as a speedy and powerful strategy on clustering tasks. However, clustering directly from primal data sets leads to high computational cost, which limits its application on large-scale and high-dimensional problems. Recently, anchor-based theories are proposed to partly mitigate this problem and field naturally sparse affinity matrix, while it is still a challenge to get excellent performance along with high efficiency. To dispose of this issue, we first presented a fast semisupervised framework (FSSF) combined with a balanced $K$ -means-based hierarchical $K$ -means (BKHK) method and the bipartite graph theory. Thereafter, we proposed a fast self-supervised clustering method involved in this crucial semisupervised framework, in which all labels are inferred from a constructed bipartite graph with exactly $k$ connected components. The proposed method remarkably accelerates the general semisupervised learning through the anchor and consists of four significant parts: 1) obtaining the anchor set as interim through BKHK algorithm; 2) constructing the bipartite graph; 3) solving the self-supervised problem to construct a typical probability model with FSSF; and 4) selecting the most representative points regarding anchors from BKHK as an interim and conducting label propagation. The experimental results on toy examples and benchmark data sets have demonstrated that the proposed method outperforms other approaches.

AB - Benefit from avoiding the utilization of labeled samples, which are usually insufficient in the real world, unsupervised learning has been regarded as a speedy and powerful strategy on clustering tasks. However, clustering directly from primal data sets leads to high computational cost, which limits its application on large-scale and high-dimensional problems. Recently, anchor-based theories are proposed to partly mitigate this problem and field naturally sparse affinity matrix, while it is still a challenge to get excellent performance along with high efficiency. To dispose of this issue, we first presented a fast semisupervised framework (FSSF) combined with a balanced $K$ -means-based hierarchical $K$ -means (BKHK) method and the bipartite graph theory. Thereafter, we proposed a fast self-supervised clustering method involved in this crucial semisupervised framework, in which all labels are inferred from a constructed bipartite graph with exactly $k$ connected components. The proposed method remarkably accelerates the general semisupervised learning through the anchor and consists of four significant parts: 1) obtaining the anchor set as interim through BKHK algorithm; 2) constructing the bipartite graph; 3) solving the self-supervised problem to construct a typical probability model with FSSF; and 4) selecting the most representative points regarding anchors from BKHK as an interim and conducting label propagation. The experimental results on toy examples and benchmark data sets have demonstrated that the proposed method outperforms other approaches.

KW - Bipartite graph

KW - label propagation

KW - self-supervised learning

KW - semisupervised framework

KW - special selection

UR - http://www.scopus.com/inward/record.url?scp=85100919990&partnerID=8YFLogxK

U2 - 10.1109/TNNLS.2021.3056080

DO - 10.1109/TNNLS.2021.3056080

M3 - 文章

C2 - 33587715

AN - SCOPUS:85100919990

SN - 2162-237X

VL - 33

SP - 4199

EP - 4212

JO - IEEE Transactions on Neural Networks and Learning Systems

JF - IEEE Transactions on Neural Networks and Learning Systems

IS - 9

ER -

Fast Self-Supervised Clustering With Anchor Graph

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this