Self-paced and structured graph-based ensemble clustering

Research output: Contribution to journalArticlepeer-review

Abstract

Ensemble clustering is an important topic in data mining, which combines multiple base clusterings to produce a more robust and effective clustering model. Scholars have proposed many methods in ensemble clustering and made important theoretical contributions in the past few years. However, there are still several drawbacks in these methods. Firstly, existing methods treat each base clusterings equally. Secondly, difficult instances are not fully considered, which may influence the optimization process. To address the aforementioned limitations, we introduce a novel self-paced and structured graph-based ensemble clustering algorithm (S2GEC), which introduces self-paced learning to gradually involve instances into training procedure from easy to difficult. In addition, we also proposed a unified optimization framework which can evaluate instances and base clusterings simultaneously so as to obtain the consensus clustering result. Moreover, S2GEC can obtain a structured similarity matrix to extract the clustering indicators directly. Extensive experiments conducted on commonly used datasets have demonstrated the superiority of our algorithm.

Original languageEnglish
Article number110269
JournalSignal Processing
Volume239
DOIs
StatePublished - Feb 2026

Keywords

  • Clustering
  • High-order bipartite graph
  • Structured proximity matrix
  • Tensor nuclear norm

Fingerprint

Dive into the research topics of 'Self-paced and structured graph-based ensemble clustering'. Together they form a unique fingerprint.

Cite this