Abstract
Ensemble clustering is an important topic in data mining, which combines multiple base clusterings to produce a more robust and effective clustering model. Scholars have proposed many methods in ensemble clustering and made important theoretical contributions in the past few years. However, there are still several drawbacks in these methods. Firstly, existing methods treat each base clusterings equally. Secondly, difficult instances are not fully considered, which may influence the optimization process. To address the aforementioned limitations, we introduce a novel self-paced and structured graph-based ensemble clustering algorithm (S2GEC), which introduces self-paced learning to gradually involve instances into training procedure from easy to difficult. In addition, we also proposed a unified optimization framework which can evaluate instances and base clusterings simultaneously so as to obtain the consensus clustering result. Moreover, S2GEC can obtain a structured similarity matrix to extract the clustering indicators directly. Extensive experiments conducted on commonly used datasets have demonstrated the superiority of our algorithm.
| Original language | English |
|---|---|
| Article number | 110269 |
| Journal | Signal Processing |
| Volume | 239 |
| DOIs | |
| State | Published - Feb 2026 |
Keywords
- Clustering
- High-order bipartite graph
- Structured proximity matrix
- Tensor nuclear norm