Skip to main navigation Skip to search Skip to main content

Large-Scale Clustering With Anchor-Based Constrained Laplacian Rank

  • Northwestern Polytechnical University Xian
  • National Key Laboratory of Air-based Information Perception and Fusion
  • China Telecommunications

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Graph-based clustering technique has garnered sig nificant attention due to precise information characterization by pairwise graph similarity. Nevertheless, the post-processing step in traditional methods often limits clustering effects because of crucial information loss. Therefore, the Constrained Laplacian Rank (CLR) theory emerges to directly obtain discrete labels from optimally structural graph, achieving desirable outcomes. However, CLR suffers from substantial time overhead, making it infeasible for large-scale data analysis. To overcome this issue, we propose Anchor-based CLR (ACLR), a simple yet effective method for efficient large-scale clustering. The ACLR method comprises four stages: (1) anchors that roughly cover original data are opted to prepare bipartite graph construction; (2) a novel two-step prob ability transition (TSPT) strategy initializes a small-scale graph with random walk probability among anchors;(3) the main ACLR model alternately optimizes the graph connected structure and directly produces discrete anchor labels, achieving a time complexity independent of the number of samples due to dramaticallybreduced graphscale; and (4) labels are propagated from anchors to samples using K-NN algorithm. Extensive experiments demonstrate that ACLR yields superior accuracy and efficiency, particularly when applied to large-scale data.

Original languageEnglish
Pages (from-to)4144-4158
Number of pages15
JournalIEEE Transactions on Knowledge and Data Engineering
Volume37
Issue number7
DOIs
StatePublished - 2025

Keywords

  • Constrained Laplacian rank
  • anchor
  • bipartite graph
  • graph connected structure
  • label propagation
  • large-scale clustering
  • two-step probability transition

Fingerprint

Dive into the research topics of 'Large-Scale Clustering With Anchor-Based Constrained Laplacian Rank'. Together they form a unique fingerprint.

Cite this