Skip to main navigation Skip to search Skip to main content

An Effective and Efficient Algorithm for K-Means Clustering With New Formulation

  • Northwestern Polytechnical University Xian

Research output: Contribution to journalArticlepeer-review

140 Scopus citations

Abstract

K-means is one of the most simple and popular clustering algorithms, which implemented as a standard clustering method in most of machine learning researches. The goal of K-means clustering is finding a set of cluster centers and minimizing the sum of squared distances between each sample and its nearest clustering center. In this paper, we proposed a novel K-means clustering algorithm, which reformulate the classical K-Means objective function as a trace maximization problem and then replace it with a new formulation. The proposed algorithm does not need to calculate the cluster centers in each iteration and requires fewer additional intermediate variables during the optimization process. In addition, we proposed an efficient iterative re-weighted algorithm to solve the involved optimization problem and provided the corresponding convergence analysis. The proposed algorithm keeps a consistent computational complexity as Lloyd's algorithm, O(ndk), but shows a faster convergence rate in experiments. Extensive experimental results on real world benchmark datasets show the effectiveness and efficiency of the proposed algorithm.

Original languageEnglish
Pages (from-to)3433-3443
Number of pages11
JournalIEEE Transactions on Knowledge and Data Engineering
Volume35
Issue number4
DOIs
StatePublished - 1 Apr 2023

Keywords

  • Clustering
  • K-means
  • optimization
  • re-weighted

Fingerprint

Dive into the research topics of 'An Effective and Efficient Algorithm for K-Means Clustering With New Formulation'. Together they form a unique fingerprint.

Cite this