2,p-Norm and Mahalanobis Distance-Based Robust Fuzzy C-Means

Qiang Chen, Feiping Nie, Weizhong Yu, Xuelong Li

Research output: Contribution to journalArticlepeer-review

12 Scopus citations

Abstract

Fuzzy C-means (FCM) is a kind of classic cluster method, which has been widely used in various fields, such as image segmentation and data mining. Euclidean distance is a frequently used distance metric in FCM, but it is only suitable for data with spherical structure. As an extension of Euclidean distance, Mahalanobis distance has been used in Gustafson-Kessel FCM and its variants to tackle ellipsoidal data. For the convenience of optimizing, most existing Mahalanobis distance-based FCM algorithms only focus on squared Mahalanobis distance. However, squared Mahalanobis distance may not be the best distance metric for FCM because it is easy to enlarge the influence of outliers. In this article, we propose a novel ℓ2,p-norm and Mahalanobis distance-based FCM model, abbreviated as LM-FCM, which can help FCM improve the ability of tackling ellipsoidal clusters and outliers. Then, in order to reduce computational complexity, we propose a more simplified model, abbreviated as SLM-FCM. Furthermore, we develop an iteratively reweighted optimization algorithm to optimize the proposed models and provide a rigorous monotonous convergence proof. Finally, compared with the existing state-of-the-art FCM algorithms, we conduct extensive experiments on both synthetic and real-world datasets to manifest the superior clustering performance and robustness of the proposed algorithms.

Original languageEnglish
Pages (from-to)2904-2916
Number of pages13
JournalIEEE Transactions on Fuzzy Systems
Volume31
Issue number9
DOIs
StatePublished - 1 Sep 2023

Keywords

  • Euclidean distance
  • fuzzy C-means
  • Gustafson-Kessel (GK)
  • Mahalanobis distance
  • ℓ-norm

Fingerprint

Dive into the research topics of 'ℓ2,p-Norm and Mahalanobis Distance-Based Robust Fuzzy C-Means'. Together they form a unique fingerprint.

Cite this