TY - JOUR
T1 - ℓ2,p-Norm and Mahalanobis Distance-Based Robust Fuzzy C-Means
AU - Chen, Qiang
AU - Nie, Feiping
AU - Yu, Weizhong
AU - Li, Xuelong
N1 - Publisher Copyright:
© 1993-2012 IEEE.
PY - 2023/9/1
Y1 - 2023/9/1
N2 - Fuzzy C-means (FCM) is a kind of classic cluster method, which has been widely used in various fields, such as image segmentation and data mining. Euclidean distance is a frequently used distance metric in FCM, but it is only suitable for data with spherical structure. As an extension of Euclidean distance, Mahalanobis distance has been used in Gustafson-Kessel FCM and its variants to tackle ellipsoidal data. For the convenience of optimizing, most existing Mahalanobis distance-based FCM algorithms only focus on squared Mahalanobis distance. However, squared Mahalanobis distance may not be the best distance metric for FCM because it is easy to enlarge the influence of outliers. In this article, we propose a novel ℓ2,p-norm and Mahalanobis distance-based FCM model, abbreviated as LM-FCM, which can help FCM improve the ability of tackling ellipsoidal clusters and outliers. Then, in order to reduce computational complexity, we propose a more simplified model, abbreviated as SLM-FCM. Furthermore, we develop an iteratively reweighted optimization algorithm to optimize the proposed models and provide a rigorous monotonous convergence proof. Finally, compared with the existing state-of-the-art FCM algorithms, we conduct extensive experiments on both synthetic and real-world datasets to manifest the superior clustering performance and robustness of the proposed algorithms.
AB - Fuzzy C-means (FCM) is a kind of classic cluster method, which has been widely used in various fields, such as image segmentation and data mining. Euclidean distance is a frequently used distance metric in FCM, but it is only suitable for data with spherical structure. As an extension of Euclidean distance, Mahalanobis distance has been used in Gustafson-Kessel FCM and its variants to tackle ellipsoidal data. For the convenience of optimizing, most existing Mahalanobis distance-based FCM algorithms only focus on squared Mahalanobis distance. However, squared Mahalanobis distance may not be the best distance metric for FCM because it is easy to enlarge the influence of outliers. In this article, we propose a novel ℓ2,p-norm and Mahalanobis distance-based FCM model, abbreviated as LM-FCM, which can help FCM improve the ability of tackling ellipsoidal clusters and outliers. Then, in order to reduce computational complexity, we propose a more simplified model, abbreviated as SLM-FCM. Furthermore, we develop an iteratively reweighted optimization algorithm to optimize the proposed models and provide a rigorous monotonous convergence proof. Finally, compared with the existing state-of-the-art FCM algorithms, we conduct extensive experiments on both synthetic and real-world datasets to manifest the superior clustering performance and robustness of the proposed algorithms.
KW - Euclidean distance
KW - fuzzy C-means
KW - Gustafson-Kessel (GK)
KW - Mahalanobis distance
KW - ℓ-norm
UR - http://www.scopus.com/inward/record.url?scp=85147282308&partnerID=8YFLogxK
U2 - 10.1109/TFUZZ.2023.3235384
DO - 10.1109/TFUZZ.2023.3235384
M3 - 文章
AN - SCOPUS:85147282308
SN - 1063-6706
VL - 31
SP - 2904
EP - 2916
JO - IEEE Transactions on Fuzzy Systems
JF - IEEE Transactions on Fuzzy Systems
IS - 9
ER -