Clustering by Unified Principal Component Analysis and Fuzzy C-Means with Sparsity Constraint

Jikui Wang, Quanfu Shi, Zhengguo Yang, Feiping Nie

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

For clustering high-dimensional data, most of the state-of-the-art algorithms often extract principal component beforehand, and then conduct a concrete clustering method. However, the two-stage strategy may deviate from assignments by directly optimizing the unified objective function. Different from the traditional methods, we propose a novel method referred to as clustering by unified principal component analysis and fuzzy c-means (UPF) for clustering high-dimensional data. Our model can explore underlying clustering structure in low-dimensional space and finish clustering simultaneously. In particular, we impose a L0-norm constraint on the membership matrix to make the matrix more sparse. To solve the model, we propose an effective iterative optimization algorithm. Extensive experiments on several benchmark data sets in comparison with two-stage algorithms are conducted to validate effectiveness of the proposed method. The experiments results demonstrate that the performance of our proposed method is superiority.

Original languageEnglish
Title of host publicationAlgorithms and Architectures for Parallel Processing - 20th International Conference, ICA3PP 2020, Proceedings
EditorsMeikang Qiu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages337-351
Number of pages15
ISBN (Print)9783030602383
DOIs
StatePublished - 2020
Event20th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2020 - New York, United States
Duration: 2 Oct 20204 Oct 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12453 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference20th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2020
Country/TerritoryUnited States
CityNew York
Period2/10/204/10/20

Keywords

  • Artificial Intelligence
  • Fuzzy c-means clustering
  • Machine learning
  • Principal component analysis
  • Sparsity constraint

Fingerprint

Dive into the research topics of 'Clustering by Unified Principal Component Analysis and Fuzzy C-Means with Sparsity Constraint'. Together they form a unique fingerprint.

Cite this