Discriminative embedded clustering: A framework for grouping high-dimensional data

Chenping Hou, Feiping Nie, Dongyun Yi, Dacheng Tao

Research output: Contribution to journalArticlepeer-review

112 Scopus citations

Abstract

In many real applications of machine learning and data mining, we are often confronted with high-dimensional data. How to cluster high-dimensional data is still a challenging problem due to the curse of dimensionality. In this paper, we try to address this problem using joint dimensionality reduction and clustering. Different from traditional approaches that conduct dimensionality reduction and clustering in sequence, we propose a novel framework referred to as discriminative embedded clustering which alternates them iteratively. Within this framework, we are able not only to view several traditional approaches and reveal their intrinsic relationships, but also to be stimulated to develop a new method. We also propose an effective approach for solving the formulated nonconvex optimization problem. Comprehensive analyses, including convergence behavior, parameter determination, and computational complexity, together with the relationship to other related approaches, are also presented. Plenty of experimental results on benchmark data sets illustrate that the proposed method outperforms related state-of-the-art clustering approaches and existing joint dimensionality reduction and clustering methods.

Original languageEnglish
Article number6867384
Pages (from-to)1287-1299
Number of pages13
JournalIEEE Transactions on Neural Networks and Learning Systems
Volume26
Issue number6
DOIs
StatePublished - 1 Jun 2015
Externally publishedYes

Keywords

  • Clustering
  • Dimensionality reduction
  • Discriminative embedded clustering (DEC)
  • High-dimensional data
  • Subspace learning

Fingerprint

Dive into the research topics of 'Discriminative embedded clustering: A framework for grouping high-dimensional data'. Together they form a unique fingerprint.

Cite this