Simultaneous Bayesian Clustering and Model Selection with Mixture of Robust Factor Analyzers

Shan Feng, Wenxian Xie, Yufeng Nie

Research output: Contribution to journalArticlepeer-review

Abstract

Finite Gaussian mixture models are powerful tools for modeling distributions of random phenomena and are widely used for clustering tasks. However, their interpretability and efficiency are often degraded by the impact of redundancy and noise, especially on high-dimensional datasets. In this work, we propose a generative graphical model for parsimonious modeling of the Gaussian mixtures and robust unsupervised learning. The model assumes that the data are generated independently and identically from a finite mixture of robust factor analyzers, where the features’ salience is adjusted by an active set of latent factors to allow a violation of the local independence assumption. For the model inference, we propose a structured variational Bayes inference framework to realize simultaneous clustering, model selection and outlier processing. Performance of the proposed algorithm is evaluated by conducting experiments on artificial and real-world datasets. Moreover, an application on the high-dimensional machine learning task of handwritten alphabet recognition is introduced.

Original languageEnglish
Article number1091
JournalMathematics
Volume12
Issue number7
DOIs
StatePublished - Apr 2024

Keywords

  • Bayesian inference
  • feature selection
  • mixture of factor analyzers
  • robust clustering
  • structured variational Bayes

Fingerprint

Dive into the research topics of 'Simultaneous Bayesian Clustering and Model Selection with Mixture of Robust Factor Analyzers'. Together they form a unique fingerprint.

Cite this