Credal Clustering for Imbalanced Data

Zuowei Zhang, Zhunga Liu, Kuang Zhou, Arnaud Martin, Yiru Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Traditional evidential clustering tends to build clusters where the number of data for each cluster fairly close to each other. However, it may not be suitable for imbalanced data. This paper proposes a new method, called credal clustering (CClu), to deal with imbalanced data based on the theory of belief functions. Consider a dataset with C wanted classes, the credal c-means (CCM) clustering method is employed at first to divide the dataset into some (i.e., S(S>C) ) clusters. Then these clusters are gradually merged following a given principle based on the density of meta-clusters and the associated singleton clusters. The merging is finished when C singleton wanted classes are obtained. During this merging procedure, the objects in each singleton cluster will be assigned to one new singleton class. Moreover, a weighted mean vector rule is developed to classify the objects in the unmerged meta-cluster to the associated new classes using the K-Nearest neighbor technique. Two experiments show that CClu can handle imbalanced datasets with high accuracy, and the errors are reduced by properly modeling imprecision.

Original languageEnglish
Title of host publicationBelief Functions
Subtitle of host publicationTheory and Applications - 6th International Conference, BELIEF 2021, Proceedings
EditorsThierry Denœux, Eric Lefèvre, Zhunga Liu, Frédéric Pichon
PublisherSpringer Science and Business Media Deutschland GmbH
Pages13-21
Number of pages9
ISBN (Print)9783030886004
DOIs
StatePublished - 2021
Event6th International Conference on Belief Functions, BELIEF 2021 - Virtual, Online
Duration: 15 Oct 202119 Oct 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12915 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th International Conference on Belief Functions, BELIEF 2021
CityVirtual, Online
Period15/10/2119/10/21

Keywords

  • Belief functions
  • Credal c-means
  • Evidential clustering
  • Imbalanced data
  • K-NN

Fingerprint

Dive into the research topics of 'Credal Clustering for Imbalanced Data'. Together they form a unique fingerprint.

Cite this