Classification of incomplete data based on belief functions and K-nearest neighbors

Zhun Ga Liu; Yong Liu; Jean Dezert; Quan Pan

doi:10.1016/j.knosys.2015.06.022

Classification of incomplete data based on belief functions and K-nearest neighbors

Zhun Ga Liu, Yong Liu, Jean Dezert, Quan Pan

School of Automation

Research output: Contribution to journal › Article › peer-review

30 Scopus citations

Abstract

It can be quite difficult to correctly and precisely classify the incomplete data with missing values, since the missing information usually causes ambiguities (uncertainty) in the classification result. Belief function theory can well model such uncertain and imprecise information, and a new belief-based method for credal classification of incomplete data (CCI) is proposed using the K nearest neighbors (KNNs) strategy. In CCI, the KNNs of object (incomplete data) are respectively used to estimate the missing values, and one can obtain K versions of edited pattern with estimated values from the KNNs. The K edited patterns are classified by any classical method to get K pieces of classification results with different discounting (weighting) factors depending on the distances between the object and its KNNs, and global fusion of the K classification results represented by the basic belief assignments (bba's) is used for credal classification of the object. The conflicting beliefs produced in the fusion process can well capture the imprecision degree of classification, and it will be transferred to the selected meta-class defined by the disjunction of several classes (i.e. the set of several classes) according to the current context. Thus, the incomplete data that is hard to correctly classify because of the missing values will be reasonably committed to proper meta-class, which is able to characterize the imprecision of classification and reduce the errors as well. Three experiments are given to illustrate the potential and interest of CCI approach.

Original language	English
Pages (from-to)	113-125
Number of pages	13
Journal	Knowledge-Based Systems
Volume	89
DOIs	https://doi.org/10.1016/j.knosys.2015.06.022
State	Published - Nov 2015

Keywords

Belief functions
Credal classification
Evidence fusion
Incomplete data
K-nearest neighbor

Access to Document

10.1016/j.knosys.2015.06.022

Cite this

@article{7a2bc2c762db41e69c5ebe5fba862b9c,

title = "Classification of incomplete data based on belief functions and K-nearest neighbors",

abstract = "It can be quite difficult to correctly and precisely classify the incomplete data with missing values, since the missing information usually causes ambiguities (uncertainty) in the classification result. Belief function theory can well model such uncertain and imprecise information, and a new belief-based method for credal classification of incomplete data (CCI) is proposed using the K nearest neighbors (KNNs) strategy. In CCI, the KNNs of object (incomplete data) are respectively used to estimate the missing values, and one can obtain K versions of edited pattern with estimated values from the KNNs. The K edited patterns are classified by any classical method to get K pieces of classification results with different discounting (weighting) factors depending on the distances between the object and its KNNs, and global fusion of the K classification results represented by the basic belief assignments (bba's) is used for credal classification of the object. The conflicting beliefs produced in the fusion process can well capture the imprecision degree of classification, and it will be transferred to the selected meta-class defined by the disjunction of several classes (i.e. the set of several classes) according to the current context. Thus, the incomplete data that is hard to correctly classify because of the missing values will be reasonably committed to proper meta-class, which is able to characterize the imprecision of classification and reduce the errors as well. Three experiments are given to illustrate the potential and interest of CCI approach.",

keywords = "Belief functions, Credal classification, Evidence fusion, Incomplete data, K-nearest neighbor",

author = "Liu, {Zhun Ga} and Yong Liu and Jean Dezert and Quan Pan",

year = "2015",

month = nov,

doi = "10.1016/j.knosys.2015.06.022",

language = "英语",

volume = "89",

pages = "113--125",

journal = "Knowledge-Based Systems",

issn = "0950-7051",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Classification of incomplete data based on belief functions and K-nearest neighbors

AU - Liu, Zhun Ga

AU - Liu, Yong

AU - Dezert, Jean

AU - Pan, Quan

PY - 2015/11

Y1 - 2015/11

N2 - It can be quite difficult to correctly and precisely classify the incomplete data with missing values, since the missing information usually causes ambiguities (uncertainty) in the classification result. Belief function theory can well model such uncertain and imprecise information, and a new belief-based method for credal classification of incomplete data (CCI) is proposed using the K nearest neighbors (KNNs) strategy. In CCI, the KNNs of object (incomplete data) are respectively used to estimate the missing values, and one can obtain K versions of edited pattern with estimated values from the KNNs. The K edited patterns are classified by any classical method to get K pieces of classification results with different discounting (weighting) factors depending on the distances between the object and its KNNs, and global fusion of the K classification results represented by the basic belief assignments (bba's) is used for credal classification of the object. The conflicting beliefs produced in the fusion process can well capture the imprecision degree of classification, and it will be transferred to the selected meta-class defined by the disjunction of several classes (i.e. the set of several classes) according to the current context. Thus, the incomplete data that is hard to correctly classify because of the missing values will be reasonably committed to proper meta-class, which is able to characterize the imprecision of classification and reduce the errors as well. Three experiments are given to illustrate the potential and interest of CCI approach.

AB - It can be quite difficult to correctly and precisely classify the incomplete data with missing values, since the missing information usually causes ambiguities (uncertainty) in the classification result. Belief function theory can well model such uncertain and imprecise information, and a new belief-based method for credal classification of incomplete data (CCI) is proposed using the K nearest neighbors (KNNs) strategy. In CCI, the KNNs of object (incomplete data) are respectively used to estimate the missing values, and one can obtain K versions of edited pattern with estimated values from the KNNs. The K edited patterns are classified by any classical method to get K pieces of classification results with different discounting (weighting) factors depending on the distances between the object and its KNNs, and global fusion of the K classification results represented by the basic belief assignments (bba's) is used for credal classification of the object. The conflicting beliefs produced in the fusion process can well capture the imprecision degree of classification, and it will be transferred to the selected meta-class defined by the disjunction of several classes (i.e. the set of several classes) according to the current context. Thus, the incomplete data that is hard to correctly classify because of the missing values will be reasonably committed to proper meta-class, which is able to characterize the imprecision of classification and reduce the errors as well. Three experiments are given to illustrate the potential and interest of CCI approach.

KW - Belief functions

KW - Credal classification

KW - Evidence fusion

KW - Incomplete data

KW - K-nearest neighbor

UR - http://www.scopus.com/inward/record.url?scp=84944354545&partnerID=8YFLogxK

U2 - 10.1016/j.knosys.2015.06.022

DO - 10.1016/j.knosys.2015.06.022

M3 - 文章

AN - SCOPUS:84944354545

SN - 0950-7051

VL - 89

SP - 113

EP - 125

JO - Knowledge-Based Systems

JF - Knowledge-Based Systems

ER -

Classification of incomplete data based on belief functions and K-nearest neighbors

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this