TY - GEN
T1 - Evidential Association Classification for High-Dimensional Data
AU - Geng, Xiaojiao
AU - Liang, Yan
AU - Jiao, Lianmeng
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/4/24
Y1 - 2021/4/24
N2 - Association classification (AC) has p-roven to be a promising approach in data mining by integrating association mining and classification. The evidential association rule-based classification (EARC), as an extension of AC in belief function framework, is an effective classification model for addressing multiple uncertainties in real-world applications, but it encounters difficulties in dealing with high-dimensional data. To adapt the EARC to high-dimensional data, an improved evidential association classification method, called EARC-HD, is developed based on four stages: entropy-based fuzzy partition, evidential class association rule mining, rule prescreening and genetic rule selection. Comparing with EARC, an entropy-based fuzzy partition strategy is designed for deriving a series of fuzzy regions, with which some irrelevant attributes can be deleted. Moreover, the number of antecedent attributes is limited for effectively reducing the computational complexity in rule generation. To improve the efficiency of the classifier, the techniques of redundancy reduction and subgroup discovery are used for prescreening the mined rule set before a genetic rule selection process. Experimental results based on real-world datasets demonstrate the effectiveness of the proposed method in dealing with high-dimensional problems.
AB - Association classification (AC) has p-roven to be a promising approach in data mining by integrating association mining and classification. The evidential association rule-based classification (EARC), as an extension of AC in belief function framework, is an effective classification model for addressing multiple uncertainties in real-world applications, but it encounters difficulties in dealing with high-dimensional data. To adapt the EARC to high-dimensional data, an improved evidential association classification method, called EARC-HD, is developed based on four stages: entropy-based fuzzy partition, evidential class association rule mining, rule prescreening and genetic rule selection. Comparing with EARC, an entropy-based fuzzy partition strategy is designed for deriving a series of fuzzy regions, with which some irrelevant attributes can be deleted. Moreover, the number of antecedent attributes is limited for effectively reducing the computational complexity in rule generation. To improve the efficiency of the classifier, the techniques of redundancy reduction and subgroup discovery are used for prescreening the mined rule set before a genetic rule selection process. Experimental results based on real-world datasets demonstrate the effectiveness of the proposed method in dealing with high-dimensional problems.
KW - association classification
KW - belief function
KW - entropy-based fuzzy partition
KW - genetic rule selection
KW - high dimensional data
UR - http://www.scopus.com/inward/record.url?scp=85107669522&partnerID=8YFLogxK
U2 - 10.1109/ICCCBDA51879.2021.9442509
DO - 10.1109/ICCCBDA51879.2021.9442509
M3 - 会议稿件
AN - SCOPUS:85107669522
T3 - 2021 IEEE 6th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA 2021
SP - 100
EP - 105
BT - 2021 IEEE 6th International Conference on Cloud Computing and Big Data Analytics, ICCCBDA 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 6th IEEE International Conference on Cloud Computing and Big Data Analytics, ICCCBDA 2021
Y2 - 24 April 2021 through 26 April 2021
ER -