跳到主要导航 跳到搜索 跳到主要内容

Missing data imputation for machine learning

  • Northwestern Polytechnical University Xian

科研成果: 书/报告/会议事项章节会议稿件同行评审

9 引用 (Scopus)

摘要

. The imputation of missing values in datasets always plays an important role in the data preprocessing. In the process of data collection, because of the various reasons, the datasets often contain some missing values, and the excellent missing data imputation algorithms can increase the reliability of the dataset and reduce the impact of missing values on the whole dataset. In this paper, based on the Artificial Neural Network (ANN), we propose a missing data imputation method for the classification-type datasets. For each record which contains missing values, we make a list of the values that can be used to replace the missing data from the complete dataset. Our ANN model uses the complete records as the train dataset, and selects the most appropriate value in the list as the final result based on the label categories of the missing data. In our experiments, we compare our algorithm with the traditional single value imputation method and mean value imputation method with the Pima dataset. The result shows that our proposed algorithm can achieve better classification results when there are more missing values in the dataset.

源语言英语
主期刊名IoT as a Service- 4th EAI International Conference, IoTaaS 2018, Proceedings
编辑Bo Li, Mao Yang, Zhongjiang Yan, Hui Yuan
出版商Springer Verlag
67-72
页数6
ISBN(印刷版)9783030146566
DOI
出版状态已出版 - 2019
活动4th International Conference on IoT as a Service, IoTaaS 2018 - Xi’an, 中国
期限: 17 11月 201818 11月 2018

出版系列

姓名Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST
271
ISSN(印刷版)1867-8211

会议

会议4th International Conference on IoT as a Service, IoTaaS 2018
国家/地区中国
Xi’an
时期17/11/1818/11/18

指纹

探究 'Missing data imputation for machine learning' 的科研主题。它们共同构成独一无二的指纹。

引用此