Skip to main navigation Skip to search Skip to main content

Missing data imputation for machine learning

  • Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

9 Scopus citations

Abstract

. The imputation of missing values in datasets always plays an important role in the data preprocessing. In the process of data collection, because of the various reasons, the datasets often contain some missing values, and the excellent missing data imputation algorithms can increase the reliability of the dataset and reduce the impact of missing values on the whole dataset. In this paper, based on the Artificial Neural Network (ANN), we propose a missing data imputation method for the classification-type datasets. For each record which contains missing values, we make a list of the values that can be used to replace the missing data from the complete dataset. Our ANN model uses the complete records as the train dataset, and selects the most appropriate value in the list as the final result based on the label categories of the missing data. In our experiments, we compare our algorithm with the traditional single value imputation method and mean value imputation method with the Pima dataset. The result shows that our proposed algorithm can achieve better classification results when there are more missing values in the dataset.

Original languageEnglish
Title of host publicationIoT as a Service- 4th EAI International Conference, IoTaaS 2018, Proceedings
EditorsBo Li, Mao Yang, Zhongjiang Yan, Hui Yuan
PublisherSpringer Verlag
Pages67-72
Number of pages6
ISBN (Print)9783030146566
DOIs
StatePublished - 2019
Event4th International Conference on IoT as a Service, IoTaaS 2018 - Xi’an, China
Duration: 17 Nov 201818 Nov 2018

Publication series

NameLecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST
Volume271
ISSN (Print)1867-8211

Conference

Conference4th International Conference on IoT as a Service, IoTaaS 2018
Country/TerritoryChina
CityXi’an
Period17/11/1818/11/18

Keywords

  • Artificial Neural Network
  • Data imputation
  • Machine learning

Fingerprint

Dive into the research topics of 'Missing data imputation for machine learning'. Together they form a unique fingerprint.

Cite this