Iterative Robust Semi-Supervised Missing Data Imputation

Nikos Fazakis, Georgios Kostopoulos, Sotiris Kotsiantis, Iosif Mporas

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)
47 Downloads (Pure)

Abstract

In many real-world applications scientists are often confronted with the problem of incomplete datasets due to several reasons. The direct analysis of datasets with missing values in attributes inevitably results in inaccurate learning models and erroneous results. Facing effectively the challenge of missing values is an essential step of the data mining process. Imputation is often employed to overcome the shortcomings incurred by missing data during the pre-process stage of data analysis. Therefore, a plethora of statistical and machine learning methods have been proposed and employed with a view to imputing the missing values in incomplete data with their potential or actual values. In this context, the main objective of this paper is to put forward an iterative stepwise imputation method based on the semi-supervised learning approach, called IRSSI. Semi-supervised methods have proved to be particularly effective for exploiting incomplete or partially labeled data with regard to the values of the target attribute. The proposed algorithm was experimentally evaluated on real-world benchmark datasets and artificially generated datasets using different high ratios of missing data. The experimental results demonstrate the efficiency of IRSSI algorithm compared to typical imputation methods.
Original languageEnglish
Pages (from-to)90555 - 90569
Number of pages15
JournalIEEE Access
Volume8
DOIs
Publication statusPublished - 12 May 2020

Fingerprint

Dive into the research topics of 'Iterative Robust Semi-Supervised Missing Data Imputation'. Together they form a unique fingerprint.

Cite this