Improving Health Status Prediction by Applying Appropriate Missing Value Imputation Technique

Shaira Tabassum, Nuren Abedin, Rafiqul Islam Maruf, Mostafa Taufiq Ahmed, Ashir Ahmed

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

The presence of missing information in health data is a common occurrence, especially in remote healthcare systems. Lack of data in the medical domains reduces the representativeness of the samples, creates biased estimations, and leads to improper conclusions. These missing values need to be handled efficiently by selecting an appropriate imputation technique. This paper aims to find a suitable imputation technique for remote healthcare data. We use our Portable Health Clinic (PHC) dataset which was collected over 12 long years from different locations in Bangladesh and it was found that 20% of data items were missing. We carried out a comparative analysis among eight missing value handling methods by applying these methods to five state-of-the-art machine learning models with PHC Healthcare Dataset. The imputation performance of each case is evaluated based on accuracy and f1-score. The Multiple Imputation by Chained Equations (MICE) imputation has achieved the highest accuracy and f1-score in all of the cases. Thus, this study demonstrates MICE as the best performing missing value imputation technique with any composition of machine learning process and algorithms.

Original languageEnglish
Title of host publicationLifeTech 2022 - 2022 IEEE 4th Global Conference on Life Sciences and Technologies
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages345-348
Number of pages4
ISBN (Electronic)9781665419048
DOIs
Publication statusPublished - 2022
Event4th IEEE Global Conference on Life Sciences and Technologies, LifeTech 2022 - Osaka, Japan
Duration: Mar 7 2022Mar 9 2022

Publication series

NameLifeTech 2022 - 2022 IEEE 4th Global Conference on Life Sciences and Technologies

Conference

Conference4th IEEE Global Conference on Life Sciences and Technologies, LifeTech 2022
Country/TerritoryJapan
CityOsaka
Period3/7/223/9/22

All Science Journal Classification (ASJC) codes

  • Agricultural and Biological Sciences (miscellaneous)
  • Artificial Intelligence
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Biomedical Engineering
  • Instrumentation
  • Education

Fingerprint

Dive into the research topics of 'Improving Health Status Prediction by Applying Appropriate Missing Value Imputation Technique'. Together they form a unique fingerprint.

Cite this