Predictive Modeling of COVID-19 Readmissions: Insights from Machine Learning and Deep Learning Approaches
Document Type
Article
Publication Date
7-1-2024
Abstract
This project employs artificial intelligence, including machine learning and deep learning, to assess COVID-19 readmission risk in Malaysia. It offers tools to mitigate healthcare resource strain and enhance patient outcomes. This study outlines a methodology for classifying COVID-19 readmissions. It starts with dataset description and pre-processing, while the data balancing was computed through Random Oversampling, Borderline SMOTE, and Adaptive Synthetic Sampling. Nine machine learning and ten deep learning techniques are applied, with five-fold cross-validation for evaluation. Optuna is used for hyperparameter selection, while the consistency in training hyperparameters is maintained. Evaluation metrics encompass accuracy, AUC, and training/inference times. Results were based on stratified five-fold cross-validation and different data-balancing methods. Notably, CatBoost consistently excelled in accuracy and AUC across all tables. Using ROS, CatBoost achieved the highest accuracy (0.9882 +/- 0.0020) with an AUC of 1.0000 +/- 0.0000. CatBoost maintained its superiority in BSMOTE and ADASYN as well. Deep learning approaches performed well, with SAINT leading in ROS and TabNet leading in BSMOTE and ADASYN. Decision Tree ensembles like Random Forest and XGBoost consistently showed strong performance.
Keywords
COVID-19, readmission, prediction, machine learning, deep learning
Divisions
biomedengine,medbio,medicinedept,rehab
Funders
Impact-Oriented Interdisciplinary Research Grant (IIRG), Universiti Malaya (IIRG001B-2021IISS),The 2020 APT EBC-C (Extra-Budgetary Contributions from China) Project on Promoting the Use of ICT for Achievement of Sustainable Development Goals (IF015-2021)
Publication Title
Diagnostics
Volume
14
Issue
14
Publisher
MDPI
Publisher Location
ST ALBAN-ANLAGE 66, CH-4052 BASEL, SWITZERLAND