Effectiveness, Explainability and Reliability of Machine Meta-Learning Methods for Predicting Mortality in Patients with COVID-19: Results of the Brazilian COVID-19 Registry
Miranda de Paiva, B. B.; Pereira, P. D.; de Andrade, C. M. V.; Gomes, V. M. R.; Lima, M. C. P. B.; Silva, M. V. R. S.; Carneiro, M.; Martins, K. P. M. P.; Sales, T. L. S.; Carvalho, R. L. R. d.; Pires, M. C.; Ramos, L. E. F.; Silva, R. T.; Bezerra, A. F. B.; Schwarzbold, A. V.; Nunes, A. G. S.; Maurilio, A. d. O.; Scotton, A. L. B. A.; Costa, A. S. d. M.; Castro, A. A.; Farace, B. L.; Cimini, C. C. R.; De Carvalho, C. A.; Silveira, D. V.; Ponce, D.; Pereira, E. C.; Manenti, E. R. F.; Cenci, E. P. d. A.; Lucas, F. B.; Rodrigues, F. D.; Anschau, F.; Botoni, F. A.; Aranha, F. G.; Bartolazzi, F.;
Show abstract
ObjectiveTo provide a thorough comparative study among state-of-the-art machine learning methods and statistical methods for determining in-hospital mortality in COVID-19 patients using data upon hospital admission; to study the reliability of the predictions of the most effective methods by correlating the probability of the outcome and the accuracy of the methods; to investigate how explainable are the predictions produced by the most effective methods. Materials and MethodsDe-identified data were obtained from COVID-19 positive patients in 36 participating hospitals, from March 1 to September 30, 2020. Demographic, comorbidity, clinical presentation and laboratory data were used as training data to develop COVID-19 mortality prediction models. Multiple machine learning and traditional statistics models were trained on this prediction task using a folded cross-validation procedure, from which we assessed performance and interpretability metrics. ResultsThe Stacking of machine learning models improved over the previous state-of-the-art results by more than 26% in predicting the class of interest (death), achieving 87.1% of AUROC and macro F1 of 73.9%. We also show that some machine learning models can be very interpretable and reliable, yielding more accurate predictions while providing a good explanation for the why. ConclusionThe best results were obtained using the meta-learning ensemble model - Stacking. State-of the art explainability techniques such as SHAP-values can be used to draw useful insights into the patterns learned by machine-learning algorithms. Machine-learning models can be more explainable than traditional statistics models while also yielding highly reliable predictions.
Matching journals
The top 6 journals account for 50% of the predicted probability mass.