Back

Machine learning models predict long COVID outcomes based on baseline clinical and immunologic factors

Jayavelu, N. D.; Samaha, H.; Wimalasena, S. T.; Hoch, A.; Gygi, J. P.; Gabernet, G.; Ozonoff, A.; Liu, S.; Milliren, C. E.; Levy, O.; Baden, L. R.; Melamed, E.; Ehrlich, L. I. R.; McComsey, G. A.; Sekaly, R. P.; Cairns, C. B.; Haddad, E. K.; Schaenman, J.; Shaw, A. C.; Hafler, D. A.; Montgomery, R. R.; Corry, D. B.; Kheradmand, F.; Atkinson, M. A.; Brakenridge, S. C.; Higuita, N. I. A.; Metcalf, J. P.; Hough, C. L.; Messer, W. B.; Pulendran, B.; Nadeau, K. C.; Davis, M. M.; Geng, L. N.; Sesma, A. F.; Simon, V.; Krammer, F.; Kraft, M.; Bime, C.; Calfee, C. S.; Erle, D. J.; Langelier, C. R.; IMP

2025-02-13 health informatics

10.1101/2025.02.12.25322164 medRxiv

Show abstract

The post-acute sequelae of SARS-CoV-2 (PASC), also known as long COVID, remain a significant health issue that is incompletely understood. Predicting which acutely infected individuals will go on to develop long COVID is challenging due to the lack of established biomarkers, clear disease mechanisms, or well-defined sub-phenotypes. Machine learning (ML) models offer the potential to address this by leveraging clinical data to enhance diagnostic precision. We utilized clinical data, including antibody titers and viral load measurements collected at the time of hospital admission, to predict the likelihood of acute COVID-19 progressing to long COVID. Our machine learning models achieved median AUROC values ranging from 0.64 to 0.66 and AUPRC values between 0.51 and 0.54, demonstrating their predictive capabilities. Feature importance analysis revealed that low antibody titers and high viral loads at hospital admission were the strongest predictors of long COVID outcomes. Comorbidities, including chronic respiratory, cardiac, and neurologic diseases, as well as female sex, were also identified as significant risk factors for long COVID. Our findings suggest that ML models have the potential to identify patients at risk for developing long COVID based on baseline clinical characteristics. These models can help guide early interventions, improving patient outcomes and mitigating the long-term public health impacts of SARS-CoV-2.

Machine learning models predict long COVID outcomes based on baseline clinical and immunologic factors

Matching journals