Back

Predict community-acquired pneumonia outcome using time series data and machine learning

Lozano-Rojas, D.; Richardson, M.; Woltmann, G.; Free, R. C.

2025-03-12 respiratory medicine
10.1101/2025.03.11.25323764 medRxiv
Show abstract

BackgroundCommunity-acquired pneumonia (CAP) is an acute respiratory condition associated with high mortality in adult populations and is potentially more serious in older patients. Accurate and consistently applied prediction of outcome may contribute to reduce in-hospital mortality. Currently, CAP outcomes are assessed with clinical scores like CURB65, based on signs and symptoms that are non-specific to the disease. Recent literature has shown that machine learning (ML) has the potential to improve outcome prediction, but the sparse and incomplete nature of the data present a challenge for the development of models that can be implemented clinically. MethodsThis study aimed to developed ML models that can support outcome prediction in hospital admissions with CAP using routinely collected and time-dependent data from Leicester hospitals. Thus, by modelling mortality prediction, and predicting URB65 on the third day of admission with the forecast of vital signs, implementing a methodology that explores how different characteristics involved in the training process influence the results of the predictions. ResultsData comprised 9390 admissions in the training set, and 7892 in the validation set, for thirty-four clinical variables (fifteen time-dependent). Results of CAP mortality modelling reported AUC of 0.77 using a GRU model that was trained with the time series of vital signs and blood test. Results also showed improvement in models when balancing classes of the target variable in the training set, as well as improvement when using time dependent data. And importantly when predicting URB65 accuracy of 0.85 was obtained when modelled using GRU, when time series were processed using local scaling. ConclusionsThis approach might represent an opportunity to anticipate adverse outcomes. These results suggest that ML models utilising time series can have sizable impact in the prediction of CAP outcome, from many perspectives: Complementing currently applied scoring systems approaches like CURB65 in hospital settings, prediction of mortality or forecasting the severity of patients from vital signs that have shown correlation with CAP mortality. The models presented require further validation and development, although they present important indication for CAP mortality prediction.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 14%
12.7%
2
Scientific Reports
3102 papers in training set
Top 6%
10.1%
3
JMIR Medical Informatics
17 papers in training set
Top 0.1%
8.2%
4
International Journal of Medical Informatics
25 papers in training set
Top 0.1%
6.8%
5
BMJ Open Respiratory Research
32 papers in training set
Top 0.1%
6.8%
6
Life
27 papers in training set
Top 0.1%
4.0%
7
Journal of Medical Internet Research
85 papers in training set
Top 1%
4.0%
50% of probability mass above
8
ERJ Open Research
44 papers in training set
Top 0.3%
3.6%
9
International Journal of Environmental Research and Public Health
124 papers in training set
Top 2%
3.6%
10
JMIRx Med
31 papers in training set
Top 0.2%
3.1%
11
Frontiers in Physiology
93 papers in training set
Top 2%
2.1%
12
BMJ Open
554 papers in training set
Top 8%
2.1%
13
Archives of Clinical and Biomedical Research
28 papers in training set
Top 0.4%
2.1%
14
BJGP Open
12 papers in training set
Top 0.3%
1.9%
15
BMC Medical Informatics and Decision Making
39 papers in training set
Top 1%
1.7%
16
Cureus
67 papers in training set
Top 3%
1.7%
17
Royal Society Open Science
193 papers in training set
Top 2%
1.7%
18
PLOS Digital Health
91 papers in training set
Top 2%
1.5%
19
European Respiratory Journal
54 papers in training set
Top 1%
1.3%
20
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.2%
21
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.5%
1.1%
22
Frontiers in Medicine
113 papers in training set
Top 6%
0.8%
23
Informatics in Medicine Unlocked
21 papers in training set
Top 1%
0.8%
24
JAMIA Open
37 papers in training set
Top 1%
0.8%
25
BMJ Health & Care Informatics
13 papers in training set
Top 0.8%
0.8%
26
Clinical Chemistry
22 papers in training set
Top 0.7%
0.8%
27
IEEE Access
31 papers in training set
Top 1.0%
0.7%
28
Bioengineering
24 papers in training set
Top 1%
0.7%
29
Medical Research Archives
11 papers in training set
Top 1.0%
0.5%
30
Genomics
60 papers in training set
Top 4%
0.5%