Back

A Deep Learning Approach for Culture-Free Bacterial Meningitis Diagnosis and ICU Outcome Prediction

Chen, R.; Cai, Y.; Zhang, S.; Huo, Z.; Song, M.; Li, W.; Yang, D.; Zhang, X.

2024-12-21 intensive care and critical care medicine
10.1101/2024.12.20.24318979 medRxiv
Show abstract

BackgroundCerebrospinal fluid (CSF) culture is the diagnostic gold standard for neuroinfectious diseases such as bacterial meningitis, but its sensitivity is limited and results are often delayed. Natural language processing (NLP) offers a powerful approach to extract meaningful clinical signals from unstructured data such as chief complaints and ICD notes. This study applies machine learning, including BioBERT-enhanced NLP models (not traditional TF-IDF approaches), to support early diagnosis and outcome prediction in ICU patients. MethodsTraining and validation datasets were derived from MIMIC-IV (internal) and MIMIC-III/eICU (external) databases. Fully connected neural network (FCNN) and other machine learning models were trained to predict CSF culture results using structured lab features. Labels were refined using clinical criteria to reduce false negatives. For ICU survival prediction, three multimodal deep learning architectures (mCNN, mFCNN, and mLSTM) were developed using two ICU survival cohorts with different inclusion criteria. The Strict ICU Survival Cohort included CSF culture results as an input feature, while the Lenient ICU Survival Cohort excluded this requirement, allowing for a broader patient population. In both cohorts, models integrated structured variables with unstructured text encoded by BioBERT, a deep contextual language model, rather than simpler methods like TF-IDF, effectively capturing clinical meaning from free-text ICD entries and chief complaints. ResultsFor CSF culture prediction (training n = 9261), the FCNN model achieved the highest performance (AUROC = 0.853) in independent validation. For ICU survival prediction in the Strict ICU Survival Cohort (training n = 5,795), the mCNN model achieved an AUROC of 0.889 in external validation. In the expanded Lenient ICU Survival Cohort (n = 58,615), the same model achieved an AUROC of 0.974 and an AUPRC of 0.868 during external validation. During model training and development, the predictive performance declined when text features were excluded (AUROC from 0983 to 0.946) or when ICD entries were converted from free-text (BioBERT-encoded) to coded format (AUROC to 0.947). ConclusionsMultimodal machine learning models, enhanced by advanced NLP through BioBERT embeddings of clinical free text, effectively predicted CSF culture results and ICU survival outcomes.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Clinical Chemistry
22 papers in training set
Top 0.1%
12.9%
2
Scientific Reports
3102 papers in training set
Top 6%
10.3%
3
Biology Methods and Protocols
53 papers in training set
Top 0.1%
6.4%
4
eBioMedicine
130 papers in training set
Top 0.1%
4.9%
5
International Journal of Medical Informatics
25 papers in training set
Top 0.3%
4.4%
6
Annals of Clinical and Translational Neurology
29 papers in training set
Top 0.2%
4.4%
7
PLOS ONE
4510 papers in training set
Top 33%
4.4%
8
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.6%
4.4%
50% of probability mass above
9
Journal of Biomedical Informatics
45 papers in training set
Top 0.4%
3.6%
10
Journal of Medical Internet Research
85 papers in training set
Top 2%
3.1%
11
Bioinformatics
1061 papers in training set
Top 6%
2.4%
12
EClinicalMedicine
21 papers in training set
Top 0.1%
2.1%
13
Critical Care Explorations
15 papers in training set
Top 0.2%
1.9%
14
BMC Medicine
163 papers in training set
Top 3%
1.7%
15
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.7%
16
Informatics in Medicine Unlocked
21 papers in training set
Top 0.4%
1.7%
17
JMIR Public Health and Surveillance
45 papers in training set
Top 2%
1.7%
18
iScience
1063 papers in training set
Top 17%
1.5%
19
Journal of Infection
71 papers in training set
Top 1%
1.5%
20
European Respiratory Journal
54 papers in training set
Top 1%
1.5%
21
npj Digital Medicine
97 papers in training set
Top 2%
1.5%
22
Frontiers in Medicine
113 papers in training set
Top 5%
1.1%
23
JMIR Medical Informatics
17 papers in training set
Top 1%
1.0%
24
Patterns
70 papers in training set
Top 2%
0.9%
25
PLOS Digital Health
91 papers in training set
Top 2%
0.8%
26
The Lancet Digital Health
25 papers in training set
Top 0.9%
0.8%
27
Journal of Neurology
26 papers in training set
Top 1%
0.8%
28
Clinical Infectious Diseases
231 papers in training set
Top 5%
0.8%
29
Acta Psychiatrica Scandinavica
10 papers in training set
Top 0.4%
0.8%
30
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.7%