Back

Contextual Embeddings from Clinical Notes Improves Prediction of Sepsis

Amrollahi, F.; Shashikumar, S.; Razmi, F.; Nemati, S.

2021-03-03 intensive care and critical care medicine
10.1101/2021.03.02.21252779 medRxiv
Show abstract

Sepsis, a life-threatening organ dysfunction, is a clinical syndrome triggered by acute infection and affects over 1 million Americans every year. Untreated sepsis can progress to septic shock and organ failure, making sepsis one of the leading causes of morbidity and mortality in hospitals. Early detection of sepsis and timely antibiotics administration is known to save lives. In this work, we design a sepsis prediction algorithm based on data from electronic health records (EHR) using a deep learning approach. While most existing EHR-based sepsis prediction models utilize structured data including vitals, labs, and clinical information, we show that incorporation of features based on clinical texts, using a pre-trained neural language representation model, allows for incorporation of unstructured data without an explicit need for ontology-based named-entity recognition and classification. The proposed model is trained on a large critical care database of over 40,000 patients, including 2805 septic patients, and is compared against competing baseline models. In comparison to a baseline model based on structured data alone, incorporation of clinical texts improved AUC from 0.81 to 0.84. Our findings indicate that incorporation of clinical text features via a pre-trained language representation model can improve early prediction of sepsis and reduce false alarms.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Journal of Biomedical Informatics
45 papers in training set
Top 0.1%
33.5%
2
Scientific Reports
3102 papers in training set
Top 2%
14.9%
3
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.2%
10.2%
50% of probability mass above
4
PLOS ONE
4510 papers in training set
Top 33%
4.4%
5
npj Digital Medicine
97 papers in training set
Top 1%
4.0%
6
Bioinformatics
1061 papers in training set
Top 5%
3.6%
7
JAMIA Open
37 papers in training set
Top 0.5%
2.8%
8
iScience
1063 papers in training set
Top 7%
2.8%
9
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.9%
10
Journal of Medical Internet Research
85 papers in training set
Top 3%
1.4%
11
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 1%
1.2%
12
International Journal of Medical Informatics
25 papers in training set
Top 1%
1.0%
13
Biology Methods and Protocols
53 papers in training set
Top 2%
0.9%
14
Heliyon
146 papers in training set
Top 5%
0.9%
15
Frontiers in Molecular Biosciences
100 papers in training set
Top 4%
0.9%
16
JMIR Medical Informatics
17 papers in training set
Top 1%
0.8%
17
Informatics in Medicine Unlocked
21 papers in training set
Top 1%
0.8%
18
Patterns
70 papers in training set
Top 2%
0.8%
19
Frontiers in Medicine
113 papers in training set
Top 7%
0.8%
20
Frontiers in Physiology
93 papers in training set
Top 6%
0.7%
21
Clinical Chemistry
22 papers in training set
Top 0.9%
0.7%
22
Artificial Intelligence in Medicine
15 papers in training set
Top 0.8%
0.7%
23
Journal of Personalized Medicine
28 papers in training set
Top 2%
0.7%
24
PLOS Digital Health
91 papers in training set
Top 3%
0.7%
25
eBioMedicine
130 papers in training set
Top 6%
0.5%