Back

Development and Evaluation of Machine Learning Models for the Detection of Emergency Department Patients with Opioid Misuse from Clinical Notes

Shahid, U.; Parde, N.; Smith, D. L.; Dickinson, G.; Bianco, J.; Thorpe, D.; Hota, M.; Afshar, M.; Karnik, N. S.; chhabra, n.

2024-12-12 emergency medicine
10.1101/2024.12.11.24318875 medRxiv
Show abstract

ObjectivesThe accurate identification of Emergency Department (ED) encounters involving opioid misuse is critical for health services, research, and surveillance. We sought to develop natural language processing (NLP)-based models for the detection of ED encounters involving opioid misuse. MethodsA sample of ED encounters enriched for opioid misuse was manually annotated and clinical notes extracted. We evaluated classic machine learning (ML) methods, fine-tuning of publicly available pretrained language models, and a previously developed convolutional neural network opioid classifier for use on hospitalized patients (SMART-AI). Performance was compared to ICD-10-CM codes. Both raw text and text transformed to the United Medical Language System were evaluated. Face validity was evaluated by term feature importance. ResultsThere were 1123 encounters used for training, validation, and testing. Of the classic ML methods, XGBoost had the highest AU_PRC (0.936), accuracy (0.887), and F1 score (0.863) which outperformed ICD-10-CM codes [accuracy 0.870; F1 0.830]. Logistic regression, support vector machine, and XGBoost models had higher AU_PRC using transformed text, while decision trees performed better using raw text. Excluding XGBoost, fine-tuned pre-trained language models outperformed classic ML methods. The best performing model was the fine-tuned SMART-AI based model with domain adaptation [AU_PRC 0.948; accuracy 0.882; F1 0.851]. Explainability analyses showed the most predictive terms were heroin, opioids, alcoholic intoxication, chronic, cocaine, opiates, and suboxone. ConclusionsNLP-based models outperform entry of ICD-10-CM diagnosis codes for the detection of ED encounters with opioid misuse. Fine tuning with domain adaptation for pre-trained language models resulted in improved performance.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
International Journal of Medical Informatics
25 papers in training set
Top 0.1%
44.3%
2
PLOS ONE
4510 papers in training set
Top 26%
6.7%
50% of probability mass above
3
Journal of Biomedical Informatics
45 papers in training set
Top 0.3%
5.2%
4
Scientific Reports
3102 papers in training set
Top 29%
4.2%
5
Journal of Medical Internet Research
85 papers in training set
Top 1%
3.8%
6
Artificial Intelligence in Medicine
15 papers in training set
Top 0.1%
3.8%
7
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.9%
3.5%
8
Cureus
67 papers in training set
Top 1%
3.1%
9
Journal of the American Medical Informatics Association
61 papers in training set
Top 1.0%
2.2%
10
Frontiers in Public Health
140 papers in training set
Top 4%
2.0%
11
JMIR Medical Informatics
17 papers in training set
Top 0.8%
1.6%
12
Emergency Medicine Journal
20 papers in training set
Top 0.3%
1.4%
13
Heliyon
146 papers in training set
Top 3%
1.4%
14
JAMIA Open
37 papers in training set
Top 1%
1.3%
15
Journal of General Internal Medicine
20 papers in training set
Top 0.6%
1.3%
16
npj Digital Medicine
97 papers in training set
Top 3%
1.0%
17
Journal of the American Heart Association
119 papers in training set
Top 3%
0.9%
18
Psychiatry Research
35 papers in training set
Top 1%
0.8%
19
Clinical Infectious Diseases
231 papers in training set
Top 4%
0.8%
20
JAMA Network Open
127 papers in training set
Top 4%
0.8%
21
BMC Health Services Research
42 papers in training set
Top 2%
0.8%
22
Frontiers in Neurology
91 papers in training set
Top 6%
0.7%
23
PLOS Digital Health
91 papers in training set
Top 3%
0.5%
24
BioData Mining
15 papers in training set
Top 1%
0.5%