Back

Structured large language model extraction of clinical factors from electronic health record text supports scalable psychiatric severity prediction

Stephenson, C.; Camassa, A.; Wagner, M.; Shirazi, A. H.; Alavi, N.; Omrani, M.

2026-05-13 psychiatry and clinical psychology
10.64898/2026.05.11.26352839 medRxiv
Show abstract

BackgroundMental health systems face escalating demand that exceeds clinician capacity, making accurate severity-based triage a critical bottleneck. Severity assessment guides treatment intensity, resource allocation, and risk management, yet most clinically relevant information remains embedded in unstructured electronic health record (EHR) narratives, limiting its utility for scalable decision support. ObjectivesThis study evaluates whether a single large language model (LLM) can autonomously extract clinical factors from psychiatric EHR narratives, derive predictive weights from those factors, and use the resulting structured representation to predict clinician-implied severity at scale. MethodsFrom a Mayo Clinic repository of more than 2.7 million encounters, 15,000 de-identified psychiatric notes were sampled into a 5,000-patient discovery cohort and a 10,000-patient replication cohort. The same LLM (Llama 3 8B Instruct) extracted 17 background clinical factors and 3 treatment-action factors from each note. Severity reference labels were derived from the treatment-action factors using pre-specified clinical criteria. The LLM independently derived two factor-weight dictionaries from the discovery cohort: one capturing risk-oriented predictors of severe presentations and one capturing protective predictors. Five weighting conditions were then evaluated against the severity labels: the two LLM-derived dictionaries, two controls (LLM-derived variables with randomized weights; clinically irrelevant variables with arbitrary weights), and an unweighted zero-shot baseline. Performance was assessed across 928 valid iterations in the replication cohort. ResultsLLM-derived structured conditions significantly outperformed all controls and the baseline, with statistically equivalent performance between the two structured conditions. Improvements in precision and recall were balanced, indicating gains in discriminative capacity rather than threshold shifts. The variables and weights the LLM derived as predictors of severe presentations aligned closely with established clinical determinants of psychiatric severity. ConclusionA single LLM can derive clinically meaningful factor weights from unstructured EHR narratives and use them to predict psychiatric severity at scale, supporting a viable path toward interpretable, scalable triage in resource-constrained mental health systems.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
npj Digital Medicine
97 papers in training set
Top 0.2%
19.4%
2
Acta Psychiatrica Scandinavica
10 papers in training set
Top 0.1%
7.2%
3
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.5%
6.4%
4
Journal of Medical Internet Research
85 papers in training set
Top 0.8%
6.3%
5
JAMA Network Open
127 papers in training set
Top 0.6%
4.8%
6
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.6%
4.3%
7
Frontiers in Psychiatry
83 papers in training set
Top 1%
3.6%
50% of probability mass above
8
Nature Medicine
117 papers in training set
Top 0.8%
3.6%
9
Scientific Reports
3102 papers in training set
Top 49%
2.1%
10
Journal of Biomedical Informatics
45 papers in training set
Top 0.6%
2.1%
11
European Psychiatry
10 papers in training set
Top 0.2%
2.1%
12
PLOS ONE
4510 papers in training set
Top 48%
2.1%
13
BioData Mining
15 papers in training set
Top 0.2%
2.1%
14
Journal of Affective Disorders
81 papers in training set
Top 0.9%
1.9%
15
JAMIA Open
37 papers in training set
Top 0.7%
1.9%
16
JAMA Psychiatry
13 papers in training set
Top 0.2%
1.8%
17
Psychiatry Research
35 papers in training set
Top 0.9%
1.7%
18
Frontiers in Digital Health
20 papers in training set
Top 0.6%
1.7%
19
Nature Communications
4913 papers in training set
Top 52%
1.7%
20
Schizophrenia Bulletin
29 papers in training set
Top 0.4%
1.7%
21
Acta Neuropsychiatrica
12 papers in training set
Top 0.6%
1.3%
22
Journal of General Internal Medicine
20 papers in training set
Top 0.9%
0.8%
23
American Journal of Psychiatry
20 papers in training set
Top 0.5%
0.7%
24
Translational Psychiatry
219 papers in training set
Top 4%
0.7%
25
BJPsych Open
25 papers in training set
Top 0.7%
0.7%
26
Biological Psychiatry
119 papers in training set
Top 2%
0.7%
27
JMIR Formative Research
32 papers in training set
Top 2%
0.7%
28
BMC Medicine
163 papers in training set
Top 7%
0.7%
29
JAMA Pediatrics
10 papers in training set
Top 0.2%
0.7%
30
The British Journal of Psychiatry
21 papers in training set
Top 1%
0.6%