Back

Characterizing Documented Psychosocial Stressors in Pediatric Psychiatric Emergencies with an Open-Weight Large Language Model

Hartlage, C. S.; Manning, E. R.; Bernard, J.; Vaish, S.; Gray, J.; Young, M.; Pestian, T.; Folger, A. T.; Tachinardi, P.; Mendonca, E. A.; Brokamp, C.

2026-06-09 health informatics
10.64898/2026.06.08.26354931 medRxiv
Show abstract

Objective: To evaluate whether a locally hosted open-weight large language model (LLM) can extract documented psychosocial factors from pediatric psychiatric intake notes and apply validated extraction to a large emergency psychiatry cohort. Materials and Methods: We identified emergency department presentations at Cincinnati Children's Hospital Medical Center from January 1, 2016, through December 31, 2024, among patients younger than 18 years with psychiatric billing diagnoses. Using full-text intake notes, gpt-oss:120b classified peer conflict, sleep disruption, and school-related academic, attendance, and disciplinary issues as detected, negated, or indeterminate. Four human raters independently reviewed 50 notes. We compared Fleiss' kappa among humans alone versus humans plus the LLM, assessed repeated-query stability across 50 independent calls per note, and applied the workflow to all eligible notes. Results: Among 37,315 eligible admissions, 22,284 had eligible intake notes; 22,270 produced parseable JSON. In detected-versus-not-detected coding, human-plus-LLM reliability did not differ significantly from human-only reliability across measures (human {kappa} 0.71-0.94; human-plus-LLM {kappa} 0.70-0.93). Stability was associated with human agreement: mean LLM-human agreement increased from 42.6% for classifications with less than 80% stability to 82.7% for classifications with 100% stability (Pearson r = 0.36). Full-cohort extraction showed frequent and overlapping documented factors: sleep disruption was most frequently detected (57.7%), followed by peer conflict (47.2%), academic issues (43.4%), disciplinary issues (43.3%), and attendance issues (16.9%). Discussion: Agreement varied by construct and was strongest when repeated model outputs were stable. Conclusion: Locally hosted open-weight LLMs can support scalable structured extraction of documented psychosocial factors from pediatric psychiatric intake notes after local validation.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.1%
40.2%
2
Journal of Medical Internet Research
85 papers in training set
Top 0.9%
4.9%
3
International Journal of Medical Informatics
25 papers in training set
Top 0.2%
4.9%
50% of probability mass above
4
JAMA Pediatrics
10 papers in training set
Top 0.1%
4.9%
5
Journal of Biomedical Informatics
45 papers in training set
Top 0.3%
4.7%
6
Scientific Reports
3102 papers in training set
Top 40%
3.1%
7
npj Digital Medicine
97 papers in training set
Top 2%
2.8%
8
Frontiers in Digital Health
20 papers in training set
Top 0.4%
2.7%
9
JAMIA Open
37 papers in training set
Top 0.6%
2.1%
10
BMC Medical Informatics and Decision Making
39 papers in training set
Top 1%
1.9%
11
PLOS ONE
4510 papers in training set
Top 52%
1.7%
12
BMC Medical Research Methodology
43 papers in training set
Top 0.6%
1.7%
13
JMIR Medical Informatics
17 papers in training set
Top 1%
1.2%
14
Frontiers in Psychiatry
83 papers in training set
Top 2%
1.2%
15
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 1%
1.1%
16
BMC Bioinformatics
383 papers in training set
Top 6%
1.0%
17
iScience
1063 papers in training set
Top 25%
0.9%
18
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.7%
0.9%
19
The Lancet Digital Health
25 papers in training set
Top 0.8%
0.9%
20
BJPsych Open
25 papers in training set
Top 0.6%
0.8%
21
Psychiatry Research
35 papers in training set
Top 1%
0.8%
22
Acta Neuropsychiatrica
12 papers in training set
Top 0.9%
0.8%
23
JAMA Network Open
127 papers in training set
Top 4%
0.8%
24
Heliyon
146 papers in training set
Top 7%
0.7%
25
JAMA Psychiatry
13 papers in training set
Top 0.6%
0.7%
26
BMC Medicine
163 papers in training set
Top 8%
0.7%
27
Journal of General Internal Medicine
20 papers in training set
Top 1%
0.7%
28
BMJ Health & Care Informatics
13 papers in training set
Top 1%
0.5%