Back

Leveraging a Large Language Model to Assess Quality-of-Care: Monitoring ADHD Medication Side Effects

Bannett, Y.; Gunturkun, F.; Pillai, M.; Herrmann, J. E.; Luo, I.; Huffman, L. C.; Feldman, H. M.

2024-04-24 health informatics
10.1101/2024.04.23.24306225 medRxiv
Show abstract

ObjectiveTo assess the accuracy of a large language model (LLM) in measuring clinician adherence to practice guidelines for monitoring side effects after prescribing medications for children with attention-deficit/hyperactivity disorder (ADHD). MethodsRetrospective population-based cohort study of electronic health records. Cohort included children aged 6-11 years with ADHD diagnosis and >2 ADHD medication encounters (stimulants or non-stimulants prescribed) between 2015-2022 in a community-based primary healthcare network (n=1247). To identify documentation of side effects inquiry, we trained, tested, and deployed an open-source LLM (LLaMA) on all clinical notes from ADHD-related encounters (ADHD diagnosis or ADHD medication prescription), including in-clinic/telehealth and telephone encounters (n=15,593 notes). Model performance was assessed using holdout and deployment test sets, compared to manual chart review. ResultsThe LLaMA model achieved excellent performance in classifying notes that contain side effects inquiry (sensitivity= 87.2%, specificity=86.3/90.3%, area under curve (AUC)=0.93/0.92 on holdout/deployment test sets). Analyses revealed no model bias in relation to patient age, sex, or insurance. Mean age (SD) at first prescription was 8.8 (1.6) years; patient characteristics were similar across patients with and without documented side effects inquiry. Rates of documented side effects inquiry were lower in telephone encounters than in-clinic/telehealth encounters (51.9% vs. 73.0%, p<0.01). Side effects inquiry was documented in 61% of encounters following stimulant prescriptions and 48% of encounters following non-stimulant prescriptions (p<0.01). ConclusionsDeploying an LLM on a variable set of clinical notes, including telephone notes, offered scalable measurement of quality-of-care and uncovered opportunities to improve psychopharmacological medication management in primary care.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.1%
49.8%
2
npj Digital Medicine
97 papers in training set
Top 1%
4.3%
50% of probability mass above
3
Journal of Medical Internet Research
85 papers in training set
Top 1%
3.6%
4
JAMA Pediatrics
10 papers in training set
Top 0.1%
3.6%
5
JAMIA Open
37 papers in training set
Top 0.7%
1.9%
6
JAMA Network Open
127 papers in training set
Top 2%
1.9%
7
International Journal of Medical Informatics
25 papers in training set
Top 0.9%
1.7%
8
PLOS ONE
4510 papers in training set
Top 54%
1.7%
9
The Lancet Digital Health
25 papers in training set
Top 0.5%
1.5%
10
Pharmacoepidemiology and Drug Safety
13 papers in training set
Top 0.2%
1.5%
11
Journal of Biomedical Informatics
45 papers in training set
Top 1.0%
1.3%
12
BMJ Open
554 papers in training set
Top 10%
1.3%
13
JMIR Public Health and Surveillance
45 papers in training set
Top 3%
1.2%
14
British Journal of General Practice
22 papers in training set
Top 0.4%
1.1%
15
JMIR Formative Research
32 papers in training set
Top 1%
0.9%
16
Journal of Personalized Medicine
28 papers in training set
Top 0.9%
0.9%
17
Frontiers in Digital Health
20 papers in training set
Top 1%
0.9%
18
JMIR Medical Informatics
17 papers in training set
Top 1%
0.8%
19
BMC Medical Research Methodology
43 papers in training set
Top 1%
0.8%
20
BJPsych Open
25 papers in training set
Top 0.7%
0.8%
21
BMJ Mental Health
15 papers in training set
Top 0.4%
0.7%
22
Journal of General Internal Medicine
20 papers in training set
Top 1%
0.7%
23
Autism Research
32 papers in training set
Top 0.4%
0.7%
24
Psychiatry Research
35 papers in training set
Top 1%
0.7%
25
Journal of the American Geriatrics Society
12 papers in training set
Top 0.2%
0.7%
26
Clinical Infectious Diseases
231 papers in training set
Top 5%
0.7%
27
BioData Mining
15 papers in training set
Top 0.9%
0.7%
28
BMC Medicine
163 papers in training set
Top 8%
0.6%
29
PLOS Digital Health
91 papers in training set
Top 3%
0.6%
30
BMC Medical Informatics and Decision Making
39 papers in training set
Top 3%
0.6%