Predicting clozapine initiation among patients with schizophrenia via machine learning trained on electronic health record data

Perfalk, E.; Damgaard, J. G.; Danielsen, A. A.; Ostergaard, S. D.

2026-04-20 psychiatry and clinical psychology
10.64898/2026.04.17.26351083 medRxiv

Background and Hypothesis: Clozapine is the only medication with proven efficacy for treatment-resistant schizophrenia, yet many patients experience delays of several years before initiation. Our aim was to develop and validate a dynamic prediction model for clozapine initiation among patients with schizophrenia, trained solely on electronic health record (EHR) data from routine clinical practice.

Study Design: EHR data were retrieved for all adults (≥18 years) with a diagnosis of schizophrenia (ICD-10: F20) or schizoaffective disorder (ICD-10: F25) who had been in contact with the Psychiatric Services of the Central Denmark Region between 1 January 2013 and 1 June 2024. We engineered 179 structured predictors (covering, e.g., diagnoses, medications, and coercive measures) and derived 750 predictors from clinical notes. At every psychiatric hospital visit, we predicted whether an incident clozapine prescription would occur within the next 365 days. XGBoost and logistic regression models were trained on 85% of the data with 5-fold stratified cross-validation. Performance was evaluated on the remaining 15% of the data (held out) using the area under the receiver operating characteristic curve (AUROC).

Study Results: The training/test sets comprised 194,234/35,527 hospital visits, distributed across 4,928/878 unique patients. In the test set, the best XGBoost model achieved an AUROC of 0.81, with a sensitivity of 32% and a positive predictive value of 23% at a 7.5% predicted positive rate.

Conclusions: A dynamic prediction model based solely on EHR data predicts clozapine initiation with high discrimination. If implemented as a clinical decision support tool, this model may guide clinicians towards more timely initiation of clozapine treatment.
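The abstract reports sensitivity and positive predictive value at a fixed predicted positive rate, alongside AUROC. The sketch below shows one common way such metrics can be computed. It is not the authors' code: the function names `metrics_at_positive_rate` and `auroc` are hypothetical, the data are toy values, and flagging the top-scoring fraction of visits is an assumption about how the predicted positive rate is applied.

```python
from typing import Sequence, Tuple


def metrics_at_positive_rate(
    y_true: Sequence[int], y_score: Sequence[float], positive_rate: float
) -> Tuple[float, float]:
    """Flag the highest-scoring fraction of visits and return (sensitivity, PPV).

    Assumption: the predicted positive rate is enforced by flagging the
    top `positive_rate` share of scores.
    """
    n = len(y_true)
    if n == 0 or len(y_score) != n:
        raise ValueError("y_true and y_score must be non-empty and equal length")
    n_flagged = max(1, round(positive_rate * n))
    # Indices sorted by descending predicted score.
    order = sorted(range(n), key=lambda i: y_score[i], reverse=True)
    flagged = order[:n_flagged]
    true_positives = sum(y_true[i] for i in flagged)
    total_positives = sum(y_true)
    sensitivity = true_positives / total_positives if total_positives else float("nan")
    ppv = true_positives / n_flagged
    return sensitivity, ppv


def auroc(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """AUROC as the share of (positive, negative) pairs ranked correctly.

    Ties count as half. Quadratic in the number of samples, so this is
    only suitable for small illustrative inputs.
    """
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


# Toy example: 10 visits, 2 of which precede a clozapine prescription.
labels = [1, 0, 1, 0, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
sens, ppv = metrics_at_positive_rate(labels, scores, positive_rate=0.2)
print(sens, ppv, auroc(labels, scores))
```

With a fixed predicted positive rate, the number of flagged visits is set in advance, so sensitivity and PPV describe what clinicians would see at that alert volume, while AUROC summarises ranking quality across all thresholds.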

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1. Schizophrenia Bulletin; 29 papers in training set; Top 0.1%; 22.6%
2. The British Journal of Psychiatry; 21 papers in training set; Top 0.1%; 12.6%
3. Schizophrenia; 19 papers in training set; Top 0.1%; 6.9%
4. Acta Psychiatrica Scandinavica; 10 papers in training set; Top 0.1%; 6.4%
5. European Psychiatry; 10 papers in training set; Top 0.1%; 3.6%

50% of probability mass above

6. Schizophrenia Research; 29 papers in training set; Top 0.2%; 3.6%
7. Frontiers in Psychiatry; 83 papers in training set; Top 1%; 3.6%
8. Translational Psychiatry; 219 papers in training set; Top 2%; 3.1%
9. JAMA Psychiatry; 13 papers in training set; Top 0.1%; 3.1%
10. PLOS ONE; 4510 papers in training set; Top 50%; 1.9%
11. Psychiatry Research; 35 papers in training set; Top 0.8%; 1.9%
12. Acta Neuropsychiatrica; 12 papers in training set; Top 0.4%; 1.7%
13. Biological Psychiatry; 119 papers in training set; Top 2%; 1.7%
14. Epidemiology and Psychiatric Sciences; 10 papers in training set; Top 0.1%; 1.7%
15. Psychological Medicine; 74 papers in training set; Top 1%; 1.5%
16. BMC Medicine; 163 papers in training set; Top 4%; 1.3%
17. BMJ Mental Health; 15 papers in training set; Top 0.3%; 1.2%
18. Molecular Psychiatry; 242 papers in training set; Top 2%; 1.2%
19. npj Digital Medicine; 97 papers in training set; Top 3%; 1.2%
20. Neuropsychopharmacology; 134 papers in training set; Top 2%; 1.2%
21. BMC Psychiatry; 22 papers in training set; Top 0.5%; 1.0%
22. JMIR Public Health and Surveillance; 45 papers in training set; Top 3%; 0.9%
23. BMC Medical Informatics and Decision Making; 39 papers in training set; Top 2%; 0.8%
24. Journal of Psychopharmacology; 14 papers in training set; Top 0.6%; 0.8%
25. BMJ Open; 554 papers in training set; Top 13%; 0.8%
26. BJPsych Open; 25 papers in training set; Top 0.7%; 0.8%
27. Progress in Neuro-Psychopharmacology and Biological Psychiatry; 36 papers in training set; Top 1.0%; 0.8%
28. Scientific Reports; 3102 papers in training set; Top 74%; 0.8%
29. Frontiers in Pharmacology; 100 papers in training set; Top 5%; 0.8%
30. Journal of Medical Internet Research; 85 papers in training set; Top 5%; 0.7%