Predicting clozapine initiation among patients with schizophrenia via machine learning trained on electronic health record data
Perfalk, E.; Damgaard, J. G.; Danielsen, A. A.; Ostergaard, S. D.
Show abstract
Background and HypothesisClozapine is the only medication with proven efficacy for treatment-resistant schizophrenia, yet many patients experience delays of several years before initiation. Our aim was to develop and validate a dynamic prediction model for clozapine initiation among patients with schizophrenia trained solely on electronic health record (EHR) data from routine clinical practice. Study DesignEHR data from all adults ([≥] 18 years) with a schizophrenia (ICD10: F20) or schizoaffective disorder (ICD10: F25) diagnosis who had been in contact with the Psychiatric Services of the Central Denmark Region between 1 January 2013 and 1 June 2024 were retrieved. 179 structured predictors were engineered (covering, e.g.,diagnoses, medications, coercive measures) and 750 predictors derived from clinical notes. At every psychiatric hospital visit, we predicted if an incident clozapine prescription occured within the next 365 days. XGBoost and logistic regression models were trained on 85% of the data with 5-fold stratified cross-validation. Performance was evaluated on the remaining 15% of the data (held out) using the area under the receiver operating characteristic curve (AUROC). Study ResultsThe training/test set comprised of 194,234/35,527 hospital visits, distributed on 4928/878 unique patients. In the test set, the best XGBoost model achieved an AUROC of 0.81, sensitivity of 32%, positive predictive value of 23% at a 7.5% predicted positive rate. ConclusionsA dynamic prediction model based solely on EHR data predicts clozapine initiation with high discrimination. If implemented as a clinical decision support tool, this model may guide clinicians towards more timely initiation of clozapine treatment.
Matching journals
The top 5 journals account for 50% of the predicted probability mass.