Back

Artificial intelligence for detecting bipolar disorder in electronic health records of patients with affective diagnoses: a diagnostic accuracy study

Ferro, E.; Gomez-Puentes, A. M.; Castano-Villegas, N.; Monsalve Barrientos, K.; Torres-Delgado, C.; Ortiz, L.; Esteban Cardenas, M. F.; Zea, J.

2026-05-10 psychiatry and clinical psychology
10.64898/2026.05.07.26352679 medRxiv
Show abstract

BackgroundBipolar disorder (BD) is frequently underdiagnosed, particularly in patients presenting with depressive disorders, leading to delays in appropriate treatment. Artificial intelligence (AI) applied to electronic health records (EHRs) may improve early detection by identifying clinically relevant symptom patterns. ObjectiveTo evaluate the diagnostic performance of a natural language processing (NLP)-based AI model for detecting BD-related features in EHRs of patients with affective diagnoses. MethodsA retrospective diagnostic accuracy study was conducted using 500 EHRs from a psychiatric referral hospital in Bogota, Colombia (2020-2024). The model extracted 18 predefined clinical domains from unstructured text and classified patients into four risk categories. Diagnostic performance was assessed in a validation subset of 100 records using independent psychiatric evaluation as the reference standard. Sensitivity, specificity, positive and negative predictive values, F1-score, and area under the receiver operating characteristic curve (AUC-ROC) were calculated. ResultsThe model achieved high agreement in symptom extraction (mean 91.1%). Sensitivity was 96.4% (95% CI: 87.7%-99.0%) and specificity was 84.4% (95% CI: 71.2%-92.3%), with an F1-score of 0.92 and an AUC-ROC of 0.932 (95% CI: 0.881-0.975). A substantial proportion of patients with depressive diagnoses were identified as having confirmed BD or clinically relevant risk. The model analyzed complete EHRs 120 times faster than human reviewers. ConclusionsNLP-based analysis of EHRs can achieve clinically meaningful performance in identifying BD-related patterns while substantially reducing review time. The model may be useful as a clinical decision support tool for earlier identification of bipolar disorder.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Frontiers in Psychiatry
83 papers in training set
Top 0.1%
14.5%
2
Acta Psychiatrica Scandinavica
10 papers in training set
Top 0.1%
12.2%
3
Journal of Affective Disorders
81 papers in training set
Top 0.2%
10.0%
4
Acta Neuropsychiatrica
12 papers in training set
Top 0.1%
9.0%
5
Psychiatry Research
35 papers in training set
Top 0.2%
7.1%
50% of probability mass above
6
PLOS ONE
4510 papers in training set
Top 28%
6.3%
7
Journal of Medical Internet Research
85 papers in training set
Top 1%
4.3%
8
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.6%
4.3%
9
JMIR Formative Research
32 papers in training set
Top 0.6%
2.1%
10
European Psychiatry
10 papers in training set
Top 0.3%
1.9%
11
Scientific Reports
3102 papers in training set
Top 60%
1.6%
12
npj Digital Medicine
97 papers in training set
Top 2%
1.5%
13
Frontiers in Digital Health
20 papers in training set
Top 0.8%
1.5%
14
BJPsych Open
25 papers in training set
Top 0.5%
1.3%
15
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.4%
1.3%
16
JMIRx Med
31 papers in training set
Top 1.0%
1.3%
17
Translational Psychiatry
219 papers in training set
Top 3%
1.3%
18
BioData Mining
15 papers in training set
Top 0.8%
0.8%
19
JMIR Public Health and Surveillance
45 papers in training set
Top 3%
0.8%
20
Contemporary Clinical Trials Communications
11 papers in training set
Top 0.6%
0.7%
21
NeuroImage: Clinical
132 papers in training set
Top 4%
0.7%
22
Frontiers in Public Health
140 papers in training set
Top 8%
0.7%
23
Journal of Affective Disorders Reports
10 papers in training set
Top 0.4%
0.7%
24
Journal of Clinical Microbiology
120 papers in training set
Top 2%
0.6%
25
Brain and Behavior
37 papers in training set
Top 2%
0.6%