Back

OphthaBERT: Automated Glaucoma Diagnosis from Clinical Notes

Shah, R.; Moradi, M.; Eslami, S.; Fujita, A.; Aziz, K.; Bineshfar, N.; Elze, T.; Eslami, M.; Kazeminasab, S.; Liebman, D.; Rasouli, S.; Vu, D.; Wang, M.; Yohannan, J.; Zebardast, N.

2025-06-09 ophthalmology
10.1101/2025.06.08.25329151 medRxiv
Show abstract

Glaucoma is a leading cause of irreversible blindness worldwide, with early intervention often being crucial. Research into the underpinnings of glaucoma often relies on electronic health records (EHRs) to identify patients with glaucoma and their subtypes. However, current methods for identifying glaucoma patients from EHRs are often inaccurate or infeasible at scale, relying on International Classification of Diseases (ICD) codes or manual chart reviews. To address this limitation, we introduce (1) OphthaBERT, a powerful general clinical ophthalmology language model trained on over 2 million diverse clinical notes, and (2) a fine-tuned variant of OphthaBERT that automatically extracts binary and subtype glaucoma diagnoses from clinical notes. The base OphthaBERT model is a robust encoder, outperforming state-of-the-art clinical encoders in masked token prediction on out-of-distribution ophthalmology clinical notes and binary glaucoma classification with limited data. We report significant binary classification performance improvements in low-data regimes (p < 0.001, Bonferroni corrected). OphthaBERT is also able to achieve superior classification performance for both binary and subtype diagnosis, outperforming even fine-tuned large decoder-only language models at a fraction of the computational cost. We demonstrate a 0.23-point increase in macro-F1 for subtype diagnosis over ICD codes and strong binary classification performance when externally validated at Wilmer Eye Institute. OphthaBERT provides an interpretable, equitable framework for general ophthalmology language modeling and automated glaucoma diagnosis.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Nature Machine Intelligence
61 papers in training set
Top 0.1%
23.6%
2
Ophthalmology Science
20 papers in training set
Top 0.1%
13.3%
3
npj Digital Medicine
97 papers in training set
Top 0.4%
10.6%
4
Scientific Reports
3102 papers in training set
Top 15%
6.7%
50% of probability mass above
5
PLOS ONE
4510 papers in training set
Top 30%
5.1%
6
Nature Communications
4913 papers in training set
Top 31%
5.1%
7
British Journal of Ophthalmology
14 papers in training set
Top 0.1%
3.9%
8
Communications Biology
886 papers in training set
Top 3%
2.9%
9
PLOS Digital Health
91 papers in training set
Top 0.9%
2.9%
10
Medical Image Analysis
33 papers in training set
Top 0.5%
2.2%
11
Nature Medicine
117 papers in training set
Top 2%
1.7%
12
Advanced Science
249 papers in training set
Top 16%
0.9%
13
British Journal of Cancer
42 papers in training set
Top 1%
0.9%
14
Brain
154 papers in training set
Top 4%
0.9%
15
Eye
11 papers in training set
Top 0.3%
0.8%
16
Science Translational Medicine
111 papers in training set
Top 5%
0.8%
17
Clinical and Translational Medicine
30 papers in training set
Top 0.8%
0.8%
18
eLife
5422 papers in training set
Top 54%
0.8%
19
Cell Systems
167 papers in training set
Top 11%
0.8%
20
Journal of Neural Engineering
197 papers in training set
Top 2%
0.8%
21
Journal of Medical Internet Research
85 papers in training set
Top 4%
0.8%
22
npj Genomic Medicine
33 papers in training set
Top 0.9%
0.8%
23
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 46%
0.7%
24
Bioinformatics
1061 papers in training set
Top 10%
0.5%
25
Nature Human Behaviour
85 papers in training set
Top 5%
0.5%
26
Genome Medicine
154 papers in training set
Top 10%
0.5%