
Evaluating OCT Device-Reported Image Quality Score: Towards a Task-Specific Quality Gate for Deep Learning-based Outer-Retina and Choroid Boundary Segmentation

Gadari, A.; Vichare, A. A.; Corona, F.; Vupparaboina, S. C.; Lall, S. R.; Gregori, G.; Hasan, N.; Sahel, J.-A.; Chhablani, J.; Bollepalli, S. C.; Vupparaboina, K. K.

2026-05-20 ophthalmology
10.64898/2026.05.17.26353399 medRxiv

Manufacturer-defined signal-strength indices are frequently employed as quality benchmarks for automated optical coherence tomography analysis, yet their empirical relationship with deep learning segmentation accuracy remains unclear. Because these metrics were originally developed for conventional image-processing pipelines, their ability to predict modern model-based segmentation accuracy has not been empirically validated. To address this gap, we evaluated the Heidelberg Spectralis Q-score against U-Net segmentation performance across 5,047 B-scans from 103 eyes for three anatomical boundaries of the posterior segment of the eye: the Ellipsoid Zone (EZ), Bruch's Membrane (BM), and Choroid Outer Boundary (COB). Alongside standard boundary agreement metrics (MAE, MSE, Dice Similarity Coefficient), we adapted the Earth Mover's Distance (EMD) from optimal transport theory as a boundary evaluation metric. Unlike column-wise averages, EMD quantifies boundary agreement as a 2-D geometric displacement, directly measuring the residual spatial displacement between the model-segmented boundary and the ground-truth boundary. Our results demonstrate that the Q-score, originally designed to gate image-processing-based automated analysis, is a poor predictor of deep learning boundary segmentation accuracy, with explained variance (R²) failing to exceed 1.4% across all three boundaries. We further observed a monotonically increasing error hierarchy with anatomical depth (EZ < BM < COB), consistent across metrics, which signal strength does not explain. At the COB, correlations were paradoxically positive, explained by a B-scan-level mediation chain: higher Q-scores correspond to greater choroidal thickness (r=0.113, ρ=0.158), which in turn predicts higher COB segmentation error (r=0.165, ρ=0.191), a localization difficulty that global signal strength cannot capture.
Collectively, these findings challenge the implicit assumption that signal-strength-based quality thresholds are a reliable proxy for deep learning model performance, and motivate a shift toward task-specific acquisition quality criteria calibrated to model performance rather than signal interpretability.
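
The contrast the abstract draws between column-wise averages and a 2-D displacement metric can be illustrated with a small Python sketch. The abstract does not give the authors' EMD formulation, so everything here is an assumption made for illustration: each boundary is one depth value per A-scan column, every boundary point carries equal weight, the ground distance is Euclidean in pixel units, and the function names `column_mae` and `emd_2d` are hypothetical. With equal-sized, equal-weight point sets the EMD reduces to the cheapest one-to-one matching, which this sketch finds by brute force, so it is only usable on a handful of points.

```python
import itertools
import math


def column_mae(pred, truth):
    """Column-wise metric: mean absolute depth difference per A-scan column."""
    return sum(abs(p - t) for p, t in zip(pred, truth)) / len(truth)


def emd_2d(pred, truth):
    """EMD between two boundaries treated as uniform-weight 2-D point sets.

    Assumes both boundaries have the same number of columns. Every
    one-to-one matching is tried (factorial cost), so keep inputs tiny;
    a real pipeline would need an assignment or optimal-transport solver.
    """
    pts_pred = list(enumerate(pred))    # (column, depth) pairs
    pts_true = list(enumerate(truth))
    n = len(pts_true)
    best_total = min(
        sum(math.dist(pts_pred[i], pts_true[j]) for i, j in enumerate(perm))
        for perm in itertools.permutations(range(n))
    )
    return best_total / n               # mean displacement per boundary point


# Toy ground-truth boundary (depth in pixels) and a prediction offset by 3 px.
truth = [10.0, 12.0, 15.0, 14.0, 11.0]
shifted = [t + 3.0 for t in truth]

print(column_mae(shifted, truth))  # 3.0 for a pure vertical offset
print(emd_2d(shifted, truth))      # also 3.0: matching each point to itself is optimal
```

For a pure vertical offset the two metrics agree. They can differ when the error has a lateral component, because the matching in `emd_2d` may pair a predicted point with a ground-truth point in a different column.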

Matching journals

The top 5 journals together account for just over 50% (54.4%) of the predicted probability mass.

1. Ophthalmology Science, 20 papers in training set, Top 0.1%, 22.5%
2. Scientific Reports, 3102 papers in training set, Top 4%, 12.3%
3. Translational Vision Science & Technology, 35 papers in training set, Top 0.2%, 6.8%
4. Medical Image Analysis, 33 papers in training set, Top 0.2%, 6.4%
5. Biomedical Optics Express, 84 papers in training set, Top 0.3%, 6.4%
50% of probability mass above
6. PLOS ONE, 4510 papers in training set, Top 31%, 4.8%
7. Communications Biology, 886 papers in training set, Top 2%, 3.6%
8. British Journal of Ophthalmology, 14 papers in training set, Top 0.1%, 3.6%
9. npj Digital Medicine, 97 papers in training set, Top 1%, 3.6%
10. eLife, 5422 papers in training set, Top 35%, 2.1%
11. NeuroImage, 813 papers in training set, Top 3%, 2.1%
12. PLOS Digital Health, 91 papers in training set, Top 1%, 2.1%
13. Investigative Ophthalmology & Visual Science, 37 papers in training set, Top 0.3%, 1.9%
14. Journal of Vision, 92 papers in training set, Top 0.3%, 1.8%
15. Eye, 11 papers in training set, Top 0.3%, 1.7%
16. Frontiers in Neuroscience, 223 papers in training set, Top 4%, 1.7%
17. Investigative Ophthalmology & Visual Science, 22 papers in training set, Top 0.2%, 1.7%
18. Nature Machine Intelligence, 61 papers in training set, Top 2%, 1.5%
19. Computers in Biology and Medicine, 120 papers in training set, Top 3%, 1.2%
20. Nature Communications, 4913 papers in training set, Top 58%, 1.1%
21. Journal of Clinical Medicine, 91 papers in training set, Top 6%, 0.8%
22. Neurophotonics, 37 papers in training set, Top 0.6%, 0.7%
23. Journal of Neural Engineering, 197 papers in training set, Top 2%, 0.7%
24. Human Brain Mapping, 295 papers in training set, Top 5%, 0.7%
25. Experimental Eye Research, 30 papers in training set, Top 0.6%, 0.7%
26. Journal of Biomedical Optics, 25 papers in training set, Top 0.8%, 0.6%