Back

Predicting bladder cancer molecular subtypes linked to bacillus Calmette-Guerin response from histology images using deep learning

Khoraminia, F.; Olislagers, M.; de Jong, F. C.; Akram, F.; Nakauma Gonzalez, A.; Lichtenberg, D.; Stubbs, A.; Costello, J. C.; Rijstenberg, L.; van Leenders, G. J. L. H.; Vrieling, A.; Aben, K. K. H.; Kiemeney, L. A. L. M.; Hoedemaeker, R. F.; Bangma, C. H.; Vermeulen, S.; Litjens, G.; Khalili, N.; Zuiverloon, T. C. M.

2026-05-06 oncology
10.64898/2026.05.05.26352375 medRxiv
Show abstract

Background and objectiveHighrisk nonmuscleinvasive bladder cancer (HRNMIBC) is treated with transurethral resection and intravesical BCG instillations, yet {approx}50% recur and 20% progress to invasive disease. Although molecular subtyping, e.g., BCG-response-subtype (BRS), is associated with progression risk and may aid risk stratification, yet is costly and time-consuming. Intratumoral heterogeneity complicates accurate subtyping. To address these challenges, we developed a deep-learning model that predicts BRS from routine hematoxylin-eosin-stained images. We verified the models area-by-area predictions against tissue-level gene-expression maps. Methods and participantsHematoxylin-eosin-stained images from 231 HR-NMIBC patients with known BRS were used to develop a deep-learning model through cross-validation, then validated in 83 independent samples. The models spatial predictions were assessed using spatial transcriptomics to map gene expression to tissue locations in five HR-NMIBC tumors. Outcome measurements and statistical analysisDiscriminative ability for BRS3 vs. BRS1/2 was measured by AUC. Spatial alignment was assessed by calculating Pearson and Spearman correlation coefficients between model predictions and BRS fractions; significance was assessed through permutation analysis. Key findings and limitationsThe trained algorithm achieved AUC of 0.79 (development) and 0.71 (external) to detect BRS3 vs BRS1/2. Tile-level correlation between model output and molecular labels was significant (Pearson r = 0.33-0.44; p [≤] 0.002). Limitations include retrospective sampling and limited spatial transcriptomic cases. Conclusions and clinical implicationsOur trained algorithm showed potential to stratify HRNMIBC patients by clinically relevant BCGresponse subtypes using routine hematoxylin-eosin-stained images and showed predicted spatial heterogeneity comparable to molecular profiling. Prospective validation is required before any clinical implementation. Patient summaryStandard pathology images contain hidden details related to tumors molecular subtype. We trained an AI model to read these routine images and identify specific bladder cancer subtypes associated with poor response to BCG therapy. This approach may help reveal molecular subtype-associated information from routine pathology images, without additional laboratory procedures.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
British Journal of Cancer
42 papers in training set
Top 0.1%
10.2%
2
Scientific Reports
3102 papers in training set
Top 8%
9.3%
3
Cancer Epidemiology, Biomarkers & Prevention
17 papers in training set
Top 0.1%
6.9%
4
PLOS ONE
4510 papers in training set
Top 26%
6.5%
5
Clinical Cancer Research
58 papers in training set
Top 0.2%
6.4%
6
Frontiers in Oncology
95 papers in training set
Top 0.7%
4.9%
7
Cancers
200 papers in training set
Top 1%
4.2%
8
BMJ Open
554 papers in training set
Top 5%
3.6%
50% of probability mass above
9
BMC Medicine
163 papers in training set
Top 1%
3.6%
10
Biology Methods and Protocols
53 papers in training set
Top 0.3%
3.3%
11
Journal of Translational Medicine
46 papers in training set
Top 0.3%
2.8%
12
Diagnostics
48 papers in training set
Top 0.6%
2.5%
13
PeerJ
261 papers in training set
Top 7%
1.7%
14
BMC Bioinformatics
383 papers in training set
Top 5%
1.5%
15
JAMA Network Open
127 papers in training set
Top 2%
1.5%
16
BMC Cancer
52 papers in training set
Top 2%
1.4%
17
Frontiers in Immunology
586 papers in training set
Top 5%
1.4%
18
Frontiers in Bioinformatics
45 papers in training set
Top 0.3%
1.4%
19
Journal of Magnetic Resonance Imaging
14 papers in training set
Top 0.4%
1.2%
20
PLOS Computational Biology
1633 papers in training set
Top 21%
1.0%
21
EClinicalMedicine
21 papers in training set
Top 0.6%
1.0%
22
Modern Pathology
21 papers in training set
Top 0.3%
1.0%
23
Journal of Pathology Informatics
13 papers in training set
Top 0.3%
1.0%
24
Clinical Chemistry
22 papers in training set
Top 0.6%
0.9%
25
The Journal of Pharmacology and Experimental Therapeutics
15 papers in training set
Top 0.3%
0.9%
26
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.9%
0.7%
27
eBioMedicine
130 papers in training set
Top 5%
0.7%
28
JCO Precision Oncology
14 papers in training set
Top 0.4%
0.7%
29
European Journal of Cancer
10 papers in training set
Top 0.6%
0.7%
30
Nature Communications
4913 papers in training set
Top 66%
0.7%