Back

Histopathology-inferred spatial transcriptomics characterizes the tumor microenvironment in 1,500 head and neck tumors and predicts clinical outcomes

Biswas, S.; Patiyal, S.; Chen, T.-H.; Stemmer, A.; Dhruba, S. R.; Mukherjee, S.; Cantore, T.; Shulman, E. D.; Campagnolo, E.; Jenkins, B. H.; Tai, S.-K.; Chu, P.-Y.; Kuo, Y.-J.; Yeh, Y.-C.; Day, C.-P.; Hanley, C. J.; Thomas, G. J.; Yang, M.-H.; Hoang, D.-T.; Ruppin, E.

2026-05-19 cancer biology
10.64898/2026.05.16.725687 bioRxiv
Show abstract

Head and neck squamous cell carcinoma (HNSC) is a prevalent malignancy associated with poor prognosis despite recent therapeutic advances. We hypothesized that a comprehensive understanding of the spatial heterogeneity and organization of the tumor microenvironment (TME) can substantially improve risk stratification and prediction of treatment response in HNSC. As spatial transcriptomics (ST) remains labor-intensive and costly, we developed HEiST (H&E-Inferred Spatial Transcriptomics), a deep learning framework that predicts spatially resolved gene expression profiles directly from routine hematoxylin and eosin (H&E)-stained histology slides. After rigorous validation across two independent external ST cohorts, we applied HEiST to infer spatial transcriptomes across 1,500 HNSC patient tumors spanning two publicly available datasets and two newly generated cohorts, one treated with concurrent chemoradiotherapy (CCRT) and one with immunotherapy. This large-scale analysis uncovered reproducible spatial clusters characterizing the HNSC TME, defining two distinct prognostic Spatiotypes, Immune-Exhausted and Immune-Activated, with significantly distinct survival outcomes. Critically, spatial cluster composition accurately predicts HPV status and yields treatment response predictors for both CCRT/radiotherapy and immunotherapy that outperform costly gene-expression and direct image-based approaches. Notably, the ST cluster-based predictor of immunotherapy response markedly surpasses the performance of commonly used FDA-approved biomarkers, including CPS, TPS, and their combination. To the best of our knowledge, this represents the first virtual spatial profiling effort and the most comprehensive large-scale spatial TME analysis in HNSC to date. HEiST thus introduces a scalable, low-cost, and spatially grounded biomarker discovery for precision oncology in HNSC.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 16%
10.5%
2
Cell Reports Medicine
140 papers in training set
Top 0.2%
8.5%
3
Cancer Research
116 papers in training set
Top 0.2%
6.9%
4
Nature Cancer
35 papers in training set
Top 0.1%
6.4%
5
Clinical Cancer Research
58 papers in training set
Top 0.2%
6.4%
6
Genome Medicine
154 papers in training set
Top 1%
4.9%
7
Cancer Cell
38 papers in training set
Top 0.2%
4.9%
8
Advanced Science
249 papers in training set
Top 4%
4.2%
50% of probability mass above
9
Science Advances
1098 papers in training set
Top 8%
3.1%
10
Cell Reports
1338 papers in training set
Top 19%
2.6%
11
Cancer Discovery
61 papers in training set
Top 0.8%
2.6%
12
Nature Medicine
117 papers in training set
Top 2%
2.1%
13
Molecular Cancer
14 papers in training set
Top 0.2%
1.9%
14
Cell Systems
167 papers in training set
Top 7%
1.7%
15
Cancer Research Communications
46 papers in training set
Top 0.5%
1.5%
16
npj Precision Oncology
48 papers in training set
Top 0.7%
1.5%
17
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 36%
1.3%
18
Science Translational Medicine
111 papers in training set
Top 4%
1.3%
19
Nature Biomedical Engineering
42 papers in training set
Top 1%
1.3%
20
npj Digital Medicine
97 papers in training set
Top 3%
1.1%
21
Communications Biology
886 papers in training set
Top 17%
1.0%
22
The Innovation
12 papers in training set
Top 0.6%
1.0%
23
ACS Nano
99 papers in training set
Top 4%
0.8%
24
Cell Stem Cell
57 papers in training set
Top 2%
0.8%
25
PLOS ONE
4510 papers in training set
Top 67%
0.8%
26
Scientific Reports
3102 papers in training set
Top 74%
0.8%
27
Nature Cell Biology
99 papers in training set
Top 5%
0.8%
28
Cell Genomics
162 papers in training set
Top 6%
0.8%
29
Nature Genetics
240 papers in training set
Top 7%
0.8%
30
PLOS Computational Biology
1633 papers in training set
Top 26%
0.7%