Back

Path2Omics: Enhanced transcriptomic and methylation prediction accuracy from tumor histopathology

Hoang, D.-T.; Shulman, E. D.; Dhruba, S. R.; Nair, N. U.; Barman, R. K.; Lalchungnunga, H.; Singh, O.; Nasrallah, M. P.; Stone, E. A.; Aldape, K.; Ruppin, E.

2025-03-16 bioinformatics
10.1101/2025.02.26.640189 bioRxiv
Show abstract

Precision oncology is becoming increasingly integral to clinical practice, demonstrating notable improvements in treatment outcomes. While molecular data provide comprehensive insights, obtaining such data remains costly and time-consuming. To address this challenge, we developed Path2Omics, a deep learning model that predicts gene expression and methylation from histopathology for 23 cancer types. Path2Omics was trained on 20,497 slides (9,456 formalin-fixed and paraffin-embedded (FFPE) and 11,041 fresh frozen (FF)) from 8,007 patients across 23 The Cancer Genome Atlas cohorts. When tested on FFPE slides, the most readily available format in clinical pathology practice, the integrated model outperformed its individual FF and FFPE components, robustly predicting nearly 5,000 genes on average, approximately five times more than our recently published DeepPT model. Externally evaluated on seven independent cohorts, Path2Omics robustly predicted the expression of approximately 4,400 genes, yielding a 30% increase over the FFPE model alone. Finally, we demonstrate that the inferred gene expression is nearly as effective as the actual values in predicting patient survival and treatment response. These results lay the basis for using Path2Omics to advance precision oncology from histopathology slides in a speedy and cost-effective manner. Statement of significancePath2Omics is a deep learning model that accurately predicts gene expression and methylation from histopathology slides across 23 cancer types. Unlike existing approaches that rely solely on FFPE slides for training, Path2Omics leverages both FFPE and FF slides by constructing two separate models and integrating them. Downstream analyses show that the inferred values from Path2Omics are nearly as effective as actual values in predicting patient survival and treatment response.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
npj Precision Oncology
48 papers in training set
Top 0.1%
12.2%
2
Bioinformatics
1061 papers in training set
Top 3%
9.9%
3
Nature Communications
4913 papers in training set
Top 19%
9.9%
4
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.1%
8.0%
5
Scientific Reports
3102 papers in training set
Top 15%
6.7%
6
Cancer Research Communications
46 papers in training set
Top 0.1%
3.5%
50% of probability mass above
7
PLOS ONE
4510 papers in training set
Top 41%
3.5%
8
Cancer Research
116 papers in training set
Top 1%
2.7%
9
Genome Medicine
154 papers in training set
Top 3%
2.5%
10
Clinical Infectious Diseases
231 papers in training set
Top 2%
2.4%
11
BMC Bioinformatics
383 papers in training set
Top 4%
2.0%
12
PLOS Computational Biology
1633 papers in training set
Top 14%
2.0%
13
Nature Medicine
117 papers in training set
Top 2%
1.8%
14
Clinical Cancer Research
58 papers in training set
Top 1%
1.7%
15
Nature Machine Intelligence
61 papers in training set
Top 2%
1.7%
16
Nature Cancer
35 papers in training set
Top 0.8%
1.6%
17
Cell Reports Medicine
140 papers in training set
Top 4%
1.5%
18
Cell Systems
167 papers in training set
Top 9%
1.2%
19
JNCI Cancer Spectrum
10 papers in training set
Top 0.4%
1.1%
20
Science Advances
1098 papers in training set
Top 27%
0.9%
21
npj Digital Medicine
97 papers in training set
Top 3%
0.9%
22
Cancer Cell
38 papers in training set
Top 2%
0.8%
23
The Lancet Digital Health
25 papers in training set
Top 1%
0.8%
24
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 45%
0.7%
25
Annals of Oncology
13 papers in training set
Top 1%
0.7%
26
JAMIA Open
37 papers in training set
Top 2%
0.6%
27
Cancer Discovery
61 papers in training set
Top 2%
0.6%
28
Biomedicines
66 papers in training set
Top 4%
0.6%
29
Breast Cancer Research
32 papers in training set
Top 0.6%
0.6%
30
Cancer Medicine
24 papers in training set
Top 2%
0.6%