Back

Deep learning-based non-invasive profiling of tumor transcriptomes from cell-free DNA for precision oncology

Patton, R. D.; Netzley, A.; Persse, T. W.; Nair, A.; Galipeau, P. C.; Coleman, I. M.; Itagi, P.; Chandra, P.; Adil, M.; Vashisth, M.; Sayar, E.; Hiatt, J. B.; Dumpit, R.; Kollath, L.; Demirci, R. A.; Ghodsi, A.; Lam, H.-M.; Morrissey, C.; Iravani, A.; Chen, D. L.; Hsieh, A. C.; MacPherson, D.; Haffner, M. C.; Nelson, P. S.; Ha, G.

2026-02-12 bioinformatics
10.64898/2026.02.10.705188 bioRxiv
Show abstract

Circulating tumor DNA (ctDNA) profiling from liquid biopsies is increasingly adopted as a minimally invasive solution for clinical cancer diagnostic applications. Current methods for inferring gene expression from ctDNA require specialized assays or ultra-deep, targeted sequencing, which preclude transcriptome-wide profiling at single-gene resolution. Herein we jointly introduce Triton, a tool for comprehensive fragmentomic and nucleosome profiling of cell-free DNA (cfDNA), and Proteus, a multi-modal deep learning framework for predicting single gene expression, using standard depth ([~]30-120x) whole genome sequencing of cfDNA. By synthesizing fragmentation and inferred nucleosome positioning patterns in the promoter and gene body from Triton, Proteus reproduced expression profiles using pure ctDNA from patient-derived xenografts (PDX) with an accuracy similar to RNA-Seq technical replicates. Applying Proteus to cfDNA from four patient cohorts with matched tumor RNA-Seq, we show that the model accurately predicted the expression of specific prognostic and phenotype markers and therapeutic targets. As an analog to RNA-Seq, we further confirmed the immediate applicability of Proteus to existing tools through accurate prediction of gene pathway enrichment scores. Our results demonstrate the potential clinical utility of Triton and Proteus as non-invasive tools for precision oncology applications such as cancer monitoring and therapeutic guidance. SubjectsCirculating tumor DNA, liquid biopsies, patient-derived xenografts, whole genome sequencing, deep learning, convolutional neural network, gene expression

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 3%
8.6%
2
Nature Communications
4913 papers in training set
Top 25%
7.0%
3
Scientific Reports
3102 papers in training set
Top 16%
6.5%
4
npj Precision Oncology
48 papers in training set
Top 0.1%
6.5%
5
Cancer Research Communications
46 papers in training set
Top 0.1%
5.0%
6
Advanced Science
249 papers in training set
Top 3%
5.0%
7
Genome Medicine
154 papers in training set
Top 1%
5.0%
8
Small Methods
26 papers in training set
Top 0.2%
2.8%
9
Nature Machine Intelligence
61 papers in training set
Top 1%
2.8%
10
Cancer Research
116 papers in training set
Top 1%
2.7%
50% of probability mass above
11
Briefings in Bioinformatics
326 papers in training set
Top 3%
1.9%
12
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.4%
1.8%
13
BMC Bioinformatics
383 papers in training set
Top 4%
1.7%
14
PLOS ONE
4510 papers in training set
Top 53%
1.7%
15
Clinical Infectious Diseases
231 papers in training set
Top 3%
1.7%
16
Cell Reports Medicine
140 papers in training set
Top 4%
1.7%
17
Communications Biology
886 papers in training set
Top 10%
1.5%
18
Clinical Cancer Research
58 papers in training set
Top 1%
1.4%
19
Modern Pathology
21 papers in training set
Top 0.3%
1.3%
20
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
1.0%
21
Science Advances
1098 papers in training set
Top 25%
1.0%
22
European Journal of Nuclear Medicine and Molecular Imaging
19 papers in training set
Top 0.2%
0.9%
23
iScience
1063 papers in training set
Top 26%
0.9%
24
PLOS Computational Biology
1633 papers in training set
Top 22%
0.9%
25
npj Digital Medicine
97 papers in training set
Top 3%
0.8%
26
Frontiers in Genetics
197 papers in training set
Top 9%
0.8%
27
Journal of Translational Medicine
46 papers in training set
Top 3%
0.8%
28
Clinical Chemistry
22 papers in training set
Top 0.8%
0.8%
29
Computational and Structural Biotechnology Journal
216 papers in training set
Top 9%
0.8%
30
Journal of Pathology Informatics
13 papers in training set
Top 0.4%
0.8%