Back

Machine learning-based analysis of genomic and transcriptomic data unveils sarcoma clusters with superlative prognostic and predictive value

Esperanca-Martins, M.; Vasques, H.; Ravasqueira, M. S.; Lemos, M. M.; Fonseca, F.; Coutinho, D.; Lopez, J. A.; Huang, R. S. P.; Dias, S.; Gallego-Paez, L.; Costa, L.; Abecasis, N.; Goncalves, E.; Fernandes, I.

2025-02-02 oncology
10.1101/2025.01.31.25321492 medRxiv
Show abstract

Soft tissue sarcomas (STS) histopathological classification system has several conceptual caveats, impacting prognostication and treatment. The clinical and molecular-based tools currently employed to estimate prognosis also have limitations. Clinically driven molecular profiling studies may cover these gaps. We performed DNA sequencing (DNAseq) and RNA sequencing (RNAseq), portraying the molecular profile of 102 samples of 3 of the most common STS subtypes. The RNAseq data was analyzed using unsupervised machine learning models, unravelling previously unknown molecular patterns and identifying 4 well-defined transcriptomic clusters. These transcriptomic clusters have a clear prognostic value, a finding that was externally validated. This transcriptomic cluster-based classifications prognostic value is superior to the prognostic accuracy of currently used clinical-based (SARCULATOR nomograms) and molecular-based (CINSARC) prognostication tools. The analysis of DNAseq data from the same cohort of samples revealed a plethora of unique and, in some cases, never documented molecular targets for precision treatment across different transcriptomic clusters.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Cancers
200 papers in training set
Top 0.1%
37.4%
2
Frontiers in Oncology
95 papers in training set
Top 0.1%
12.2%
3
Scientific Reports
3102 papers in training set
Top 7%
10.0%
50% of probability mass above
4
eLife
5422 papers in training set
Top 25%
3.6%
5
JCO Precision Oncology
14 papers in training set
Top 0.1%
3.0%
6
PLOS ONE
4510 papers in training set
Top 43%
2.8%
7
PLOS Computational Biology
1633 papers in training set
Top 13%
2.3%
8
European Journal of Cancer
10 papers in training set
Top 0.1%
2.1%
9
Nature Communications
4913 papers in training set
Top 49%
1.9%
10
PeerJ
261 papers in training set
Top 8%
1.5%
11
Frontiers in Genetics
197 papers in training set
Top 7%
1.2%
12
npj Precision Oncology
48 papers in training set
Top 0.9%
1.2%
13
Clinical Cancer Research
58 papers in training set
Top 1%
1.2%
14
Oncotarget
15 papers in training set
Top 0.2%
1.2%
15
Cancer Research
116 papers in training set
Top 3%
0.8%
16
Journal of Experimental & Clinical Cancer Research
25 papers in training set
Top 0.2%
0.8%
17
Frontiers in Immunology
586 papers in training set
Top 8%
0.7%
18
Frontiers in Bioinformatics
45 papers in training set
Top 1.0%
0.7%
19
eBioMedicine
130 papers in training set
Top 5%
0.7%
20
Cancer Immunology, Immunotherapy
11 papers in training set
Top 0.4%
0.7%
21
Bone
22 papers in training set
Top 0.4%
0.6%
22
JCO Clinical Cancer Informatics
18 papers in training set
Top 1.0%
0.6%
23
BMC Cancer
52 papers in training set
Top 3%
0.6%