Back

Transcriptomic subtypes in high-grade serous ovarian cancer are driven by tumor cellular composition

Tanis, S.; Lixandrao, M.; Ivich, A.; Grieshober, L.; Lawson-Michod, K. A.; Collin, L. J.; Peres, L. C.; Salas, L. A.; Marks, J. R.; Bitler, B. G.; Greene, C. S.; Schildkraut, J. M.; Doherty, J. A.; Davidson, N. R.

2026-04-21 cancer biology
10.64898/2026.04.16.719000 bioRxiv
Show abstract

High-grade serous ovarian carcinoma (HGSC) is an aggressive malignancy for which bulk transcriptomic subtypes are used to stratify tumors, interpret biology, and guide biomarker development. The four TCGA-derived subtypes, mesenchymal (C1.MES), immunoreactive (C2.IMM), proliferative (C5.PRO), and differentiated (C4.DIF), are consistently observed across cohorts. However, despite their prominence, these subtypes have not translated into therapeutic utility, and their biological basis remains unresolved. Here, we show that HGSC transcriptomic subtypes are largely determined by tumor cellular composition rather than intrinsic malignant transcriptional programs. By integrating controlled single-cell-derived pseudobulk simulations with deconvolution-based analysis of 1,834 primary HGSC tumors across RNA-seq and microarray cohorts, we demonstrate that subtype probabilities align along a composition-driven axis of stromal and immune variation. Cellular composition alone predicted subtype labels with high accuracy (ROC-AUC = 0.81-0.95) and explained a substantial fraction of subtype-associated transcriptomic variation, with the mesenchymal (C1.MES) subtype representing the most robust and reproducible example of composition-driven signal. Although a secondary, composition-independent expression signal is detectable, it does not define the dominant structure of subtype classification. These findings redefine HGSC transcriptomic subtypes as features of the tumor ecosystem rather than discrete malignant states. This reinterpretation has immediate implications for studies that use subtype labels to infer tumor-intrinsic biology and provides a generalizable framework for separating composition-driven and intrinsic signals in bulk tumor data. Significance StatementHGSC transcriptomic subtypes lack consistent clinical utility and remain biologically ambiguous. We show subtype assignments are largely driven by tumor cellular composition, and less so by distinct intrinsic tumor states.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Cancer Research
116 papers in training set
Top 0.1%
10.1%
2
Cell Reports Medicine
140 papers in training set
Top 0.1%
10.1%
3
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 9%
7.1%
4
Cancer Discovery
61 papers in training set
Top 0.2%
6.3%
5
Clinical Cancer Research
58 papers in training set
Top 0.3%
4.8%
6
Nature Communications
4913 papers in training set
Top 35%
4.3%
7
Cell Reports
1338 papers in training set
Top 13%
3.9%
8
PLOS Computational Biology
1633 papers in training set
Top 10%
3.6%
50% of probability mass above
9
eLife
5422 papers in training set
Top 26%
3.6%
10
npj Precision Oncology
48 papers in training set
Top 0.2%
3.1%
11
JCI Insight
241 papers in training set
Top 2%
2.6%
12
Cancer Cell
38 papers in training set
Top 0.7%
2.4%
13
Cell Systems
167 papers in training set
Top 6%
2.1%
14
Cancer Research Communications
46 papers in training set
Top 0.3%
2.1%
15
Genome Medicine
154 papers in training set
Top 3%
2.1%
16
Modern Pathology
21 papers in training set
Top 0.2%
1.7%
17
Cell Genomics
162 papers in training set
Top 3%
1.7%
18
Scientific Reports
3102 papers in training set
Top 62%
1.5%
19
Cell
370 papers in training set
Top 13%
1.3%
20
Cancers
200 papers in training set
Top 4%
1.2%
21
Science Advances
1098 papers in training set
Top 23%
1.2%
22
Communications Biology
886 papers in training set
Top 14%
1.2%
23
Genome Biology
555 papers in training set
Top 6%
0.9%
24
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.9%
25
Science Signaling
55 papers in training set
Top 0.3%
0.9%
26
Journal of Clinical Investigation
164 papers in training set
Top 5%
0.9%
27
Nature Cell Biology
99 papers in training set
Top 4%
0.9%
28
PLOS ONE
4510 papers in training set
Top 66%
0.8%
29
Developmental Cell
168 papers in training set
Top 12%
0.7%
30
PNAS Nexus
147 papers in training set
Top 2%
0.7%