Back

On the predictability of progression-free survival in ovarian cancer from NanoString gene expression data

Van Kleunen, L. B.; Bowman, G.; Stockman, S. E.; Townsend, H. A.; Barrios, L.; Jordan, K. R.; Wolsky, R. J.; Behbakht, K.; Sikora, M. J.; Richer, J. K.; Hu, J.; Bitler, B. G.; Clauset, A.

2026-04-24 cancer biology
10.64898/2026.04.22.719856 bioRxiv
Show abstract

In the treatment of high grade serous ovarian cancer (HGSC), patients initially diagnosed with unresectable tumors are first treated with neoadjuvant chemotherapy (NACT) to reduce tumor burden prior to surgery. Analysis of matched pre- and post-NACT samples from the same patients enables the investigation of chemotherapy impacts and the biomarkers of progression. Although the tumor immune microenvironment (TIME) has increasingly been recognized as critical in shaping the development and progression of HGSC, we lack a comprehensive understanding of how chemotherapy remodels the TIME. Previous studies have found evidence for a general inflammatory response post-NACT, despite inconsistencies regarding which differentially expressed genes and pathways are implicated. We combine matched NanoString gene expression data from multiple sources to create a large dataset of matched pre- and post- NACT samples (N=83, with 29 novel to this study) and investigate reproducibility. Further, we use machine learning methods to investigate whether patient progression-free survival (PFS) can be predicted from the observed impact of chemotherapy on the TIME as represented by the comprehensive set of NanoString features. We find overall low predictability of PFS from all NanoString features, suggesting that previous results may have been limited by small sample size effects and that larger datasets are needed to identify more generalizable and translatable findings. We identify a set of differential expression features that are the most important for predicting patient outcomes that can be validated in future computational and biological studies. Author summaryA subset of patients with high grade serous ovarian cancer are treated with chemotherapy before surgery to reduce tumor burden. We investigate a large dataset of samples taken before and after chemotherapy. These matched samples enable an investigation of how the environment around tumors, for example immune cell infiltration, reacts to chemotherapy, providing insights into biomarkers for treatment response and treatments that could complement chemotherapy. This larger dataset only partially replicates results from previous studies, while also providing new insights. Machine learning models designed to predict the time to patient recurrence from available biomarkers indicate that they are not strongly predictive of patient outcomes, in contrast to past studies. These results suggest that larger datasets are needed. We identify a set of genes that change with chemotherapy and are indicative of and potentially useful for predicting time to disease recurrence and can be further investigated.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Scientific Reports
3102 papers in training set
Top 2%
14.4%
2
Cancers
200 papers in training set
Top 0.4%
10.1%
3
PLOS Computational Biology
1633 papers in training set
Top 3%
10.1%
4
BMC Cancer
52 papers in training set
Top 0.2%
7.2%
5
British Journal of Cancer
42 papers in training set
Top 0.2%
6.3%
6
PLOS ONE
4510 papers in training set
Top 33%
4.3%
50% of probability mass above
7
Clinical Cancer Research
58 papers in training set
Top 0.7%
2.5%
8
Frontiers in Genetics
197 papers in training set
Top 3%
2.4%
9
PeerJ
261 papers in training set
Top 5%
2.1%
10
npj Precision Oncology
48 papers in training set
Top 0.4%
2.1%
11
Frontiers in Bioinformatics
45 papers in training set
Top 0.1%
1.9%
12
iScience
1063 papers in training set
Top 14%
1.7%
13
F1000Research
79 papers in training set
Top 2%
1.7%
14
Cancer Research
116 papers in training set
Top 2%
1.5%
15
Biology Methods and Protocols
53 papers in training set
Top 1%
1.5%
16
Bioinformatics Advances
184 papers in training set
Top 4%
1.1%
17
G3 Genes|Genomes|Genetics
351 papers in training set
Top 2%
1.0%
18
Cancer Epidemiology, Biomarkers & Prevention
17 papers in training set
Top 0.5%
1.0%
19
BMC Medicine
163 papers in training set
Top 6%
0.9%
20
Computational and Structural Biotechnology Journal
216 papers in training set
Top 8%
0.8%
21
Breast Cancer Research
32 papers in training set
Top 0.5%
0.8%
22
Cell Reports Medicine
140 papers in training set
Top 8%
0.7%
23
Bulletin of Mathematical Biology
84 papers in training set
Top 2%
0.7%
24
Cell Communication and Signaling
35 papers in training set
Top 1%
0.7%
25
The Journal of Pathology
22 papers in training set
Top 0.5%
0.7%
26
Frontiers in Immunology
586 papers in training set
Top 8%
0.7%
27
JCI Insight
241 papers in training set
Top 8%
0.7%
28
JCO Precision Oncology
14 papers in training set
Top 0.4%
0.7%
29
Bioengineering & Translational Medicine
21 papers in training set
Top 1.0%
0.7%
30
Frontiers in Oncology
95 papers in training set
Top 4%
0.7%