Back

Integrating weighted correlation network analysis and machine learning identifies common trajectories of prostate cancer

Egger, G.; Sheibani-Tezerji, R.; Perez Malla, C. U.; Malzer, A.; Misura, K.; Kalla, J.; Tran, L.; Wasinger, G.

2023-03-02 bioinformatics
10.1101/2023.03.02.530740 bioRxiv
Show abstract

BackgroundProstate cancer diagnosis and prognosis is currently limited by the availability of sensitive and specific biomarkers. There is an urgent need to develop molecular biomarkers that allow for the distinction of indolent from aggressive disease, the sensitive detection of heterogeneous tumors, or the evaluation of micro-metastases. The availability of multi-omics datasets in publicly accessible databases provides a valuable foundation to develop computational workflows for the identification of suitable biomarkers for clinical management of cancer patients. ResultsWe combined transcriptomic data of primary localized and advanced prostate cancer from two cancer databases. Transcriptomic analysis of metastatic tumors unveiled a distinct overexpression pattern of genes encoding cell surface proteins intricately associated with cell-matrix components and chemokine signaling pathways. Utilizing an integrated approach combining machine learning and weighted gene correlation network modules, we identified the EZH2-TROAP axis as the main trajectory from initial tumor development to lethal metastatic disease. In addition, we identified and independently validated 58 promising biomarkers that were specifically upregulated in primary localized or metastatic disease. Among those biomarkers, 22 were highly significant for predicting biochemical recurrence. Notably, we confirmed TPX2 upregulation at the protein level in an independent cohort of primary prostate cancer and matched lymph node metastases. ConclusionsThis study demonstrates the effectiveness of using advanced bioinformatics approaches to identify the biological factors that drive prostate cancer progression. Furthermore, the targets identified show promise as prognostic biomarkers in clinical settings. Thus, integrative bioinformatics methods provide both deeper understanding of disease dynamics and open the doors for future personalized interventions.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
The Prostate
11 papers in training set
Top 0.1%
18.9%
2
International Journal of Cancer
42 papers in training set
Top 0.1%
10.2%
3
Scientific Reports
3102 papers in training set
Top 6%
10.2%
4
PLOS ONE
4510 papers in training set
Top 22%
8.5%
5
Frontiers in Oncology
95 papers in training set
Top 0.9%
4.0%
50% of probability mass above
6
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1%
4.0%
7
Bioinformatics
1061 papers in training set
Top 5%
3.6%
8
BMC Bioinformatics
383 papers in training set
Top 3%
3.6%
9
Journal of Translational Medicine
46 papers in training set
Top 0.4%
2.4%
10
Cancer Research Communications
46 papers in training set
Top 0.3%
1.9%
11
PeerJ
261 papers in training set
Top 7%
1.7%
12
Genome Medicine
154 papers in training set
Top 4%
1.7%
13
Cancers
200 papers in training set
Top 3%
1.5%
14
Frontiers in Immunology
586 papers in training set
Top 5%
1.2%
15
PLOS Computational Biology
1633 papers in training set
Top 19%
1.2%
16
iScience
1063 papers in training set
Top 24%
1.0%
17
Nature Communications
4913 papers in training set
Top 58%
1.0%
18
eBioMedicine
130 papers in training set
Top 3%
0.9%
19
Frontiers in Genetics
197 papers in training set
Top 10%
0.7%
20
BMC Genomics
328 papers in training set
Top 7%
0.7%
21
Oncogene
76 papers in training set
Top 2%
0.5%
22
Bioinformatics Advances
184 papers in training set
Top 6%
0.5%
23
Heliyon
146 papers in training set
Top 9%
0.5%
24
Neoplasia
22 papers in training set
Top 0.9%
0.5%
25
Modern Pathology
21 papers in training set
Top 0.6%
0.5%
26
Cancer Epidemiology, Biomarkers & Prevention
17 papers in training set
Top 0.8%
0.5%
27
JNCI Cancer Spectrum
10 papers in training set
Top 0.7%
0.5%