Back

Adaptation of a Mutual Exclusivity Framework to Identify Driver Mutations within Biological Pathways

Wang, X.; Kostrzewa, C.; Reiner, A.; Shen, R.; Begg, C.

2023-09-22 cancer biology
10.1101/2023.09.19.558469 bioRxiv
Show abstract

Distinguishing genomic alterations in cancer genes that have functional impact on tumor growth and disease progression from the ones that are passengers and confer no fitness advantage has important clinical implications. Evidence-based methods for nominating drivers are limited by existing knowledge on the oncogenic effects and therapeutic benefits of specific variants from clinical trials or experimental settings. As clinical sequencing becomes a mainstay of patient care, applying computational methods to mine the rapidly growing clinical genomic data holds promise in uncovering novel functional candidates beyond the existing knowledge-base and expanding the patient population that could potentially benefit from genetically targeted therapies. We propose a statistical and computational method (MAGPIE) that builds on a likelihood approach leveraging the mutual exclusivity pattern within an oncogenic pathway for identifying probabilistically both the specific genes within a pathway and the individual mutations within such genes that are truly the drivers. Alterations in a cancer gene are assumed to be a mixture of driver and passenger mutations with the passenger rates modeled in relationship to tumor mutational burden. A limited memory BFGS algorithm is used to facilitate large scale optimization. We use simulations to study the operating characteristics of the method and assess false positive and false negative rates in driver nomination. When applied to a large study of primary melanomas the method accurately identified the known driver genes within the RTK-RAS pathway and nominated a number of rare variants with previously unknown biological and clinical relevance as prime candidates for functional validation.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 0.1%
35.2%
2
PLOS ONE
4510 papers in training set
Top 18%
10.4%
3
Bioinformatics
1061 papers in training set
Top 3%
10.4%
50% of probability mass above
4
Biostatistics
21 papers in training set
Top 0.1%
4.1%
5
Scientific Reports
3102 papers in training set
Top 45%
2.7%
6
Bioinformatics Advances
184 papers in training set
Top 2%
2.5%
7
BMC Bioinformatics
383 papers in training set
Top 3%
2.4%
8
PLOS Genetics
756 papers in training set
Top 7%
2.1%
9
Cell Systems
167 papers in training set
Top 6%
1.8%
10
Nature Communications
4913 papers in training set
Top 51%
1.7%
11
npj Systems Biology and Applications
99 papers in training set
Top 1%
1.7%
12
Journal of Computational Biology
37 papers in training set
Top 0.2%
1.5%
13
Statistics in Medicine
34 papers in training set
Top 0.2%
1.1%
14
Communications Biology
886 papers in training set
Top 18%
0.9%
15
iScience
1063 papers in training set
Top 26%
0.9%
16
Expert Systems with Applications
11 papers in training set
Top 0.3%
0.9%
17
Cancers
200 papers in training set
Top 4%
0.8%
18
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
19
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 45%
0.7%
20
Frontiers in Genetics
197 papers in training set
Top 10%
0.7%
21
Cancer Research
116 papers in training set
Top 3%
0.7%
22
Computational and Structural Biotechnology Journal
216 papers in training set
Top 10%
0.7%
23
The American Journal of Human Genetics
206 papers in training set
Top 4%
0.7%
24
Frontiers in Molecular Biosciences
100 papers in training set
Top 6%
0.7%
25
Physical Biology
43 papers in training set
Top 2%
0.7%
26
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.9%
0.7%
27
Bulletin of Mathematical Biology
84 papers in training set
Top 2%
0.5%
28
Genome Medicine
154 papers in training set
Top 10%
0.5%
29
Journal of The Royal Society Interface
189 papers in training set
Top 6%
0.5%
30
Biology Methods and Protocols
53 papers in training set
Top 3%
0.5%