Back

MOSAIC: Model-based, Subgroup-Aware Identification of Driver Mutations in Cancer

Campbell, K.; Reyna, M. A.

2026-05-03 bioinformatics
10.64898/2026.04.29.721672 bioRxiv
Show abstract

In cancer genomics, recurrent patterns of mutual exclusivity within a gene set can indicate shared biological context and involvement in tumorigenesis. However, existing methods are not designed to distinguish between mutual exclusivity arising from meaningful biological interactions from those influenced by heterogeneity between underlying patient subpopulations. In this work, we introduce MOSAIC, a novel statistical framework that models patient subgroup heterogeneity in mutual exclusivity analyses. In experiments with simulated data and real data from The Cancer Genome Atlas, we show that MOSAIC amplifies subgroup-specific mutual exclusivity signals, including between IDH1 and IDH2 in young low grade glioma patients, while reducing the effect of signals produced by underlying subgroup structures, such as distinct genomic lineages associated with histological subtypes of endometrial cancer. Finally, we demonstrate that MOSAIC is more powerful than existing p-value combination methods for patient subgroup stratification. MOSAIC is available as an open-source tool at https://github.com/reynalab/mosaic.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 1%
22.3%
2
BMC Bioinformatics
383 papers in training set
Top 0.5%
14.6%
3
Nature Communications
4913 papers in training set
Top 21%
9.1%
4
PLOS Computational Biology
1633 papers in training set
Top 7%
4.8%
50% of probability mass above
5
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.9%
6
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.1%
3.6%
7
PLOS ONE
4510 papers in training set
Top 40%
3.6%
8
Genome Biology
555 papers in training set
Top 2%
3.6%
9
Cell Systems
167 papers in training set
Top 4%
3.6%
10
Nucleic Acids Research
1128 papers in training set
Top 7%
3.0%
11
Bioinformatics Advances
184 papers in training set
Top 2%
2.3%
12
Genome Research
409 papers in training set
Top 2%
1.9%
13
Scientific Reports
3102 papers in training set
Top 53%
1.9%
14
Frontiers in Genetics
197 papers in training set
Top 5%
1.7%
15
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 4%
1.3%
16
Genome Medicine
154 papers in training set
Top 6%
1.2%
17
iScience
1063 papers in training set
Top 25%
0.9%
18
BioData Mining
15 papers in training set
Top 0.7%
0.9%
19
Communications Biology
886 papers in training set
Top 24%
0.7%
20
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
21
Computational and Structural Biotechnology Journal
216 papers in training set
Top 9%
0.7%
22
PLOS Genetics
756 papers in training set
Top 16%
0.7%
23
European Journal of Human Genetics
49 papers in training set
Top 1%
0.7%