Back

Mutation-centric Network Construction using Long-Range Interactions

Huseynov, R.; Otlu, B.

2026-03-18 bioinformatics
10.64898/2026.03.16.712158 bioRxiv
Show abstract

Somatic mutations can alter normal cells and lead to cancer development. Yet distinguishing functional driver mutations from neutral passenger mutations remains a significant challenge. Traditional genomic tools often prioritize linear overlap searches, failing to capture the complex, three-dimensional regulatory environment of the genome. We present a graph-based framework, MutationNetwork, for constructing mutation-centric networks by integrating long-range intrachromosomal interactions with local genomic overlaps. Our method utilizes a unique positive and negative indexing scheme to represent interacting genomic intervals as nodes. By encoding both interactions and overlaps as edges, we enable constant-time retrieval of complex relationship data. By iteratively expanding the graph from a seed mutation, we can quantify a mutations influence on the genomic landscape and assess its proximity to genes. We applied this framework to a dataset of 560 breast cancer whole-genome sequences, focusing on Triple-Negative Breast Cancer (TNBC) and Luminal A subtypes. Our results demonstrate that the generated mutation embeddings successfully cluster samples according to their biological subtypes, with the highest classification performance achieved at specific ranges. This approach provides a comprehensive view of mutation impact, offering a scalable solution for cancer patient stratification and the prioritization of potential non-coding driver mutations by assessing their network-level impact. Availability and implementationThe source code is available at https://github.com/Ramalh/MutationNetwork

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 0.4%
39.0%
2
Nucleic Acids Research
1128 papers in training set
Top 3%
6.3%
3
Bioinformatics Advances
184 papers in training set
Top 0.5%
6.2%
50% of probability mass above
4
Cell Systems
167 papers in training set
Top 3%
4.3%
5
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.1%
3.9%
6
PLOS Computational Biology
1633 papers in training set
Top 9%
3.9%
7
Genome Research
409 papers in training set
Top 1%
3.5%
8
GigaScience
172 papers in training set
Top 0.5%
3.5%
9
BMC Bioinformatics
383 papers in training set
Top 3%
3.0%
10
Nature Communications
4913 papers in training set
Top 44%
2.7%
11
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.1%
12
Genome Biology
555 papers in training set
Top 4%
1.9%
13
iScience
1063 papers in training set
Top 15%
1.7%
14
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 4%
1.7%
15
Genome Medicine
154 papers in training set
Top 5%
1.7%
16
Nature Biotechnology
147 papers in training set
Top 5%
1.3%
17
Scientific Reports
3102 papers in training set
Top 67%
1.2%
18
PLOS ONE
4510 papers in training set
Top 65%
0.9%
19
Nature Methods
336 papers in training set
Top 6%
0.8%
20
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.7%
21
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.9%
0.7%
22
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.6%