Back

Human and bats genome robustness under COSMIC mutational signatures

Song, J.-H.; Zeng, Y.; Davalos, L. M.; MacCarthy, T.; Larijani, M.; Damaghi, M.

2024-09-07 genomics
10.1101/2024.09.05.611453 bioRxiv
Show abstract

Carcinogenesis is an evolutionary process, and mutations can fix the selected phenotypes in selective microenvironments. Both normal and neoplastic cells are robust to the mutational stressors in the microenvironment to the extent that secure their fitness. To test the robustness of genes under a range of mutagens, we developed a sequential mutation simulator, Sinabro, to simulate single base substitution under a given mutational process. Then, we developed a pipeline to measure the robustness of genes and cells under those mutagenesis processes. We discovered significant human genome robustness to the APOBEC mutational signature SBS2, which is associated with viral defense mechanisms and is implicated in cancer. Robustness evaluations across over 70,000 sequences against 41 signatures showed higher resilience under signatures predominantly causing C-to-T (G-to-A) mutations. Principal component analysis indicates the GC content at the codons wobble position significantly influences robustness, with increased resilience noted under transition mutations compared to transversions. Then, we tested our results in bats at extremes of the lifespan-to-mass relationship and found the long-lived bat is more robust to APOBEC than the short-lived one. By revealing robustness to APOBEC ranked highest in human (and bats with much more than number of APOBEC) genome, this work bolsters the key potential role of APOBECs in aging and cancer, as well as evolved countermeasures to this innate mutagenic process. It also provides the baseline of the human and bat genome robustness under mutational processes associated with aging and cancer. HighlightsO_LISinabro, the sequential mutation simulator, facilitates measuring the robustness of human protein-coding sequences under all COSMIC mutational signatures. C_LIO_LIRobustness under APOBEC mutational signatures showed the largest mean and standard deviation in the human genome. C_LIO_LIRobustness to mutational signatures analysis reveals the role of APOBECs is complementary to cancer in the evolvability of cancer cells in later stages. C_LIO_LIPrincipal component analysis indicates that the GC content at the codons wobble position significantly influences robustness. C_LIO_LIA long-lived bat (Myotis myotis) has higher robustness to APOBECs than a short-lived one (Molossus molossus) than humans. C_LI

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.1%
23.2%
2
PLOS Computational Biology
1633 papers in training set
Top 4%
8.7%
3
Frontiers in Genetics
197 papers in training set
Top 0.5%
8.4%
4
GigaScience
172 papers in training set
Top 0.3%
4.4%
5
Aging
69 papers in training set
Top 0.6%
3.7%
6
Scientific Reports
3102 papers in training set
Top 34%
3.7%
50% of probability mass above
7
Aging Cell
144 papers in training set
Top 1%
3.2%
8
Journal of Theoretical Biology
144 papers in training set
Top 0.5%
3.0%
9
Communications Biology
886 papers in training set
Top 3%
2.7%
10
Genome Biology and Evolution
280 papers in training set
Top 0.7%
2.1%
11
npj Aging
15 papers in training set
Top 0.4%
1.9%
12
Briefings in Bioinformatics
326 papers in training set
Top 3%
1.8%
13
Nucleic Acids Research
1128 papers in training set
Top 10%
1.7%
14
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 3%
1.7%
15
Nature Communications
4913 papers in training set
Top 53%
1.5%
16
iScience
1063 papers in training set
Top 17%
1.5%
17
PLOS Genetics
756 papers in training set
Top 10%
1.4%
18
Bioinformatics
1061 papers in training set
Top 8%
1.4%
19
PLOS ONE
4510 papers in training set
Top 59%
1.3%
20
BMC Bioinformatics
383 papers in training set
Top 6%
0.8%
21
Cell Reports
1338 papers in training set
Top 31%
0.8%
22
Genetic Epidemiology
46 papers in training set
Top 0.8%
0.8%
23
GeroScience
97 papers in training set
Top 2%
0.8%
24
eLife
5422 papers in training set
Top 56%
0.8%
25
International Journal of Molecular Sciences
453 papers in training set
Top 15%
0.8%
26
Cell Genomics
162 papers in training set
Top 7%
0.7%
27
Cells
232 papers in training set
Top 7%
0.7%
28
Chaos, Solitons & Fractals
32 papers in training set
Top 2%
0.7%
29
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
30
Patterns
70 papers in training set
Top 3%
0.7%