Back

Oncogenes and tumor suppressor genes are enriched in stop-loss mutations generating protein extensions

Boll, L. M.; Martorell, J. A.; Khelghati, N.; Camarena, M. E.; Vianello, C.; Garcia-Soriano, J. C.; Santamaria, E.; Artoleta, I.; Saez-Valle, S.; Perera-Bel, J.; Fortes, P.; Alba, M. M.

2026-03-16 genomics
10.64898/2026.03.12.711331 bioRxiv
Show abstract

Cancer genomes tend to accumulate a large number of mutations, and even rare mutations such as those causing the loss of a stop codon can be observed in a significant fraction of the tumors. Stop-loss mutations extend protein translation into the 3 untranslated region (3 UTR), generating altered proteins carrying extra amino acid sequences. These C-terminal extensions can potentially have consequences for tumorigenesis and immune recognition. To investigate the prevalence of stop-loss mutations in cancer, and to identify recurrent mutations with a possible tumor-promoting effect, we have interrogated mutation data from the tumor samples of 20,801 patients. This search has resulted in the annotation of 3,757 stop-loss mutations in 3,249 different protein-coding genes. Around 11% of the mutated genes contain recurrent stop-loss mutations, occurring in more than one patient. The protein extensions created by the mutations tend to be hydrophobic and/or positively charged, and these features are associated with an increased propensity to generate MHC I-bound peptides. We have also found that cancer-related genes contain 37% more stop-loss mutations than non-cancer-related genes, with both oncogenes and tumor suppressor genes showing similar enrichments. Furthermore, three out of the four genes with the highest number of stop-loss recurrences, PTMA, PCDH9 and SOX9, are cancer-related. In PTMA, the gene with the largest number of stop-loss mutations (14 patients), the mutation results in an extension of 9 amino acids. We provide experimental evidence that the mutation is associated with impaired cleavage of thymosin alpha 1, a peptide with immunostimulatory functions that is generated from the N-terminal part of the PTMA protein. The study provides evidence that stop-loss mutations are enriched in cancer-associated genes and constitutes a valuable resource for further studies on the effects of stop-loss mutations in cancer.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Scientific Reports
3102 papers in training set
Top 0.4%
22.2%
2
Nucleic Acids Research
1128 papers in training set
Top 2%
7.1%
3
BMC Cancer
52 papers in training set
Top 0.5%
3.9%
4
iScience
1063 papers in training set
Top 4%
3.9%
5
Cancers
200 papers in training set
Top 2%
3.5%
6
Molecular Systems Biology
142 papers in training set
Top 0.2%
3.5%
7
Cell Reports
1338 papers in training set
Top 17%
3.2%
8
eLife
5422 papers in training set
Top 30%
3.0%
50% of probability mass above
9
Frontiers in Genetics
197 papers in training set
Top 3%
2.3%
10
PLOS ONE
4510 papers in training set
Top 47%
2.3%
11
Genome Medicine
154 papers in training set
Top 3%
2.3%
12
G3 Genes|Genomes|Genetics
351 papers in training set
Top 1%
2.0%
13
EMBO reports
136 papers in training set
Top 2%
1.9%
14
Nature Communications
4913 papers in training set
Top 50%
1.8%
15
PLOS Computational Biology
1633 papers in training set
Top 17%
1.6%
16
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.5%
17
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.5%
18
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.3%
19
BMC Medical Genomics
36 papers in training set
Top 0.7%
1.3%
20
Communications Biology
886 papers in training set
Top 13%
1.3%
21
PLOS Genetics
756 papers in training set
Top 11%
1.2%
22
Oncotarget
15 papers in training set
Top 0.2%
1.1%
23
Cell Genomics
162 papers in training set
Top 5%
0.9%
24
PLOS Biology
408 papers in training set
Top 16%
0.9%
25
BMC Genomics
328 papers in training set
Top 4%
0.9%
26
International Journal of Cancer
42 papers in training set
Top 1.0%
0.9%
27
PeerJ
261 papers in training set
Top 12%
0.9%
28
Genome Biology
555 papers in training set
Top 6%
0.9%
29
Human Mutation
29 papers in training set
Top 0.7%
0.8%
30
Bioinformatics
1061 papers in training set
Top 9%
0.8%