Back

Divergent RNA structures support accurate splicing of the SF3B1-sensitive MAP3K7 intron

Herbert, A.; Randazza, A.; Hatfield, A.; Lackey, L.

2026-05-21 molecular biology
10.64898/2026.05.19.726294 bioRxiv
Show abstract

Splicing is governed by interactions between the spliceosome and precursor RNA sequence and structural elements. However, the relative contributions of RNA sequence and structural elements remain unclear. Here, we systematically dissect these determinants using a high-throughput mutagenesis approach with the MAP3K7 intron reporter. The MAP3K7 gene encodes a serine/threonine kinase involved in response to environmental stress. MAP3K7 precursor RNA contains a cryptic 3 splice site that increases in use when the core spliceosomal protein SF3B1 is mutated. SF3B1 mutations are known to promote aberrant splicing and are associated with cancer, particularly the lysine 700 to glutamate mutation (K700E). We designed a pooled library of 249 MAP3K7 mutants targeting branch points, RNA-binding protein motifs, nucleotide composition and predicted structural elements. The impact of these mutants on splicing was measured in the context of normal and SF3B1 K700E expression. RNA structure was assessed in parallel using in vitro high-throughput SHAPE-MAP chemical probing. We found that branchpoint mutations drive the strongest increases in cryptic splice-site use. There is no overall correlation between cryptic splice-site use and structural similarity to the wild-type MAP3K7 RNA. However, mutants within an RNA binding protein hotspot (containing U2AF2, U2AF1, KHSRP and SRSF2 sites) are associated with cryptic splice-site use and structural similarity to wild-type MAP3K7 RNA. These structural changes are associated with increased ensemble diversity. Our results demonstrate that although there are key structured regions within an RNA, there is also extensive variability where divergent RNA structures allow for accurate splicing.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 7%
18.0%
2
Nucleic Acids Research
1128 papers in training set
Top 2%
8.8%
3
Molecular Cell
308 papers in training set
Top 2%
8.1%
4
eLife
5422 papers in training set
Top 9%
8.1%
5
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 13%
6.1%
6
RNA
169 papers in training set
Top 0.1%
4.7%
50% of probability mass above
7
Cell Reports
1338 papers in training set
Top 14%
3.8%
8
PLOS Genetics
756 papers in training set
Top 5%
3.5%
9
Cell Systems
167 papers in training set
Top 5%
3.0%
10
Journal of Molecular Biology
217 papers in training set
Top 0.8%
3.0%
11
Cell Genomics
162 papers in training set
Top 3%
1.8%
12
Scientific Reports
3102 papers in training set
Top 54%
1.8%
13
Nature Structural & Molecular Biology
218 papers in training set
Top 3%
1.6%
14
Science
429 papers in training set
Top 15%
1.6%
15
Structure
175 papers in training set
Top 2%
1.6%
16
Genes & Development
90 papers in training set
Top 0.5%
1.6%
17
PLOS ONE
4510 papers in training set
Top 55%
1.6%
18
Genome Research
409 papers in training set
Top 3%
1.4%
19
iScience
1063 papers in training set
Top 22%
1.2%
20
Genome Biology
555 papers in training set
Top 6%
1.2%
21
Nature Genetics
240 papers in training set
Top 7%
0.9%
22
PLOS Computational Biology
1633 papers in training set
Top 23%
0.9%
23
RNA Biology
70 papers in training set
Top 0.5%
0.8%
24
Nature Biotechnology
147 papers in training set
Top 8%
0.7%
25
Journal of Biological Chemistry
641 papers in training set
Top 5%
0.7%
26
Biophysical Journal
545 papers in training set
Top 6%
0.7%
27
Science Advances
1098 papers in training set
Top 34%
0.6%