Back

Evaluating the reliability of tools for mRNA annotation and IRES studies

May, G. E.; Akirtava, C.; McManus, J.

2026-03-31 genomics
10.64898/2026.03.29.707813 bioRxiv
Show abstract

Since the discovery of viral Internal Ribosome Entry Sites (IRESes), researchers have sought to find similar elements in mammalian host genes, termed "cellular IRESes". However, the plasmid systems used to measure cellular IRES activity are vulnerable to false positives due to promoter activity in candidate IRESes. Orthogonal methods are needed to validate putative IRESes while carefully avoiding artifacts known to cause false positives. Recently, Koch et al. proposed approaches for studying IRESes, primarily circular RNA-generating plasmids, and for validating mRNA transcripts using smFISH and qRT-PCR. Here, we demonstrate confounding variables and artifacts in each of these approaches that can lead to inappropriate conclusions about potential cellular IRES activity. We show the back-splicing circRNA plasmid creates linear mRNA artifacts associated with false-positive IRES signals. Using orthogonal, gold-standard assays validated with viral IRESes, we find putative cellular IRESes reported using the back-splicing plasmid have no IRES activity. Furthermore, we demonstrate that smFISH and qRT-PCR can misidentify nuclear non-coding RNAs as mRNAs and we validate a single molecule sequencing assay for identifying genuine mRNA 5 ends. Our work establishes reliable methods for robust transcript annotation and IRES studies that avoid documented artifacts arising from bicistronic and back-splicing circRNA plasmid reporters.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 18%
10.2%
2
Frontiers in Genetics
197 papers in training set
Top 0.8%
6.4%
3
Mobile DNA
27 papers in training set
Top 0.1%
6.4%
4
NAR Genomics and Bioinformatics
214 papers in training set
Top 0.2%
6.4%
5
Nucleic Acids Research
1128 papers in training set
Top 3%
6.4%
6
BMC Bioinformatics
383 papers in training set
Top 2%
4.9%
7
Scientific Reports
3102 papers in training set
Top 23%
4.9%
8
PLOS Computational Biology
1633 papers in training set
Top 8%
4.0%
9
RNA
169 papers in training set
Top 0.1%
3.6%
50% of probability mass above
10
Journal of Bioinformatics and Systems Biology
14 papers in training set
Top 0.1%
2.8%
11
RNA Biology
70 papers in training set
Top 0.1%
2.5%
12
BMC Genomics
328 papers in training set
Top 2%
2.1%
13
G3 Genes|Genomes|Genetics
351 papers in training set
Top 1%
1.8%
14
Journal of Virology
456 papers in training set
Top 2%
1.8%
15
F1000Research
79 papers in training set
Top 2%
1.7%
16
Journal of Biological Chemistry
641 papers in training set
Top 2%
1.7%
17
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.5%
18
PeerJ
261 papers in training set
Top 9%
1.3%
19
Genes
126 papers in training set
Top 1%
1.3%
20
Genome Biology
555 papers in training set
Top 5%
1.2%
21
Genome Research
409 papers in training set
Top 3%
1.2%
22
Bioinformatics Advances
184 papers in training set
Top 4%
1.2%
23
Current Protocols
13 papers in training set
Top 0.1%
1.1%
24
eLife
5422 papers in training set
Top 53%
0.9%
25
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
26
mSphere
281 papers in training set
Top 6%
0.8%
27
Heliyon
146 papers in training set
Top 7%
0.7%
28
Methods
29 papers in training set
Top 0.7%
0.7%
29
Journal of Molecular Biology
217 papers in training set
Top 4%
0.7%
30
Bioinformatics
1061 papers in training set
Top 10%
0.7%