Verification of human nucleotide sequence reagents and cell line identities in original circRNA articles published in high impact factor journals

Pathmendra, P.; Enguita, F. J.; Byrne, J. A.

2026-05-29 genomics
10.64898/2026.05.28.728608 bioRxiv
The number of research articles studying circRNAs has increased rapidly since 2017. Previous analyses of human circRNA articles in two high impact factor cancer research journals identified papers with wrongly identified nucleotide sequence reagents and circRNAs whose identities could not be independently verified. In the present study, verification of human nucleotide sequence reagent and cell line identities in retracted circRNA articles published from 2017 to 2021 in high impact factor journals found wrongly identified nucleotide sequences and/or cell lines in all 13 retracted papers. Similar analyses of human circRNA papers published in high impact factor journals in 2022 found wrongly identified, non-verifiable and/or questionable reagents in 71% (84/118) of papers, and 51% (60/118) of papers described at least one wrongly identified reagent. When individual error types and features of concern were considered, 2022 circRNA papers described wrongly identified nucleotide sequence reagents (52/118, 44%), questionable circRNA probes that did not meet accepted targeting requirements (34/118, 29%), non-verifiable nucleotide sequences (25/118, 21%), wrongly identified cell lines (22/118, 19%), and/or non-verifiable cell line identifiers (6/118, 5%). In summary, wrongly identified, non-verifiable and/or questionable reagents were unexpectedly frequent in human circRNA papers in high impact journals, highlighting the need for critical engagement with the circRNA literature.
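The percentages quoted in the abstract can be rechecked directly from the reported counts. The following is a minimal sketch, not part of the preprint: the dictionary labels and the `percent` helper are illustrative names, and rounding to the nearest whole percent is an assumption that happens to reproduce every figure in the abstract.

```python
# Counts reported in the abstract for 2022 circRNA papers (n = 118).
# Labels are shorthand chosen for this sketch, not the authors' terms.
TOTAL = 118
counts = {
    "any flagged reagent": 84,
    "at least one wrongly identified reagent": 60,
    "wrongly identified nucleotide sequence reagents": 52,
    "questionable circRNA probes": 34,
    "non-verifiable nucleotide sequences": 25,
    "wrongly identified cell lines": 22,
    "non-verifiable cell line identifiers": 6,
}


def percent(count, total=TOTAL):
    """Return count/total as a percentage rounded to the nearest integer."""
    return round(100 * count / total)


percentages = {label: percent(n) for label, n in counts.items()}

for label, pct in percentages.items():
    print(f"{label}: {counts[label]}/{TOTAL} = {pct}%")
```

The five individual categories sum to more than 84 papers, which is consistent with the abstract's "and/or" wording: a single paper can fall into several categories.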

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1. Scientific Reports | 3102 papers in training set | Top 6% | 10.2%
2. Nucleic Acids Research | 1128 papers in training set | Top 2% | 7.3%
3. PLOS ONE | 4510 papers in training set | Top 24% | 6.9%
4. PeerJ | 261 papers in training set | Top 0.6% | 6.4%
5. BMC Genomics | 328 papers in training set | Top 0.4% | 4.9%
6. F1000Research | 79 papers in training set | Top 0.5% | 3.6%
7. The Journal of Molecular Diagnostics | 36 papers in training set | Top 0.1% | 3.6%
8. Frontiers in Genetics | 197 papers in training set | Top 2% | 3.6%
9. BMC Biology | 248 papers in training set | Top 0.4% | 2.9%
10. Computational and Structural Biotechnology Journal | 216 papers in training set | Top 2% | 2.9%

50% of probability mass above

11. Genome Biology | 555 papers in training set | Top 3% | 2.6%
12. BMC Cancer | 52 papers in training set | Top 0.9% | 2.4%
13. Database | 51 papers in training set | Top 0.3% | 2.1%
14. RNA Biology | 70 papers in training set | Top 0.2% | 2.1%
15. International Journal of Cancer | 42 papers in training set | Top 0.6% | 1.8%
16. Cancer Medicine | 24 papers in training set | Top 0.7% | 1.7%
17. Genes | 126 papers in training set | Top 0.9% | 1.7%
18. npj Genomic Medicine | 33 papers in training set | Top 0.5% | 1.3%
19. Genomics, Proteomics & Bioinformatics | 171 papers in training set | Top 4% | 1.2%
20. Communications Biology | 886 papers in training set | Top 14% | 1.2%
21. BMC Bioinformatics | 383 papers in training set | Top 5% | 1.2%
22. PLOS Biology | 408 papers in training set | Top 13% | 1.2%
23. Annals of Oncology | 13 papers in training set | Top 0.7% | 1.1%
24. Nature Communications | 4913 papers in training set | Top 59% | 1.0%
25. DNA Research | 23 papers in training set | Top 0.4% | 0.9%
26. Life Science Alliance | 263 papers in training set | Top 1% | 0.8%
27. Life | 27 papers in training set | Top 0.4% | 0.8%
28. RNA | 169 papers in training set | Top 0.4% | 0.8%
29. Scientific Data | 174 papers in training set | Top 3% | 0.7%
30. Briefings in Bioinformatics | 326 papers in training set | Top 7% | 0.7%
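The 50% cutoff marked in the list above can be reproduced by accumulating the displayed probabilities in rank order. This is a rough sketch under stated assumptions: the `top10` values are the rounded percentages shown for ranks 1 to 10, so the running total is approximate, and the variable names are illustrative rather than anything the site exposes.

```python
from itertools import accumulate

# Displayed probabilities (in %) for ranks 1-10, copied from the list above.
# These are rounded values, so the running total is only approximate.
top10 = [10.2, 7.3, 6.9, 6.4, 4.9, 3.6, 3.6, 3.6, 2.9, 2.9]

# Running total of probability mass after each rank.
cumulative = list(accumulate(top10))

# First rank at which the running total reaches at least 50%.
cutoff_rank = next(
    rank for rank, mass in enumerate(cumulative, start=1) if mass >= 50.0
)

for rank, mass in enumerate(cumulative, start=1):
    print(f"rank {rank}: cumulative {mass:.1f}%")
print(f"50% reached at rank {cutoff_rank}")
```

With the rounded figures, the running total stays just under 50% through rank 9 and crosses it at rank 10, which matches where the list places its 50% marker.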