Back

An generative-AI framework for target-Specific MicroRNAs towards RNAi-based drug design

Gu, J.; Li, Y.

2026-05-11 genomics
10.64898/2026.05.07.723585 bioRxiv
Show abstract

MicroRNA (miRNAs) are small non-coding RNAs that regulate gene expression by binding to the target messenger RNA (mRNA), whose versatility has inspired RNA-interference (RNAi)-based drug designs. However, off-target effects lead to unintended gene silencing and toxicity. Existing methods suffer from experimental data scarcity and fail to effectively integrate target specificity into designing de novo small interference RNAs (siRNA). To overcome the above challenges, we present SO_SCPLOWPECIC_SCPLOWMO_SCPLOWIC_SCPLOWR, a specificity-guided generative framework that synthesizes target-conditioned miRNAs. By training on a large experimental data containing 2.2M miRNA-mRNA pairs, SO_SCPLOWPECIC_SCPLOWMO_SCPLOWIC_SCPLOWR minimizes off-target effects with enhanced on-target potency. As a result, SO_SCPLOWPECIC_SCPLOWMO_SCPLOWIC_SCPLOWR-generated miRNAs bind more strongly to the target mRNAs than the observed miRNAs and much less so to off-target mRNAs. We tested SO_SCPLOWPECIC_SCPLOWMO_SCPLOWIC_SCPLOWR on mRNA targets for liver disease, for which 6 FDA-approved siRNA-based drugs were available. SO_SCPLOWPECIC_SCPLOWMO_SCPLOWIC_SCPLOWR recovers binding regions that correspond to FDA-approved siRNA drugs across 3 targets, and demonstrates greater structural specificity for on-target mRNAs than for off-target mRNAs. Together, SO_SCPLOWPECIC_SCPLOWMO_SCPLOWIC_SCPLOWR offers an AI solution to synthesize miRNA-inspired and target-specific siRNA sequences towards RNAi-based drug design.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Nucleic Acids Research
1128 papers in training set
Top 2%
10.2%
2
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.4%
6.9%
3
Bioinformatics Advances
184 papers in training set
Top 0.4%
6.4%
4
Bioinformatics
1061 papers in training set
Top 4%
6.4%
5
Nature Machine Intelligence
61 papers in training set
Top 0.6%
4.9%
6
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 1%
4.9%
7
Nature Communications
4913 papers in training set
Top 33%
4.9%
8
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.9%
9
Frontiers in Genetics
197 papers in training set
Top 1%
4.2%
50% of probability mass above
10
Advanced Science
249 papers in training set
Top 5%
3.6%
11
PLOS Computational Biology
1633 papers in training set
Top 9%
3.6%
12
Cell Genomics
162 papers in training set
Top 2%
2.9%
13
iScience
1063 papers in training set
Top 7%
2.8%
14
Scientific Reports
3102 papers in training set
Top 43%
2.8%
15
NAR Genomics and Bioinformatics
214 papers in training set
Top 1%
2.4%
16
Communications Biology
886 papers in training set
Top 5%
2.1%
17
PLOS ONE
4510 papers in training set
Top 53%
1.7%
18
Nature Biotechnology
147 papers in training set
Top 5%
1.7%
19
Genome Research
409 papers in training set
Top 3%
1.3%
20
Genome Biology
555 papers in training set
Top 5%
1.3%
21
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.4%
1.1%
22
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 41%
0.9%
23
Cell Systems
167 papers in training set
Top 11%
0.8%
24
Nature Computational Science
50 papers in training set
Top 1%
0.8%
25
Genome Medicine
154 papers in training set
Top 8%
0.8%
26
Cell Reports
1338 papers in training set
Top 34%
0.7%
27
Nature Biomedical Engineering
42 papers in training set
Top 3%
0.6%
28
Journal of Chemical Information and Modeling
207 papers in training set
Top 3%
0.6%
29
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 11%
0.6%
30
Science Bulletin
22 papers in training set
Top 1%
0.5%