Back

RNAiSpline: A Deep learning model for siRNA efficacy prediction

Surkanti, S. R.; Kasturi, V. V.; Saligram, S. S.; Basangari, B. C.; Kondaparthi, V.

2026-02-17 bioinformatics
10.64898/2026.02.14.705949 bioRxiv
Show abstract

RNA interference (RNAi) is a crucial biological post-transcriptional gene silencing mechanism where small interfering RNA (siRNA) guides RNA-induced silencing complex (RISC) to bind with messenger RNA (mRNA) thereby silencing it and stopping protein formation. We exploit this process to prevent the formation of harmful proteins by silencing mRNA before it is translated into protein through an effective siRNA. There exists a need to develop a computational model that predicts the effectiveness of siRNA on a given mRNA. Designing a model is challenging, as the data availability is either scarce or biased, and existing models lack generalization ability, even though the parameters to training samples ratio is very high. To overcome these challenges, we introduce RNAiSpline, which incorporates self-supervised pretraining and fine-tuning with Kalmogorov-Arnold Network (KAN), Convolutional Neural Network (CNN), and Transformer Encoder. Evaluation on the independent test dataset yields an ROC-AUC of 0.8175, an F1 score of 0.7717, and Pearson correlation of 0.6032, making RNAiSpline a robust model for siRNA efficacy prediction.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
17.2%
2
Nucleic Acids Research
1128 papers in training set
Top 1%
12.5%
3
Briefings in Bioinformatics
326 papers in training set
Top 1.0%
6.2%
4
Nature Communications
4913 papers in training set
Top 33%
4.8%
5
Bioinformatics Advances
184 papers in training set
Top 0.9%
4.3%
6
Scientific Reports
3102 papers in training set
Top 36%
3.6%
7
PLOS ONE
4510 papers in training set
Top 41%
3.5%
50% of probability mass above
8
Frontiers in Genetics
197 papers in training set
Top 2%
3.5%
9
BMC Bioinformatics
383 papers in training set
Top 3%
3.5%
10
PLOS Computational Biology
1633 papers in training set
Top 10%
3.5%
11
iScience
1063 papers in training set
Top 8%
2.6%
12
Communications Biology
886 papers in training set
Top 4%
2.3%
13
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
2.0%
14
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 3%
1.9%
15
Molecular Therapy Nucleic Acids
32 papers in training set
Top 0.3%
1.8%
16
Cell Systems
167 papers in training set
Top 7%
1.8%
17
Nature Machine Intelligence
61 papers in training set
Top 2%
1.7%
18
Journal of Molecular Biology
217 papers in training set
Top 2%
1.3%
19
Advanced Science
249 papers in training set
Top 14%
1.2%
20
Genome Research
409 papers in training set
Top 4%
0.8%
21
IEEE/ACM Transactions on Computational Biology and Bioinformatics
32 papers in training set
Top 0.5%
0.8%
22
Quantitative Biology
11 papers in training set
Top 0.7%
0.8%
23
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 43%
0.8%
24
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
25
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.7%
26
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.7%
0.7%
27
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.7%
28
BMC Biology
248 papers in training set
Top 6%
0.6%