Back

From sequences to therapeutics: Machine learning predicts chemically modified siRNA activity

Martinelli, D. D.

2023-08-18 bioinformatics
10.1101/2023.08.16.553554 bioRxiv
Show abstract

AO_SCPLOWBSTRACTC_SCPLOWSmall interfering RNAs (siRNAs) exemplify the promise of genetic medicine in the discovery of novel therapeutic modalities. Their ability to selectively suppress gene expression makes them ideal candidates for development as oligonucleotide pharmaceuticals. Recent advancements in machine learning (ML) have facilitated unmodified siRNA design and efficacy prediction, but a model trained to predict the silencing activity of siRNAs with diverse chemical modification patterns has yet to be published, despite the importance of such chemical modifications in designing siRNAs with the potential to advance to the clinic. This study presents the first application of ML to classify efficient chemically modified siRNAs from sequence and chemical modification patterns alone. Three algorithms are evaluated at three classification thresholds and compared according to sensitivity, specificity, consistency of feature weights with empirical knowledge, and performance on an external validation dataset. Finally, possible directions for future research are proposed.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Molecular Therapy Nucleic Acids
32 papers in training set
Top 0.1%
38.9%
2
Scientific Reports
3102 papers in training set
Top 22%
5.0%
3
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1%
4.3%
4
Briefings in Bioinformatics
326 papers in training set
Top 2%
4.1%
50% of probability mass above
5
Journal of Chemical Information and Modeling
207 papers in training set
Top 1%
3.8%
6
Nucleic Acids Research
1128 papers in training set
Top 5%
3.7%
7
PLOS ONE
4510 papers in training set
Top 38%
3.7%
8
Frontiers in Genetics
197 papers in training set
Top 3%
2.4%
9
Bioinformatics Advances
184 papers in training set
Top 2%
2.1%
10
Bioinformatics
1061 papers in training set
Top 7%
1.9%
11
PLOS Computational Biology
1633 papers in training set
Top 14%
1.9%
12
International Journal of Molecular Sciences
453 papers in training set
Top 7%
1.7%
13
BMC Bioinformatics
383 papers in training set
Top 4%
1.7%
14
RNA Biology
70 papers in training set
Top 0.2%
1.7%
15
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.3%
16
ACS Omega
90 papers in training set
Top 3%
1.0%
17
Biology Methods and Protocols
53 papers in training set
Top 2%
0.9%
18
PeerJ
261 papers in training set
Top 12%
0.9%
19
Molecules
37 papers in training set
Top 1%
0.9%
20
Molecular Therapy - Nucleic Acids
24 papers in training set
Top 0.2%
0.9%
21
Pharmaceuticals
33 papers in training set
Top 1%
0.8%
22
Synthetic and Systems Biotechnology
10 papers in training set
Top 0.6%
0.7%
23
ACS Chemical Biology
150 papers in training set
Top 2%
0.7%
24
Cell Reports Physical Science
18 papers in training set
Top 1%
0.7%
25
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
26
Patterns
70 papers in training set
Top 3%
0.5%
27
Biosystems
18 papers in training set
Top 0.6%
0.5%
28
RSC Chemical Biology
32 papers in training set
Top 0.8%
0.5%
29
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 12%
0.5%
30
NAR Molecular Medicine
18 papers in training set
Top 0.4%
0.5%