Back

Heli-SMACC: Helicase-targeting SMAll Molecule Compound Collection

Martin, H.-J.; Hossain, M. A.; Wellnitz, J.; Kelestemur, E.; Hochuli, J.; Perveen, S.; Arrowsmith, C.; Willson, T. M.; Muratov, E.; Tropsha, A.

2024-07-08 bioinformatics
10.1101/2024.07.04.602122 bioRxiv
Show abstract

Helicases have emerged as promising targets for the development of antiviral drugs; however, the family remains largely undrugged. To support the focused development of viral helicase inhibitors we identified, collected, and integrated all chemogenomics data for all available helicases from the ChEMBL database. After thoroughly curating and enriching the data with relevant annotations we have created a derivative database of helicase inhibitors which we dubbed Heli-SMACC (Helicase-targeting SMAll Molecule Compound Collection). The current version of Heli-SMACC contains 20,432 bioactivity entries for viral, human, and bacterial helicases. We have selected 30 compounds with promising viral helicase activity and tested them in a SARS-CoV-2 NSP13 ATPase assay. Twelve compounds demonstrated ATPase inhibition and a consistent dose-response curve. The Heli-SMACC database may serve as a reference for virologists and medicinal chemists working on the development of novel helicase inhibitors. Heli-SMACC is publicly available at https://smacc.mml.unc.edu. HighlightsO_LIWe created a curated Helicase-Targeting SMAll Molecule Compound Collection (Heli-SMACC). C_LIO_LIHeli-SMACC covers 29 human, viral, and bacterial helicases. C_LIO_LITwelve of thirty selected compounds demonstrated inhibitory activity in a SARS-CoV-2 NSP13 ATPase Assay. C_LIO_LIHeli-SMACC is freely available online at https://smacc.mml.unc.edu. C_LI TOC Graphic O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=125 SRC="FIGDIR/small/602122v1_ufig1.gif" ALT="Figure 1"> View larger version (36K): org.highwire.dtl.DTLVardef@8e098forg.highwire.dtl.DTLVardef@115cb2borg.highwire.dtl.DTLVardef@1cd9da3org.highwire.dtl.DTLVardef@2870a6_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 11 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.6%
9.2%
2
Nature
575 papers in training set
Top 4%
6.9%
3
Nucleic Acids Research
1128 papers in training set
Top 3%
6.4%
4
Scientific Data
174 papers in training set
Top 0.3%
4.9%
5
Nature Protocols
30 papers in training set
Top 0.1%
4.4%
6
Bioinformatics
1061 papers in training set
Top 5%
4.4%
7
iScience
1063 papers in training set
Top 4%
3.6%
8
PLOS ONE
4510 papers in training set
Top 42%
3.3%
9
Nature Methods
336 papers in training set
Top 3%
3.1%
10
Nature Communications
4913 papers in training set
Top 43%
2.8%
11
Nature Biotechnology
147 papers in training set
Top 3%
2.6%
50% of probability mass above
12
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 27%
2.1%
13
Cell Chemical Biology
81 papers in training set
Top 1%
2.1%
14
Science
429 papers in training set
Top 13%
1.8%
15
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
16
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.7%
17
ACS Omega
90 papers in training set
Top 2%
1.5%
18
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.1%
1.3%
19
Frontiers in Pharmacology
100 papers in training set
Top 3%
1.3%
20
Patterns
70 papers in training set
Top 1%
1.2%
21
Computational and Structural Biotechnology Journal
216 papers in training set
Top 6%
1.2%
22
Protein Science
221 papers in training set
Top 1%
1.1%
23
Viruses
318 papers in training set
Top 4%
1.0%
24
International Journal of Molecular Sciences
453 papers in training set
Top 13%
0.9%
25
Frontiers in Genetics
197 papers in training set
Top 8%
0.9%
26
Nature Genetics
240 papers in training set
Top 6%
0.9%
27
PLOS Computational Biology
1633 papers in training set
Top 23%
0.8%
28
Scientific Reports
3102 papers in training set
Top 74%
0.8%
29
Cell Systems
167 papers in training set
Top 12%
0.8%
30
Frontiers in Molecular Biosciences
100 papers in training set
Top 5%
0.8%