Back

A Graph-Directed Approach for Creation of a Homology Modeling Library: Application to Venom Structure Prediction

Mansbach, R. A.; Chakraborty, S.; Travers, T.; Gnanakaran, S.

2019-11-01 bioinformatics
10.1101/828129 bioRxiv
Show abstract

Many toxins are short, cysteine-rich peptides that are of great interest as novel therapeutic leads and of great concern as lethal biological agents due to their high affinity and specificity for various receptors involved in neuromuscular transmission. To perform initial candidate identification for design of a drug impacting a particular receptor or for threat assessment as a harmful toxin, one requires a set of candidate structures of reasonable accuracy with potential for interaction with the target receptor. In this article, we introduce a graph-based algorithm for identifying good extant template structures from a library of evolutionarily-related cysteine-containing sequences for structural determination of target sequences by homology modeling. We employ this approach to study the conotoxins, a set of toxin peptides produced by the family of aquatic cone snails. Currently, of the approximately six thousand known conotoxin sequences, only about three percent have experimentally characterized three-dimensional structures, leading to a serious bottleneck in identifying potential drug candidates. We demonstrate that the conotoxin template library generated by our approach may be employed to perform homology modeling and greatly increase the number of characterized conotoxin structures. We also show how our approach can guide experimental design by identifying and ranking sequences for structural characterization in a similar manner. Overall, we present and validate an approach for venom structure modeling and employ it to expand the library of extant conotoxin structures by almost 300% through homology modeling employing the template library determined in our approach.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.4%
12.6%
2
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.1%
12.5%
3
PLOS ONE
4510 papers in training set
Top 17%
10.5%
4
Scientific Reports
3102 papers in training set
Top 18%
6.4%
5
PLOS Computational Biology
1633 papers in training set
Top 7%
4.9%
6
Bioinformatics
1061 papers in training set
Top 5%
4.0%
50% of probability mass above
7
BMC Bioinformatics
383 papers in training set
Top 3%
2.6%
8
Molecules
37 papers in training set
Top 0.4%
2.6%
9
Bioinformatics Advances
184 papers in training set
Top 2%
2.1%
10
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.1%
11
BMC Genomics
328 papers in training set
Top 2%
1.7%
12
PeerJ
261 papers in training set
Top 8%
1.5%
13
Biology Methods and Protocols
53 papers in training set
Top 1%
1.2%
14
ImmunoInformatics
11 papers in training set
Top 0.1%
1.2%
15
Computational and Structural Biotechnology Journal
216 papers in training set
Top 6%
1.2%
16
Frontiers in Molecular Biosciences
100 papers in training set
Top 3%
1.2%
17
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.0%
18
Archives of Clinical and Biomedical Research
28 papers in training set
Top 2%
1.0%
19
Journal of Experimental Biology
249 papers in training set
Top 2%
1.0%
20
Journal of Cheminformatics
25 papers in training set
Top 0.5%
0.9%
21
Biochemistry and Biophysics Reports
28 papers in training set
Top 1%
0.8%
22
Pharmaceuticals
33 papers in training set
Top 1%
0.8%
23
Biophysical Journal
545 papers in training set
Top 5%
0.8%
24
Frontiers in Physiology
93 papers in training set
Top 6%
0.8%
25
Journal of Structural Biology
58 papers in training set
Top 2%
0.8%
26
Biomolecules
95 papers in training set
Top 2%
0.8%
27
Physical Biology
43 papers in training set
Top 2%
0.8%
28
Protein Science
221 papers in training set
Top 2%
0.8%
29
Journal of Biosciences
12 papers in training set
Top 0.1%
0.8%
30
Antibiotics
32 papers in training set
Top 1%
0.7%