Back

CRD: a De novo Design algorithm for prediction of Cognate Protein Receptors for small molecule ligands

Sankar, S.; Chandra, N.

2023-04-02 bioinformatics
10.1101/2023.03.30.534983 bioRxiv
Show abstract

While predicting a new ligand to bind to a protein is possible with current methods, the converse of predicting a receptor for a ligand is highly challenging, except for very closely-related known protein-ligand complexes. Predicting a receptor for any given ligand will be path-breaking in understanding protein function, mapping sequence-structure-function relationships and for several aspects of drug discovery including studying the mechanism of action of phenotypically discovered drugs, off-target effects and drug repurposing. We use a novel approach for predicting receptors for a given ligand through de novo design combined with structural bioinformatics. We have developed a new algorithm CRD, that has multiple modules which combines fragment-based sub-site finding, a machine learning function to estimate the size of the site, a genetic algorithm that encodes knowledge on protein structures and a physics-based fitness scoring scheme. CRD has a pseudo-receptor design component followed by a mapping component to identify possible proteins that house the site. CRD is designed to cater to ligands with known and unknown complexes. CRD accurately recovers sites and receptors for several known natural ligands including ATP, SAM, Glucose and FAD. It designs similar sites for similar ligands, yet to some extent distinguishes between closely related ligands. More importantly CRD correctly predicts receptor classes for several drugs such as penicillins and NSAIDs. We expect CRD to be a valuable tool in fundamental biology research as well as in the drug discovery and biotechnology industry.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
18.3%
2
Journal of Cheminformatics
25 papers in training set
Top 0.1%
14.1%
3
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.5%
12.1%
4
PLOS Computational Biology
1633 papers in training set
Top 4%
9.0%
50% of probability mass above
5
Frontiers in Molecular Biosciences
100 papers in training set
Top 0.2%
4.8%
6
PLOS ONE
4510 papers in training set
Top 32%
4.8%
7
BMC Bioinformatics
383 papers in training set
Top 2%
3.8%
8
Journal of Molecular Biology
217 papers in training set
Top 1.0%
2.4%
9
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.3%
2.3%
10
Bioinformatics Advances
184 papers in training set
Top 2%
2.0%
11
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.7%
12
Scientific Reports
3102 papers in training set
Top 60%
1.7%
13
Journal of Computational Chemistry
11 papers in training set
Top 0.1%
1.3%
14
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.3%
15
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.1%
1.2%
16
Molecules
37 papers in training set
Top 2%
0.9%
17
The Journal of Physical Chemistry B
158 papers in training set
Top 2%
0.9%
18
Protein Science
221 papers in training set
Top 2%
0.8%
19
Chemical Science
71 papers in training set
Top 2%
0.7%
20
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.9%
0.7%
21
PeerJ
261 papers in training set
Top 16%
0.7%
22
Nucleic Acids Research
1128 papers in training set
Top 18%
0.7%