Back

Drug design using unique conformations to preferentially target a specific site on collagen-bound MMP1

SARKAR, S. K.; Nash, A.; Harms, C.

2026-05-17 biophysics
10.64898/2026.05.14.725194 bioRxiv
Show abstract

Precise site-specific drug design remains a challenge in structure-based drug discovery. Most existing approaches screen for ligands to target binding pockets on a protein surface based on static structures obtained from techniques such as X-ray, NMR, cryo-EM, and AlphaFold. However, the structure-function paradigm is, in reality, a structure-dynamics-function relationship that determines a proteins binding and activity. As such, drug screening or design without evaluating binding competition across the protein surface or considering the receptors dynamic, substrate-dependent conformational states is incomplete. Substrate-specific unique protein conformations are underexplored and offer novel opportunities for selective therapeutic targeting, though systematic workflows for identifying and exploiting such sites remain limited. Previously, we showed that collagen alters matrix metalloprotease-1 (MMP1) dynamics and that R405 is an allosteric residue on the MMP1 surface that exhibits strong dynamic correlations with its active site. Here, we present a substrate-specific allosteric drug-design framework that targets specific sites on a protein, using collagen-bound MMP1 as a model system. We determined the conformational dynamics of free and collagen-bound MMP1 using all-atom molecular dynamics (MD) simulations and categorized conformations into clusters of similar conformations. We then compared and identified unique conformations that occur only in collagen-bound MMP1 to design drugs against them using a machine-learning approach. The top three unique clusters were used to generate approximately 150,000 candidate compounds that were then screened against both the R405-centered region and all detectable binding pockets across the MMP1 surface. We have found several compounds that bind preferentially around R405 by at least 0.3 kcal/mol relative to competing sites across the surface. This strategy establishes a generalizable framework for designing ligands that preferentially target substrate-specific allosteric sites, providing new opportunities for precision therapeutics that modulate proteins in their biologically relevant functional states. Simple SummaryIn this paper, we establish a substrate-specific allosteric drug-design strategy that integrates all-atom molecular dynamics simulations, conformational clustering, machine-learning-based ligand design, and surface-wide binding-selectivity screening, using collagen-bound MMP1 as a model system. We show that collagen binding reshapes the conformational ensemble of MMP1, creating unique conformational states that are absent or inaccessible in the free enzyme. By identifying these substrate-specific conformations, generating ligands based on the corresponding dynamic fingerprints around the collagen-specific allosteric residue R405, and screening compounds across all binding pockets on the MMP1 surface, we demonstrate preferential targeting of the collagen-specific site relative to competing pockets. These results establish a generalizable framework for designing ligands that selectively recognize biologically relevant substrate-bound conformations rather than static protein structures alone. Substrate-specific allosteric targeting may enable selective modulation of individual protein functions while minimizing off-target interactions, providing new opportunities for precision therapeutics against dynamic protein systems.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 3%
14.5%
2
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.4%
14.2%
3
PLOS Computational Biology
1633 papers in training set
Top 4%
8.3%
4
eLife
5422 papers in training set
Top 18%
4.8%
5
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.3%
3.5%
6
Scientific Reports
3102 papers in training set
Top 38%
3.5%
7
Chemical Science
71 papers in training set
Top 0.5%
3.0%
50% of probability mass above
8
Protein Science
221 papers in training set
Top 0.5%
2.8%
9
Structure
175 papers in training set
Top 1%
2.6%
10
Biophysical Journal
545 papers in training set
Top 2%
2.6%
11
Nature Communications
4913 papers in training set
Top 46%
2.4%
12
Cell Systems
167 papers in training set
Top 5%
2.3%
13
Frontiers in Molecular Biosciences
100 papers in training set
Top 1%
2.1%
14
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.4%
1.9%
15
The Journal of Physical Chemistry B
158 papers in training set
Top 1.0%
1.9%
16
Journal of The Royal Society Interface
189 papers in training set
Top 3%
1.7%
17
PLOS ONE
4510 papers in training set
Top 54%
1.7%
18
Journal of Molecular Biology
217 papers in training set
Top 2%
1.6%
19
IUCrJ
29 papers in training set
Top 0.2%
1.3%
20
Bioinformatics Advances
184 papers in training set
Top 4%
0.9%
21
Physical Biology
43 papers in training set
Top 2%
0.9%
22
Journal of Biological Chemistry
641 papers in training set
Top 3%
0.9%
23
International Journal of Molecular Sciences
453 papers in training set
Top 13%
0.9%
24
iScience
1063 papers in training set
Top 29%
0.8%
25
Advanced Science
249 papers in training set
Top 18%
0.8%
26
Communications Biology
886 papers in training set
Top 22%
0.8%
27
Journal of the American Chemical Society
199 papers in training set
Top 5%
0.7%
28
ACS Omega
90 papers in training set
Top 4%
0.7%
29
Communications Chemistry
39 papers in training set
Top 1%
0.7%
30
Bioinformatics
1061 papers in training set
Top 10%
0.6%