Back

Brewing COFFEE: a sequence-specific coarse-grained energy function for simulations of DNA-protein complexes

Chakraborty, D.; Mondal, B.; Thirumalai, D.

2023-06-08 biophysics
10.1101/2023.06.07.544064 bioRxiv
Show abstract

DNA-protein interactions are pervasive in a number of biophysical processes ranging from transcription, gene expression, to chromosome folding. To describe the structural and dynamic properties underlying these processes accurately, it is important to create transferable computational models. Toward this end, we introduce Coarse grained force field for energy estimation, COFFEE, a robust framework for simulating DNA-protein complexes. To brew COFFEE, we integrated the energy function in the Self-Organized Polymer model with Side Chains for proteins and the Three Interaction Site model for DNA in a modular fashion, without re-calibrating any of the parameters in the original force-fields. A unique feature of COFFEE is that it describes sequence-specific DNA-protein interactions using a statistical potential (SP) derived from a dataset of high-resolution crystal structures. The only parameter in COFFEE is the strength ({lambda}DNAPRO) of the DNA-protein contact potential. For an optimal choice of{lambda} DNAPRO, the crystallographic B-factors for DNA-protein complexes, with varying sizes and topologies, are quantitatively reproduced. Without any further readjustments to the force-field parameters, COFFEE predicts the scattering profiles that are in quantitative agreement with SAXS experiments as well as chemical shifts that are consistent with NMR. We also show that COFFEE accurately describes the salt-induced unraveling of nucleosomes. Strikingly, our nucleosome simulations explain the destabilization effect of ARG to LYS mutations, which does not alter the balance of electrostatic interactions, but affects chemical interactions in subtle ways. The range of applications attests to the transferability of COFFEE, and we anticipate that it would be a promising framework for simulating DNA-protein complexes at the molecular length-scale. Graphical TOC Entry O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=81 SRC="FIGDIR/small/544064v2_ufig1.gif" ALT="Figure 1"> View larger version (22K): org.highwire.dtl.DTLVardef@1190c18org.highwire.dtl.DTLVardef@169098eorg.highwire.dtl.DTLVardef@f24e75org.highwire.dtl.DTLVardef@1fd0bd1_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.1%
23.1%
2
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.4%
14.7%
3
Biophysical Journal
545 papers in training set
Top 0.7%
7.0%
4
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1.0%
4.4%
5
Frontiers in Molecular Biosciences
100 papers in training set
Top 0.2%
4.3%
50% of probability mass above
6
PLOS Computational Biology
1633 papers in training set
Top 9%
3.8%
7
The Journal of Chemical Physics
49 papers in training set
Top 0.1%
3.7%
8
Journal of Computational Chemistry
11 papers in training set
Top 0.1%
3.7%
9
Bioinformatics
1061 papers in training set
Top 6%
2.7%
10
The Journal of Physical Chemistry B
158 papers in training set
Top 0.8%
2.4%
11
The Journal of Physical Chemistry Letters
58 papers in training set
Top 0.6%
2.1%
12
Nucleic Acids Research
1128 papers in training set
Top 9%
1.9%
13
eLife
5422 papers in training set
Top 39%
1.8%
14
PLOS ONE
4510 papers in training set
Top 53%
1.7%
15
Protein Science
221 papers in training set
Top 1.0%
1.5%
16
Biophysical Reports
36 papers in training set
Top 0.3%
1.4%
17
Physical Chemistry Chemical Physics
34 papers in training set
Top 0.4%
1.4%
18
Journal of Molecular Biology
217 papers in training set
Top 2%
1.3%
19
ACS Omega
90 papers in training set
Top 3%
0.9%
20
Physical Biology
43 papers in training set
Top 2%
0.9%
21
The European Physical Journal E
15 papers in training set
Top 0.1%
0.8%
22
Nature Communications
4913 papers in training set
Top 62%
0.8%
23
Scientific Reports
3102 papers in training set
Top 74%
0.8%
24
Bioinformatics Advances
184 papers in training set
Top 5%
0.8%
25
Entropy
20 papers in training set
Top 0.5%
0.7%
26
iScience
1063 papers in training set
Top 36%
0.7%
27
Chemical Science
71 papers in training set
Top 3%
0.5%