Bayesian-Steered Structure Prediction of Mechanical Biomolecules Using Twisted Diffusion

Klaus, C.; Sotomayor, M.

2026-05-13 bioinformatics
10.64898/2026.05.11.724187 bioRxiv
Deep learning approaches have revolutionized protein structure prediction. These tools are trained using experimental data and recapitulate reported conformations, but there is great interest in predicting conformations that may be functionally relevant although experimentally underrepresented. Since many modern structure prediction tools use generative artificial intelligence diffusion models, we reframe the search for alternative molecular conformations as that of sampling from a diffusion distribution conditioned on an arbitrary Bayesian likelihood. We implement a twisted diffusion sampler in Boltz-2 to sample this conditioned distribution and demonstrate the utility of this approach, which does not require any additional training of the neural network, by implementing a diffusion analog of steered molecular dynamics simulations applied to mechanical systems. We can reproduce predicted stretched states of fragments of DNA, the muscle protein titin, and the inner-ear protocadherin-15 protein, as well as open states of the MscL ion channel consistent with experimental results. We expect that steered structure predictions will help sample underrepresented and non-equilibrium conformations for many macromolecular systems.
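
The conditioning described in the abstract can be stated generically as Bayes' rule. The symbols below are assumptions chosen for illustration and are not taken from the paper: $x$ is a molecular structure, $p_\theta(x)$ is the distribution sampled by the pretrained diffusion model, $y$ is the imposed condition, and $\mathcal{L}(y \mid x)$ is the user-chosen likelihood.

```latex
p_\theta(x \mid y)
  \;=\; \frac{p_\theta(x)\,\mathcal{L}(y \mid x)}
             {\int p_\theta(x')\,\mathcal{L}(y \mid x')\,\mathrm{d}x'}
  \;\propto\; p_\theta(x)\,\mathcal{L}(y \mid x)
```

In this form only the likelihood term changes from one application to the next, which is consistent with the abstract's statement that the network itself needs no additional training.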

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1. Journal of Chemical Information and Modeling | 207 papers in training set | Top 0.5% | 12.5%
2. PLOS Computational Biology | 1633 papers in training set | Top 3% | 10.0%
3. Biophysical Journal | 545 papers in training set | Top 1.0% | 6.2%
4. Journal of Chemical Theory and Computation | 126 papers in training set | Top 0.2% | 6.2%
5. eLife | 5422 papers in training set | Top 20% | 4.2%
6. Bioinformatics | 1061 papers in training set | Top 5% | 4.2%
7. Nature Machine Intelligence | 61 papers in training set | Top 1% | 3.5%
8. Nature Computational Science | 50 papers in training set | Top 0.2% | 3.0%
9. Proceedings of the National Academy of Sciences | 2130 papers in training set | Top 24% | 2.8%

50% of probability mass above

10. Cell Systems | 167 papers in training set | Top 5% | 2.7%
11. Scientific Reports | 3102 papers in training set | Top 45% | 2.6%
12. Nature Communications | 4913 papers in training set | Top 45% | 2.6%
13. Nano Letters | 63 papers in training set | Top 1% | 2.6%
14. Communications Biology | 886 papers in training set | Top 7% | 1.9%
15. PLOS ONE | 4510 papers in training set | Top 50% | 1.9%
16. Journal of Computational Chemistry | 11 papers in training set | Top 0.1% | 1.7%
17. Computational and Structural Biotechnology Journal | 216 papers in training set | Top 5% | 1.5%
18. The Journal of Physical Chemistry B | 158 papers in training set | Top 1% | 1.5%
19. Nature Methods | 336 papers in training set | Top 5% | 1.2%
20. Nature Biotechnology | 147 papers in training set | Top 6% | 1.2%
21. Nucleic Acids Research | 1128 papers in training set | Top 14% | 1.2%
22. Proteins: Structure, Function, and Bioinformatics | 82 papers in training set | Top 0.7% | 1.1%
23. Molecular Biology and Evolution | 488 papers in training set | Top 3% | 1.1%
24. Genetics | 225 papers in training set | Top 3% | 0.9%
25. Patterns | 70 papers in training set | Top 2% | 0.9%
26. Quantitative Biology | 11 papers in training set | Top 0.5% | 0.9%
27. Science | 429 papers in training set | Top 19% | 0.9%
28. Journal of Cheminformatics | 25 papers in training set | Top 0.5% | 0.8%
29. iScience | 1063 papers in training set | Top 30% | 0.8%
30. PRX Life | 34 papers in training set | Top 0.8% | 0.8%
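
The statement that the top 9 journals account for 50% of the predicted probability mass can be checked from the listed percentages. The sketch below is a minimal illustration: the helper name `journals_to_reach` is hypothetical, and the values are the rounded percentages shown in the list above, so the running total is only approximate.

```python
# Predicted probabilities (percent) for ranks 1-30, copied from the list above.
# These are the rounded values as displayed, not the underlying model outputs.
probs = [12.5, 10.0, 6.2, 6.2, 4.2, 4.2, 3.5, 3.0, 2.8, 2.7,
         2.6, 2.6, 2.6, 1.9, 1.9, 1.7, 1.5, 1.5, 1.2, 1.2,
         1.2, 1.1, 1.1, 0.9, 0.9, 0.9, 0.9, 0.8, 0.8, 0.8]


def journals_to_reach(probs, threshold):
    """Return (k, total): the smallest k whose top-k sum reaches threshold.

    If the threshold is never reached, k is None and total is the full sum.
    """
    total = 0.0
    for k, p in enumerate(probs, start=1):
        total += p
        if total >= threshold:
            return k, total
    return None, total


k, mass = journals_to_reach(probs, 50.0)
print(k, round(mass, 1))  # 9 52.6
```

With these rounded values the first eight journals sum to about 49.8%, just under the threshold, and the ninth brings the total to about 52.6%.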