Back

How Generative Models Approach Molecular Conformational Sampling

B E, N.; Mondal, J.

2026-04-14 biophysics
10.64898/2026.04.10.717851 bioRxiv
Show abstract

Characterising equilibrium conformational ensembles with deep generative models requires assessing not only whether a model reproduces the target distribution, but also the mechanism of how it arrives here. Here we examine two distinct routes to generative conformational sampling-- stochastic relaxation and deterministic transport--through a study of denoising diffusion probabilistic models (DDPM) and rectified-flow (RF) models across molecular systems of increasing complexity. Using systems of increasing complexity, including a multimodal two-dimensional potential, the folded mini-protein Trp-cage, and a high-dimensional dihedral subspace of the intrinsically disordered protein -synuclein, we show that the key distinction between these paradigms lies not only in endpoint fidelity but in how distributional error is resolved during sampling. Diffusion models converge via pronounced late-stage stochastic relaxation and exhibits robust recovery of configurational breadth across neural architectures. Rectified flow approaches the target more gradually through deterministic transport and therefore depends much more strongly on architectural expressivity, particularly in heterogeneous high-dimensional landscapes. Analyses of entropy and moment evolution further show that diffusion more reliably restores both ensemble location and fluctuation structure, whereas RF requires Transformer-level feature mixing to represent the transport geometry accurately. These results establish convergence mechanism as a key design principle for generative sampling.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 2%
14.4%
2
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 3%
14.1%
3
Physical Review X
23 papers in training set
Top 0.1%
12.3%
4
Journal of The Royal Society Interface
189 papers in training set
Top 0.5%
6.7%
5
eLife
5422 papers in training set
Top 14%
6.3%
50% of probability mass above
6
Nature Communications
4913 papers in training set
Top 31%
6.2%
7
Biophysical Journal
545 papers in training set
Top 1%
4.2%
8
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.3%
3.5%
9
Journal of Chemical Information and Modeling
207 papers in training set
Top 1%
2.7%
10
Cell Systems
167 papers in training set
Top 6%
1.9%
11
Scientific Reports
3102 papers in training set
Top 59%
1.7%
12
Physical Review Research
46 papers in training set
Top 0.4%
1.7%
13
Chemical Science
71 papers in training set
Top 1%
1.7%
14
PRX Life
34 papers in training set
Top 0.5%
1.5%
15
Nature Computational Science
50 papers in training set
Top 0.9%
1.3%
16
The Journal of Physical Chemistry B
158 papers in training set
Top 1%
1.3%
17
Physical Biology
43 papers in training set
Top 2%
0.9%
18
The Journal of Physical Chemistry Letters
58 papers in training set
Top 1%
0.9%
19
Communications Physics
12 papers in training set
Top 0.4%
0.9%
20
Cell Reports
1338 papers in training set
Top 32%
0.8%
21
PLOS ONE
4510 papers in training set
Top 68%
0.7%
22
ACS Nano
99 papers in training set
Top 5%
0.6%
23
The Journal of Chemical Physics
49 papers in training set
Top 0.5%
0.6%