Back

The Untangle Challenge for accurate ensemble models

Hopkins, M. S.; Terwilliger, T. C.; Afonine, P.; Ginn, H. M.; HOLTON, J. M.

2026-02-22 biophysics
10.64898/2026.02.21.706873 bioRxiv
Show abstract

We report the discovery of a new class of local minima that has severely limited the accuracy of macromolecular models. Termed density misfit barrier traps, these minima explain much of the poor fit between macromolecular models and experimental data relative to that of smaller molecules: not just high R factors, but distorted chemical geometry. We postulated that proteins exist as an ensemble of conformations that each have good geometry, but refinement algorithms have been unable to converge to them due to a tangling phenomenon arising from these traps. To demonstrate, a synthetic ground truth data set was generated, consisting of a 2-member ensemble with excellent geometry. A series of starting models, each trapped in increasingly difficult local minima, were prepared, a unified validation score defined, and an open Challenge issued. This Challenge inspired algorithms for escaping such traps, and new programs have been released that are expected to substantially improve the accuracy of macromolecular ensemble models. SynopsisA synthetic 2-member conformational ensemble of a small protein and corresponding electron density data was generated to demonstrate how topological local minima hinder simultaneous agreement with density data and chemical geometry restraints in conventional structure refinement.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Structure
175 papers in training set
Top 0.1%
22.4%
2
Acta Crystallographica Section D Structural Biology
54 papers in training set
Top 0.1%
22.4%
3
IUCrJ
29 papers in training set
Top 0.1%
10.0%
50% of probability mass above
4
Protein Science
221 papers in training set
Top 0.2%
6.3%
5
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.1%
6.3%
6
Nature Communications
4913 papers in training set
Top 33%
4.8%
7
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 16%
4.3%
8
PLOS ONE
4510 papers in training set
Top 54%
1.7%
9
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
1.7%
10
eLife
5422 papers in training set
Top 45%
1.5%
11
Journal of Structural Biology
58 papers in training set
Top 0.9%
1.5%
12
Science
429 papers in training set
Top 16%
1.3%
13
Journal of Molecular Biology
217 papers in training set
Top 2%
1.2%
14
Nature
575 papers in training set
Top 13%
1.2%
15
The Journal of Physical Chemistry B
158 papers in training set
Top 2%
1.1%
16
Nature Structural & Molecular Biology
218 papers in training set
Top 4%
0.9%
17
Journal of Structural Biology: X
15 papers in training set
Top 0.2%
0.9%
18
Journal of Applied Crystallography
14 papers in training set
Top 0.1%
0.9%
19
PLOS Computational Biology
1633 papers in training set
Top 25%
0.7%
20
The Journal of Physical Chemistry Letters
58 papers in training set
Top 2%
0.7%
21
Frontiers in Molecular Biosciences
100 papers in training set
Top 5%
0.7%