
Divide and Cluster: The DIVINE Framework for Deterministic Top-Down Analysis of Molecular Dynamics Trajectories

Brylle Woody Santos, J.; Chen, L.; Miranda Quintana, R. A.

2026-03-07 biophysics
10.1101/2025.06.20.660828 bioRxiv

We present DIVIsive N-ary Ensembles (DIVINE), a deterministic, top-down clustering framework designed for molecular dynamics (MD) trajectories. DIVINE constructs a complete clustering hierarchy by recursively splitting clusters based on n-ary similarity principles, avoiding the need for O(N²) pairwise distance matrices. It supports multiple cluster selection criteria, including a weighted variance metric, and deterministic anchor initialization strategies such as NANI (N-ary Natural Initiation), ensuring reproducible and structurally meaningful partitions. Testing DIVINE up to a 305 μs folding trajectory of the villin headpiece (HP35) revealed that it matched or exceeded the clustering quality of bisecting k-means while reducing runtime and eliminating stochastic variability. Its single-pass design enables efficient exploration of clustering resolutions without repeated executions. By combining scalability, interpretability, and determinism, DIVINE offers a robust and practical alternative to conventional MD clustering methods. DIVINE is publicly available as part of the MDANCE package: https://github.com/mqcomplab/MDANCE.
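
The top-down idea in the abstract can be illustrated with a minimal sketch. This is not the MDANCE implementation or its API: the function names are hypothetical, the anchor rule is a simple stand-in for NANI, the split is a 2-means refinement with Euclidean distance rather than an n-ary similarity index, and "weighted variance" is assumed here to mean cluster size times variance. What it does share with the description is that no pairwise distance matrix is built, nothing is random, and a single run yields a labelling at every resolution.

```python
import numpy as np


def sum_sq_dev(points):
    """Summed squared deviation from the centroid (size * variance), O(n)."""
    return float(((points - points.mean(axis=0)) ** 2).sum())


def deterministic_bisect(points, n_iter=20):
    """Split points in two with 2-means started from fixed anchors.

    Anchor rule (an assumption, not NANI): the point farthest from the
    centroid, then the point farthest from that first anchor.
    """
    centroid = points.mean(axis=0)
    a = points[np.argmax(((points - centroid) ** 2).sum(axis=1))]
    b = points[np.argmax(((points - a) ** 2).sum(axis=1))]
    centers = np.stack([a, b])
    labels = None
    for _ in range(n_iter):
        # Distances to the two centers only: O(n), never O(n^2).
        d = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        new_labels = d.argmin(axis=1)
        if new_labels.min() == new_labels.max():
            break  # guard against an empty half; keep the previous split
        if labels is not None and np.array_equal(new_labels, labels):
            break  # converged
        labels = new_labels
        centers = np.stack([points[labels == k].mean(axis=0) for k in (0, 1)])
    return labels if labels is not None else new_labels


def divisive_hierarchy(points, max_clusters):
    """Return one labelling per resolution, from 1 up to max_clusters."""
    points = np.asarray(points, dtype=float)
    labels = np.zeros(len(points), dtype=int)
    levels = [labels.copy()]
    for new_id in range(1, max_clusters):
        # Selection criterion: split the cluster with the largest
        # size-weighted variance (assumed reading of "weighted variance").
        scores = [sum_sq_dev(points[labels == k]) for k in range(new_id)]
        target = int(np.argmax(scores))
        if scores[target] == 0.0:
            break  # every remaining cluster is a single repeated point
        idx = np.flatnonzero(labels == target)
        halves = deterministic_bisect(points[idx])
        labels[idx[halves == 1]] = new_id
        levels.append(labels.copy())
    return levels


if __name__ == "__main__":
    # Toy "frames": two well-separated groups of 2-D coordinates.
    frames = np.array([[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]])
    for k, lab in enumerate(divisive_hierarchy(frames, 3), start=1):
        print(k, lab.tolist())
```

Because each level is stored as the hierarchy grows, comparing, say, the 2-cluster and 3-cluster partitions needs no second run, which is the practical benefit the abstract attributes to the single-pass design.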

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1. Nature Methods | 336 papers in training set | Top 0.3% | 18.5%
2. Journal of Chemical Information and Modeling | 207 papers in training set | Top 0.4% | 12.6%
3. Nature Communications | 4913 papers in training set | Top 26% | 6.8%
4. Nature Computational Science | 50 papers in training set | Top 0.1% | 4.8%
5. Proceedings of the National Academy of Sciences | 2130 papers in training set | Top 16% | 4.3%
6. PLOS ONE | 4510 papers in training set | Top 34% | 4.3%
50% of probability mass above
7. Bioinformatics | 1061 papers in training set | Top 5% | 3.9%
8. PLOS Computational Biology | 1633 papers in training set | Top 10% | 3.6%
9. Nucleic Acids Research | 1128 papers in training set | Top 6% | 3.6%
10. Scientific Reports | 3102 papers in training set | Top 47% | 2.4%
11. Bioinformatics Advances | 184 papers in training set | Top 2% | 1.9%
12. Nature Biotechnology | 147 papers in training set | Top 4% | 1.9%
13. Journal of Chemical Theory and Computation | 126 papers in training set | Top 0.5% | 1.8%
14. eLife | 5422 papers in training set | Top 42% | 1.7%
15. Journal of Computational Chemistry | 11 papers in training set | Top 0.1% | 1.7%
16. Frontiers in Molecular Biosciences | 100 papers in training set | Top 2% | 1.7%
17. Biophysical Journal | 545 papers in training set | Top 3% | 1.5%
18. IUCrJ | 29 papers in training set | Top 0.2% | 1.2%
19. Structure | 175 papers in training set | Top 2% | 1.1%
20. Cell Systems | 167 papers in training set | Top 10% | 0.9%
21. The Journal of Physical Chemistry B | 158 papers in training set | Top 2% | 0.7%
22. Communications Biology | 886 papers in training set | Top 24% | 0.7%
23. iScience | 1063 papers in training set | Top 32% | 0.7%
24. Briefings in Bioinformatics | 326 papers in training set | Top 7% | 0.7%
25. Genome Biology | 555 papers in training set | Top 7% | 0.7%
26. Journal of Molecular Biology | 217 papers in training set | Top 4% | 0.7%
27. Acta Crystallographica Section D Structural Biology | 54 papers in training set | Top 0.4% | 0.7%
28. Science | 429 papers in training set | Top 21% | 0.6%
29. The Journal of Chemical Physics | 49 papers in training set | Top 0.5% | 0.6%
30. Proteins: Structure, Function, and Bioinformatics | 82 papers in training set | Top 1% | 0.6%