Back

DynMoCo: a Novel AI Framework to Reveal Modular Substructures of Protein From Molecular Dynamics

Mao, L.; Kwak, M.; Ashkezari, A. H. K.; Li, Z.; Chen, Y.; Cong, P.; Phee, J. H.; Kang, S.; Li, J.; Zhu, C.

2026-02-10 biophysics
10.64898/2026.02.08.704355 bioRxiv
Show abstract

Proteins are dynamic molecular machines whose functions are determined by their structures. While static structures can offer initial insights or hypotheses about protein function, they are often insufficient for a detailed mechanistic understanding. Molecular dynamics (MD) simulations provide atomistic view of proteins dynamic motion and conformational change, but the resulting high-dimensional data are challenging to interpret. Traditional summary statistics and dimensionality-reduction methods often focus on global motions and can overlook regional, yet functionally critical motions. Inspired by approaches from social network science, we introduce a novel perspective for analyzing MD simulations through dynamic community detection, where molecules are modeled as time-evolving graphs, and communities of residues or atoms that move coherently or exhibit functional coupling are identified. We present DynMoCo, a novel deep learning framework that integrates graph convolutional networks with recurrent models for end-to-end dynamic community detection on molecular graphs. Given a MD trajectory, DynMoCo identifies spatially grounded substructures, tracks their evolution over time, and can incorporate structural knowledge to ensure physically meaningful communities. We provide a library of custom-written scripts to allow users to extract and visualize these communites on the MD simulated molecules in motion. We demonstrate the method on force-ramp and force-clamp steered MD simulations of three integrin systems, revealing modular substructures within known domains and characterizing their conformational rearrangements during force-induced unbending. By reducing high-dimensional MD data into interpretable communities, this approach offers new insights into the intrinsic organization and dynamic function of complex biomolecular systems. SIGNIFICANCEProteins often perform their functions through dynamic, locally coordinated motions. Molecular dynamics simulations provide detailed views of these motions but produce high-dimensional data that are challenging to analyze and interpret. We present a novel deep learning model that analyzes molecular dynamics simulations data and identifies structurally coherent and potentially functionally related communities, while tracking their temporal evolution. This analysis tool provides a novel way to analyze MD data transforming it into interpretable representations of modular dynamic, enabling discovery of new mechanistic insights and advancing our understanding of how molecular motions drive biological function.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Methods
336 papers in training set
Top 0.8%
12.2%
2
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 6%
9.8%
3
PLOS Computational Biology
1633 papers in training set
Top 3%
9.8%
4
Nature Computational Science
50 papers in training set
Top 0.1%
9.8%
5
Cell Systems
167 papers in training set
Top 3%
4.2%
6
Nature Communications
4913 papers in training set
Top 36%
4.1%
7
Patterns
70 papers in training set
Top 0.2%
3.9%
50% of probability mass above
8
eLife
5422 papers in training set
Top 27%
3.5%
9
Nature Biotechnology
147 papers in training set
Top 3%
3.5%
10
Bioinformatics
1061 papers in training set
Top 6%
2.8%
11
Science
429 papers in training set
Top 12%
2.0%
12
Nucleic Acids Research
1128 papers in training set
Top 9%
2.0%
13
Scientific Reports
3102 papers in training set
Top 54%
1.8%
14
PLOS ONE
4510 papers in training set
Top 51%
1.8%
15
Genome Biology
555 papers in training set
Top 5%
1.7%
16
Nature
575 papers in training set
Top 11%
1.6%
17
PRX Life
34 papers in training set
Top 0.4%
1.6%
18
iScience
1063 papers in training set
Top 18%
1.4%
19
Bioinformatics Advances
184 papers in training set
Top 3%
1.3%
20
Cell Reports Methods
141 papers in training set
Top 3%
1.2%
21
Structure
175 papers in training set
Top 2%
1.1%
22
Journal of Chemical Information and Modeling
207 papers in training set
Top 3%
0.9%
23
Communications Biology
886 papers in training set
Top 20%
0.9%
24
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.9%
25
Biophysical Journal
545 papers in training set
Top 5%
0.8%
26
Protein Science
221 papers in training set
Top 2%
0.8%
27
Genome Research
409 papers in training set
Top 4%
0.7%
28
Frontiers in Molecular Biosciences
100 papers in training set
Top 6%
0.7%
29
Cell Reports
1338 papers in training set
Top 34%
0.7%
30
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%