Back

ProteinFlux: accurate, rapid and scalable generative prediction of protein dynamics driven by post-translational modifications

Qian, Q.; Peng, J.; Ma, D.; Liu, K.; Cheng, Y.; Deng, Y.; Zhao, J.; Su, S.; Yao, Y.; Qu, Y.; Fu, R.; Liu, J.; Zhao, M.; Xiao, Y.; Wang, K.; Wu, Y.; Wang, Y.; Xu, Q.; Wang, J.; Hay, D. C.; Ke, Y.; Wang, Y.; Shipston, M. J.; Chi, Y.

2026-05-11 bioinformatics
10.64898/2026.05.06.721822 bioRxiv
Show abstract

The function of proteins, the building blocks of life, in health and disease depends not only on their 3D-conformational states but most importantly on the dynamic transition between states controlled by a wide array of post-translational modifications (PTMs). Recent major advances have been made in our ability to predict static 3D structures; however, understanding and predicting the impact of PTMs on protein conformational dynamics remains a major question and challenge in the field. Molecular dynamics (MD) simulation remains the major computational approach for studying protein dynamics. However, the high computational cost, lack of integration of PTMs as conditioning inputs and inefficient generation of continuous protein dynamics largely precludes PTM-regulated conformational dynamics and the study of slow conformational processes. To address this critical bottleneck, we developed ProteinFlux, a flow-matching generative framework that links PTM-conditioned conformational dynamics to evolutionary constraints encoded by PTM sites. Evolutionary information plays a critical role in capturing conformational dynamics beyond sequence identity, and PTM sites inherently encode evolutionary constraints critical to protein functional regulation. We therefore built FluxSite, a dual-modal PTM site predictor that integrates sequence evolutionary information and 3D structural features to generate a continuous conditional signal encoding conservation and functional importance for each predicted site. FluxSite achieves robust generalization across 18 PTM types and 30 disease-associated proteomes. ProteinFlux generates phosphorylation-conditioned, all-atom conformational trajectories across diverse protein fold classes, faithfully reproducing both thermodynamic properties such as free energy landscapes and kinetic features such as conformational transition pathways. It outperforms state-of-the-art predictors while achieving inference speeds several orders of magnitude faster than traditional MD. In addition, we introduce DynaMo-phos, a benchmark dataset of phosphorylated protein MD simulations. Together, ProteinFlux, FluxSite and DynaMo-phos provide a scalable, high-throughput platform for elucidating PTM-driven conformational mechanisms, with potential applications across allosteric drug design, functional annotation of disease-associated modifications and mechanism-guided therapeutic development.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
14.1%
2
Cell Systems
167 papers in training set
Top 0.7%
14.1%
3
Nature Communications
4913 papers in training set
Top 19%
9.9%
4
Nature Methods
336 papers in training set
Top 2%
7.1%
5
Nature Machine Intelligence
61 papers in training set
Top 0.4%
6.7%
50% of probability mass above
6
Advanced Science
249 papers in training set
Top 4%
4.2%
7
Nature Biotechnology
147 papers in training set
Top 3%
3.5%
8
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 23%
3.2%
9
Briefings in Bioinformatics
326 papers in training set
Top 2%
2.8%
10
PLOS Computational Biology
1633 papers in training set
Top 15%
1.9%
11
Science
429 papers in training set
Top 13%
1.9%
12
Nucleic Acids Research
1128 papers in training set
Top 10%
1.9%
13
Communications Chemistry
39 papers in training set
Top 0.3%
1.7%
14
Nature Computational Science
50 papers in training set
Top 0.6%
1.7%
15
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
1.2%
16
Genome Biology
555 papers in training set
Top 6%
1.2%
17
Bioinformatics Advances
184 papers in training set
Top 4%
1.1%
18
Structure
175 papers in training set
Top 3%
0.9%
19
Patterns
70 papers in training set
Top 2%
0.9%
20
Computational and Structural Biotechnology Journal
216 papers in training set
Top 8%
0.9%
21
Protein Science
221 papers in training set
Top 1%
0.9%
22
Science Advances
1098 papers in training set
Top 29%
0.8%
23
Scientific Reports
3102 papers in training set
Top 75%
0.7%
24
Genome Medicine
154 papers in training set
Top 8%
0.7%
25
Cell Reports Methods
141 papers in training set
Top 5%
0.7%
26
Communications Biology
886 papers in training set
Top 25%
0.7%
27
The American Journal of Human Genetics
206 papers in training set
Top 4%
0.7%
28
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
29
Nature
575 papers in training set
Top 17%
0.6%
30
eLife
5422 papers in training set
Top 62%
0.6%