Back

AI-Driven Reconstruction of the Research Paradigm for Phase Separation in Membraneless Organelle

ding, y.; lu, t.; Li, y.

2026-04-02 cell biology
10.64898/2026.03.31.715491 bioRxiv
Show abstract

Liquid-liquid phase separation (LLPS) of biomacromolecules is a key mechanism driving the formation of membraneless organelles (MLOs) within cells, playing a crucial role in fundamental biological processes such as cell proliferation and stress response. Accurately understanding and predicting the phase separation propensity of proteins is essential for unraveling the assembly mechanisms of MLOs and their functions under both physiological and pathological conditions. Traditional research methods primarily rely on biochemical experiments, which are limited by low throughput, high cost, and difficulty in systematically exploring sequence-phase transition relationships. This study proposes and implements a novel three-stage, iterative paradigm based on artificial intelligence (AI) to propel phase separation research towards systematization, predictability, and mechanistic understanding. O_LIBenchmark Model Construction: A preliminary predictive model was established based on a Multilayer Perceptron (MLP) neural network, and the driving effect of phenylalanine/tyrosine (F/Y) residue-mediated {pi}-{pi} interactions on LLPS was validated. C_LIO_LIModel Robustness Enhancement: The model was optimized through adversarial training strategies, which effectively identified and eliminated misclassifications of "highly disordered non-phase-separating" trap sequences. This significantly improved the models generalization capability and reliability when handling complex, real-world sequences. C_LIO_LIPhysical Mechanism Integration and Functional Expansion: Incorporating the Uniform Manifold Approximation and Projection (UMAP) manifold learning method and constraints from non-equilibrium thermodynamics, a "fingerprint space" capable of characterizing the thermodynamic behavior of phase separation was constructed. This space enables cluster analysis of different MLO types, and the model can output a thermodynamic stability score for protein phase separation. Based on this score, we identified 10 high-confidence candidate proteins with the potential to form novel MLOs. The paradigm established in this study upgrades phase separation prediction from the traditional "binary classification" approach to a novel research framework characterized by "physical mechanism analysis + novel MLO discovery." It provides the phase separation field with a computational tool that combines high accuracy, strong robustness, and good physical interpretability. C_LI

Matching journals

The top 14 journals account for 50% of the predicted probability mass.

1
Advanced Science
249 papers in training set
Top 0.5%
15.0%
2
eLife
5422 papers in training set
Top 16%
4.9%
3
Computers in Biology and Medicine
120 papers in training set
Top 0.8%
3.6%
4
Communications Biology
886 papers in training set
Top 3%
3.1%
5
International Journal of Biological Macromolecules
65 papers in training set
Top 0.8%
2.9%
6
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
2.9%
7
Patterns
70 papers in training set
Top 0.3%
2.8%
8
Nature Communications
4913 papers in training set
Top 43%
2.8%
9
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 2%
2.6%
10
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 3%
2.5%
11
International Journal of Molecular Sciences
453 papers in training set
Top 4%
2.4%
12
Communications Chemistry
39 papers in training set
Top 0.1%
2.1%
13
Scientific Reports
3102 papers in training set
Top 49%
2.1%
14
iScience
1063 papers in training set
Top 10%
2.1%
50% of probability mass above
15
Briefings in Bioinformatics
326 papers in training set
Top 3%
1.9%
16
Nature Machine Intelligence
61 papers in training set
Top 2%
1.9%
17
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
1.9%
18
PLOS Computational Biology
1633 papers in training set
Top 16%
1.7%
19
Communications Physics
12 papers in training set
Top 0.1%
1.7%
20
PLOS ONE
4510 papers in training set
Top 53%
1.7%
21
Angewandte Chemie International Edition
81 papers in training set
Top 2%
1.5%
22
Heliyon
146 papers in training set
Top 3%
1.4%
23
Chemical Engineering Journal
10 papers in training set
Top 0.3%
1.4%
24
npj Systems Biology and Applications
99 papers in training set
Top 1%
1.4%
25
Frontiers in Molecular Biosciences
100 papers in training set
Top 3%
1.2%
26
Journal of Proteome Research
215 papers in training set
Top 2%
0.8%
27
Frontiers in Plant Science
240 papers in training set
Top 5%
0.8%
28
Cell Discovery
54 papers in training set
Top 5%
0.8%
29
Protein Science
221 papers in training set
Top 2%
0.8%
30
Computational Biology and Chemistry
23 papers in training set
Top 0.5%
0.8%