Back

Several multiple sequence alignment perturbation methods enhance AlphaFold3 sampling of alternative protein states

Eriksson Lidbrink, S.; Nissen, I.; Ahrlind, J. K.; Howard, R. J.; Lindahl, E.

2026-04-03 biophysics
10.64898/2026.04.02.716037 bioRxiv
Show abstract

Protein function often involves multiple conformational states. Several multiple sequence alignment-perturbing strategies, including stochastic subsampling, clustering, and column masking, have been shown to enhance AlphaFold2 (AF2) sampling of alternative protein states. Here, we evaluate these strategies on AlphaFold3 (AF3) and compare their performance with the BioEmu Boltzmann sampling model on 107 proteins with multiple experimentally solved conformational states. We find that unperturbed AF3 samples alternative states with significantly higher TM-scores compared to AF2 and comparable to BioEmu. In particular, all MSA perturbation methods improve AF3 sampling at a statistically significant level, improving the top 1% TM-score by at least 0.05 in approximately 20% of cases each, while rarely worsening the performance. Furthermore, we find that different choices of amino acid masks can improve column-masked AF3 sampling for specific targets. Our results highlight how MSA perturbations remain relevant in AF3, providing a useful tool for understanding dynamic biological processes.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Biophysical Journal
545 papers in training set
Top 0.2%
18.0%
2
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.1%
13.9%
3
PLOS Computational Biology
1633 papers in training set
Top 4%
8.1%
4
The Journal of Physical Chemistry B
158 papers in training set
Top 0.3%
6.2%
5
Journal of Chemical Information and Modeling
207 papers in training set
Top 1%
4.7%
50% of probability mass above
6
Nature Communications
4913 papers in training set
Top 38%
3.8%
7
Structure
175 papers in training set
Top 0.9%
3.5%
8
Frontiers in Molecular Biosciences
100 papers in training set
Top 0.5%
3.5%
9
Protein Science
221 papers in training set
Top 0.5%
3.0%
10
Scientific Reports
3102 papers in training set
Top 48%
2.3%
11
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 29%
2.0%
12
Nature Methods
336 papers in training set
Top 4%
1.8%
13
Journal of Molecular Biology
217 papers in training set
Top 1%
1.8%
14
eLife
5422 papers in training set
Top 39%
1.8%
15
The Journal of Physical Chemistry Letters
58 papers in training set
Top 0.8%
1.7%
16
Physical Biology
43 papers in training set
Top 1%
1.6%
17
Bioinformatics Advances
184 papers in training set
Top 3%
1.3%
18
Cell Reports Methods
141 papers in training set
Top 3%
1.3%
19
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.6%
1.3%
20
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.2%
21
Nature Computational Science
50 papers in training set
Top 1%
1.2%
22
Biophysical Reports
36 papers in training set
Top 0.4%
0.9%
23
Cell Systems
167 papers in training set
Top 11%
0.9%
24
PLOS ONE
4510 papers in training set
Top 67%
0.8%
25
Bioinformatics
1061 papers in training set
Top 10%
0.7%
26
Nucleic Acids Research
1128 papers in training set
Top 18%
0.7%
27
Chemical Science
71 papers in training set
Top 2%
0.7%
28
Communications Biology
886 papers in training set
Top 27%
0.7%