Back

Has AlphaFold 3 Solved the Protein Folding Problem for D-Peptides?

Childs, H.; Zhou, P.; Donald, B. R.

2025-03-17 bioinformatics
10.1101/2025.03.14.643307 bioRxiv
Show abstract

Due to the favorable chemical properties of mirrored chiral centers (such as improved stability, bioavailability, and membrane permeability) the computational design of D-peptides targeting biological L-proteins is a valuable area of research. To design these structures in silico, a computational workflow should correctly dock and fold a peptide while maintaining chiral centers. The latest AlphaFold 3 (AF3) from Abramson et al. (2024) enforces a strict chiral violation penalty to maintain chiral centers from model inputs and is reported to have a low chiral violation rate of only 4.4% on a PoseBusters benchmark containing diverse chiral molecules. Herein, we report the results of 3,255 experiments with AF3 to evaluate its ability to predict the fold, chirality, and binding pose of D-peptides in heterochiral complexes. Despite our inputs specifying explicit D-stereocenters, we report that the AF3 chiral violation for D-peptide binders is much higher at 51% across all evaluated predictions; on average the model is as accurate as chance (random chirality choice, L or D, for each peptide residue). Increasing the number of seeds failed to improve this violation rate. The AF3 predictions exhibit incorrect folds and binding poses, with D-peptides commonly oriented incorrectly in the L-protein binding pocket. Confidence metrics returned by AF3 also fail to distinguish predictions with low chirality violation and correct docking vs. predictions with high chirality violation and incorrect docking. We conclude that AF3 is a poor predictor of D-peptide chirality, fold, and binding pose. Finally, we propose solutions to improve this model. Significance StatementAlphaFold 3 (AF3) is a model trained to predict protein interactions. This algorithm is tuned to respect chiral centers (L and D). Changing the chirality of even one protein residue can significantly alter chemical properties such as binding and stability. Therefore, an algorithm should exhibit a chiral center error rate of 0%. Although the original AF3 authors reported a 4.4% chirality violation, we have found that the rate for D-peptides is much higher at [~]50%. Our data highlights a crucial structural prediction error in AF3 and demonstrates that this widely used model is as accurate on average as chance (random chirality choice, L or D, for each peptide residue). These results indicate structure prediction of D-peptides is an outstanding problem.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Bioinformatics Advances
184 papers in training set
Top 0.1%
22.3%
2
Bioinformatics
1061 papers in training set
Top 2%
18.4%
3
Protein Science
221 papers in training set
Top 0.1%
10.0%
50% of probability mass above
4
Journal of Chemical Information and Modeling
207 papers in training set
Top 1%
4.8%
5
Journal of Cheminformatics
25 papers in training set
Top 0.1%
4.8%
6
Structure
175 papers in training set
Top 0.8%
3.6%
7
PLOS Computational Biology
1633 papers in training set
Top 11%
3.0%
8
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
2.3%
9
BMC Bioinformatics
383 papers in training set
Top 4%
2.3%
10
Frontiers in Bioinformatics
45 papers in training set
Top 0.1%
2.1%
11
Scientific Reports
3102 papers in training set
Top 51%
2.1%
12
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.8%
13
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.5%
1.7%
14
PeerJ
261 papers in training set
Top 8%
1.5%
15
PLOS ONE
4510 papers in training set
Top 59%
1.3%
16
Biophysical Journal
545 papers in training set
Top 3%
1.3%
17
Computational Biology and Chemistry
23 papers in training set
Top 0.3%
1.2%
18
ImmunoInformatics
11 papers in training set
Top 0.2%
0.9%
19
International Journal of Molecular Sciences
453 papers in training set
Top 13%
0.9%
20
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
21
Biomolecules
95 papers in training set
Top 3%
0.7%
22
Protein Engineering, Design and Selection
14 papers in training set
Top 0.1%
0.6%