Back

ABAG-Rank: Improving Model Selection of AlphaFold Antibody-Antigen Complexes by Learning to Rank

Tadiello, M.; Ludaic, M.; Viliuga, V.; Elofsson, A.

2026-03-19 bioinformatics
10.64898/2026.03.17.712376 bioRxiv
Show abstract

MotivationAlphaFold has transformed structural biology with an unprecedented accuracy in modeling protein structures and their interactions with biomolecules, with AlphaFold3 (AF3) achieving state-of-the-art performance. However, AF3 and other methods often struggle to accurately predict the structure of protein complexes that lack strong co-evolutionary information, such as antibody-antigen (Ab-Ag) complexes. One of the fundamental issues is that AF3 often generates accurate predictions, but fails to reliably distinguish them from the much larger set of incorrect ones. ResultsTo address this, we propose ABAG-Rank, a deep neural network that provides an efficient and robust solution for model selection of Ab-Ag interactions from a pool of structural ensembles predicted with AlphaFold. Built on the permutation-invariant DeepSets architecture, ABAG-Rank can process variable-sized ensembles of structural decoys and is directly applicable to prediction settings in which the number of candidates may vary. We train a model on a redundancy-reduced set of all known antibody-antigen complexes and find that simple geometric descriptors, along with confidence scores from AlphaFold, provide rich information about interface quality without requiring intensive physics-based calculations. Our experiments demonstrate that ABAG-Rank significantly outperforms AF3 internal scoring and the ranking performance of existing deep learning baselines. ImplementationSource code can be found at: https://github.com/tadteo/ABAG-Rank

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 0.7%
28.4%
2
Nature Methods
336 papers in training set
Top 0.8%
10.7%
3
Cell Systems
167 papers in training set
Top 2%
6.5%
4
Nature Communications
4913 papers in training set
Top 32%
5.0%
50% of probability mass above
5
PLOS Computational Biology
1633 papers in training set
Top 8%
4.1%
6
Bioinformatics Advances
184 papers in training set
Top 1%
3.7%
7
Protein Science
221 papers in training set
Top 0.3%
3.7%
8
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.7%
9
Nature Machine Intelligence
61 papers in training set
Top 1%
2.8%
10
Journal of Cheminformatics
25 papers in training set
Top 0.2%
2.7%
11
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
2.4%
12
Nature Biotechnology
147 papers in training set
Top 4%
1.8%
13
Structure
175 papers in training set
Top 2%
1.7%
14
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 36%
1.4%
15
Patterns
70 papers in training set
Top 1%
1.3%
16
Cell Reports Methods
141 papers in training set
Top 4%
1.0%
17
Nucleic Acids Research
1128 papers in training set
Top 14%
1.0%
18
BMC Bioinformatics
383 papers in training set
Top 6%
0.9%
19
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.8%
0.9%
20
Nature Computational Science
50 papers in training set
Top 1%
0.8%
21
Scientific Reports
3102 papers in training set
Top 72%
0.8%
22
Science
429 papers in training set
Top 19%
0.8%
23
Communications Biology
886 papers in training set
Top 20%
0.8%
24
Computational and Structural Biotechnology Journal
216 papers in training set
Top 9%
0.7%
25
Journal of Structural Biology
58 papers in training set
Top 2%
0.7%
26
PLOS ONE
4510 papers in training set
Top 71%
0.7%
27
Chemical Science
71 papers in training set
Top 3%
0.5%
28
Protein Engineering, Design and Selection
14 papers in training set
Top 0.1%
0.5%
29
Journal of Molecular Biology
217 papers in training set
Top 5%
0.5%