Back

PAbFold: Linear Antibody Epitope Prediction using AlphaFold2

DeRoo, J.; Terry, J. S.; Zhao, N.; Stasevich, T. J.; Snow, C.; Geiss, B. J.

2024-12-20 molecular biology
10.1101/2024.04.19.590298 bioRxiv
Show abstract

Defining the binding epitopes of antibodies is essential for understanding how they bind to their antigens and perform their molecular functions. However, while determining linear epitopes of monoclonal antibodies can be accomplished utilizing well-established empirical procedures, these approaches are generally labor- and time-intensive and costly. To take advantage of the recent advances in protein structure prediction algorithms available to the scientific community, we developed a calculation pipeline based on the localColabFold implementation of AlphaFold2 that can predict linear antibody epitopes by predicting the structure of the complex between antibody heavy and light chains and target peptide sequences derived from antigens. We found that this AlphaFold2 pipeline, which we call PAbFold, was able to accurately flag known epitope sequences for several well-known antibody targets (HA / Myc) when the target sequence was broken into small overlapping linear peptides and antibody complementarity determining regions (CDRs) were grafted onto several different antibody framework regions in the single-chain antibody fragment (scFv) format. To determine if this pipeline was able to identify the epitope of a novel antibody with no structural information publicly available, we determined the epitope of a novel anti-SARS-CoV-2 nucleocapsid targeted antibody using our method and then experimentally validated our computational results using peptide competition ELISA assays. These results indicate that the AlphaFold2-based PAbFold pipeline we developed is capable of accurately identifying linear antibody epitopes in a short time using just antibody and target protein sequences. This emergent capability of the method is sensitive to methodological details such as peptide length, AlphaFold2 neural network versions, and multiple-sequence alignment database. PAbFold is available at https://github.com/jbderoo/PAbFold.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Antibody Therapeutics
16 papers in training set
Top 0.1%
22.8%
2
Bioinformatics
1061 papers in training set
Top 3%
10.2%
3
Protein Science
221 papers in training set
Top 0.1%
6.9%
4
mAbs
28 papers in training set
Top 0.1%
6.9%
5
PLOS ONE
4510 papers in training set
Top 31%
4.9%
50% of probability mass above
6
PLOS Computational Biology
1633 papers in training set
Top 9%
3.6%
7
Nature Machine Intelligence
61 papers in training set
Top 0.9%
3.6%
8
Journal of Molecular Biology
217 papers in training set
Top 0.9%
2.6%
9
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
2.4%
10
Frontiers in Immunology
586 papers in training set
Top 4%
1.9%
11
Frontiers in Bioinformatics
45 papers in training set
Top 0.2%
1.7%
12
eLife
5422 papers in training set
Top 41%
1.7%
13
Viruses
318 papers in training set
Top 3%
1.7%
14
Nature Communications
4913 papers in training set
Top 53%
1.5%
15
Journal of Biological Chemistry
641 papers in training set
Top 2%
1.3%
16
Scientific Reports
3102 papers in training set
Top 66%
1.2%
17
Structure
175 papers in training set
Top 2%
1.2%
18
Biology Methods and Protocols
53 papers in training set
Top 2%
0.9%
19
ImmunoInformatics
11 papers in training set
Top 0.2%
0.9%
20
Journal of Chemical Information and Modeling
207 papers in training set
Top 3%
0.9%
21
BMC Bioinformatics
383 papers in training set
Top 6%
0.8%
22
Cell Systems
167 papers in training set
Top 12%
0.8%
23
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
24
Biophysical Journal
545 papers in training set
Top 5%
0.8%
25
Biomolecules
95 papers in training set
Top 2%
0.8%
26
iScience
1063 papers in training set
Top 31%
0.8%
27
Bioinformatics Advances
184 papers in training set
Top 5%
0.7%
28
Journal of Proteome Research
215 papers in training set
Top 2%
0.7%
29
Communications Biology
886 papers in training set
Top 32%
0.5%
30
Nucleic Acids Research
1128 papers in training set
Top 21%
0.5%