Back

The Synthetic Epitope Atlas: High-Throughput Design and Validation of De Novo Antibody-Antigen Complexes

Altieri, N.; Harman, J. L.; Noble, D.; Murakowska, N.; Eng, A.; McGowan, K. L.; Goodnight, D.; DiPeso, L.; Shikany, C.; Engelhart, E.; Homad, L. J.; Lahman, M. C.; Gandhi, S.; Goodwin, M.; Herbst, K.; Lin, C.; McMurray, M.; Barrett, J.; Agarwal, A. A.; Harrang, J.; Emerson, R. O.; Lopez, R. M.; Younger, D. A.; Lange, A. W.

2026-04-18 synthetic biology
10.64898/2026.04.17.719295 bioRxiv
Show abstract

AO_SCPLOWBSTRACTC_SCPLOWDe novo antibody design models lack sufficient training data to reliably generalize. We demonstrate scalable generation of structural training data for machine learning-driven antibody design by linking in silico designs of antibody-antigen complexes to high-throughput experimental binding validation. Using AlphaSeq, a yeast-based platform for measuring protein binding affinities, we measure the affinity and specificity of thousands of de novo "synthetic epitope proteins" (SEPs) designed to bind to VHHs. The resulting Synthetic Epitope Atlas (SEPIA) pairs over 26 million on- and off-target affinity measurements with computationally designed VHH-SEP "pseudo-structures." We validate strong, specific binding for 1,161 pseudo-structures and >75,000 VHH and SEP mutational variants. We show that these pseudo-structures complement existing structural databases and enable ML models to outperform confidence metrics commonly used to rank de novo antibody designs. Taken together, SEPIA establishes a scalable framework for improving de novo antibody design by augmenting sparse structural data with large-scale experimental binding data.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.