Back

Staged heavy-chain filtering enables Fab discovery from combinatorially intractable library spaces

Kim, Y.; Kwon, H.; Hong, J.; Kang, C. K.; Park, W. B.; Kim, H.-R.; Lee, C.-H.

2026-05-13 bioengineering
10.64898/2026.05.10.724059 bioRxiv
Show abstract

BackgroundCombinatorial fragment antigen-binding (Fab) libraries encode an immense heavy-light chain pairing space, often exceeding 10{superscript 1} possible combinations, which far surpasses the diversity that can be experimentally constructed and screened in display systems. As a result, direct Fab screening samples only a small fraction of the theoretical search space, creating a practical bottleneck for functional binder discovery. ResultsHere, we frame Fab discovery as a staged search problem by decoupling heavy-chain (HC) and light-chain (LC) exploration. We implemented a sequential HC preselection-remating workflow in yeast surface display, in which antigen-reactive HC variants are first enriched and subsequently recombined with a diverse LC repertoire to reconstruct a focused Fab library. In a SARS-CoV-2 spike-targeted campaign, HC and LC libraries of 2.05 x 10 and 2.33 x 10 members corresponded to a theoretical pairing space of approximately 4.8 x 10{superscript 1} combinations. Sequential HC enrichment followed by LC remating allowed recovery of multiple functional Fab clones from a tractable library scale of approximately 10, including clones that shared a common HC scaffold but carried distinct LC partners. A representative recombinant IgG output showed broad but heterogeneous spike/RBD binding, measurable pseudovirus neutralization activity (EC = 11.1 nM), and compatibility with standard early biophysical characterization after full-length IgG reformatting. ConclusionsThese results provide proof of principle that combinatorial Fab discovery can be approached as a staged exploration problem under realistic library-size constraints. By focusing downstream Fab reconstruction on an antigen-compatible HC subspace, sequential HC preselection followed by LC remating offers a practical strategy for exploring otherwise intractable antibody pairing landscapes in eukaryotic display systems.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
mAbs
28 papers in training set
Top 0.1%
18.4%
2
Nature Communications
4913 papers in training set
Top 19%
10.0%
3
Cell Systems
167 papers in training set
Top 1%
8.3%
4
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 12%
6.3%
5
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.9%
4.8%
6
Advanced Science
249 papers in training set
Top 5%
3.9%
50% of probability mass above
7
Antibody Therapeutics
16 papers in training set
Top 0.1%
3.9%
8
Protein Engineering, Design and Selection
14 papers in training set
Top 0.1%
3.6%
9
Communications Biology
886 papers in training set
Top 3%
3.0%
10
Nature Methods
336 papers in training set
Top 4%
1.9%
11
PLOS Computational Biology
1633 papers in training set
Top 15%
1.9%
12
Cell Reports Methods
141 papers in training set
Top 2%
1.7%
13
Bioinformatics
1061 papers in training set
Top 7%
1.6%
14
Nature Biotechnology
147 papers in training set
Top 5%
1.5%
15
Nature Biomedical Engineering
42 papers in training set
Top 1%
1.3%
16
Frontiers in Immunology
586 papers in training set
Top 5%
1.2%
17
Nucleic Acids Research
1128 papers in training set
Top 14%
1.1%
18
Science Advances
1098 papers in training set
Top 24%
1.1%
19
Scientific Reports
3102 papers in training set
Top 68%
1.1%
20
Protein Science
221 papers in training set
Top 1%
0.9%
21
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 5%
0.9%
22
Cell Reports Medicine
140 papers in training set
Top 7%
0.9%
23
PLOS ONE
4510 papers in training set
Top 66%
0.8%
24
Cell Reports
1338 papers in training set
Top 33%
0.7%
25
BMC Genomics
328 papers in training set
Top 6%
0.7%