High-PepBinder: A pLM-Guided Latent Diffusion Framework for Affinity-Aware Target-Specific Peptide Design
Qingyi, M.; Zhai, S.; Cao, S.; Zhu, R.; Xu, W.; Zhang, C.; Zhu, N.; Guo, J.; Duan, H.
Show abstract
Peptides, as therapeutic molecules, offer unique advantages in targeting complex protein surfaces, yet their rational design remains limited by the vastness of the sequence space and the constraints of traditional approaches. Here, we propose High-PepBinder, a sequence-only conditional diffusion framework for target-specific peptide generation. Guided by the target protein sequence, High-PepBinder adopts a dual encoder architecture that integrates protein language models (pLMs) with the diffusion model. This approach cascades the peptide generation model with an affinity classifier and enables the generation process to capture affinity-related features of the peptides through lightweight joint optimization. Due to the scarcity of protein-peptide affinity data, we constructed PepPBA, to our knowledge the most comprehensive dataset to date, and established a structure- and physics-based screening pipeline to prioritize top candidates. Results show that High-PepBinder demonstrates competitive performance across multiple peptide generation and affinity-related tasks. For representative targets, including KEAP1, XIAP, and EGFR, the generated peptides preserve key binding geometries and interface patterns of reference peptides in predicted complexes, while maintaining sequence diversity and favorable predicted properties. Overall, High-PepBinder contributes toward a general and sequence-only strategy for peptide design, offering a computational framework for expanding peptide discovery against challenging targets.
Matching journals
The top 5 journals account for 50% of the predicted probability mass.