Back

Structure-Based TCR-pMHC Binding Prediction and Generalization to Unseen Peptides

Abeer, A. N. M. N.; Roy, R. S.; Qian, X.; Yoon, B.-J.

2026-02-23 bioinformatics
10.64898/2026.02.21.707231 bioRxiv
Show abstract

The interaction between T-cell receptors (TCRs) with the peptide-bound major histocompatibility complex (MHC) intricately impacts the functional specificity of T-cell-mediated adaptive immune response. Consequently, implication in immunotherapy has contributed to the ever-growing computational methods for TCR recognition, which have recently attracted structure-based approaches due to advancements in protein structure modeling. Despite access to structural information of the predicted binding interface, graph neural network (GNN)-based TCR-pMHC binding specificity classifiers tend to show poor accuracy for samples with unseen peptides. In this work, we comprehensively assess the potential factors that critically impact the generalization performance of classifiers trained with computationally predicted structures. Specifically, our experiments focus on analyzing the sensitivity of such predictors to the interaction features in the TCR-pMHC interface and the structural uncertainty. Building on the analysis, we demonstrate how the design of classifier architecture with auxiliary training objectives can improve the generalization performance to novel peptides not yet seen during model training. Overall, our work highlights the challenges of unseen peptide generalization from different perspectives of the GNN-based classifier paradigm, showcasing the strengths and weaknesses of the current state-of-the-art approaches in the generalization landscape.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.2%
23.5%
2
Briefings in Bioinformatics
326 papers in training set
Top 0.3%
10.5%
3
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.4%
6.6%
4
PLOS Computational Biology
1633 papers in training set
Top 5%
6.6%
5
Bioinformatics
1061 papers in training set
Top 5%
4.5%
50% of probability mass above
6
Frontiers in Immunology
586 papers in training set
Top 2%
4.5%
7
Computers in Biology and Medicine
120 papers in training set
Top 0.5%
4.3%
8
ImmunoInformatics
11 papers in training set
Top 0.1%
3.7%
9
Scientific Reports
3102 papers in training set
Top 43%
2.8%
10
Computational Biology and Chemistry
23 papers in training set
Top 0.1%
2.0%
11
International Journal of Molecular Sciences
453 papers in training set
Top 6%
2.0%
12
Bioinformatics Advances
184 papers in training set
Top 3%
1.4%
13
Nature Machine Intelligence
61 papers in training set
Top 2%
1.4%
14
PLOS ONE
4510 papers in training set
Top 62%
1.0%
15
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.7%
1.0%
16
Molecules
37 papers in training set
Top 1%
1.0%
17
Journal of Cheminformatics
25 papers in training set
Top 0.5%
0.9%
18
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.8%
19
Advanced Science
249 papers in training set
Top 17%
0.8%
20
BMC Bioinformatics
383 papers in training set
Top 6%
0.8%
21
Frontiers in Bioinformatics
45 papers in training set
Top 0.7%
0.8%
22
Expert Systems with Applications
11 papers in training set
Top 0.3%
0.8%
23
Frontiers in Genetics
197 papers in training set
Top 9%
0.8%
24
Biomolecules
95 papers in training set
Top 2%
0.8%
25
GigaScience
172 papers in training set
Top 3%
0.8%
26
Journal of Structural Biology
58 papers in training set
Top 2%
0.7%
27
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.8%
0.7%
28
Frontiers in Molecular Biosciences
100 papers in training set
Top 6%
0.7%
29
Nature Communications
4913 papers in training set
Top 65%
0.7%
30
Biophysical Journal
545 papers in training set
Top 6%
0.7%