Back

A structure-informed deep learning framework for modeling TCR-peptide-HLA interactions

Cao, K.; Li, R.; Strazar, M.; Brown, E. M.; Nguyen, P. N. U.; Pust, M.-M.; Park, J.; Graham, D. B.; Ashenberg, O.; Uhler, C.; Xavier, R.

2026-04-02 bioinformatics
10.64898/2026.03.31.715361 bioRxiv
Show abstract

The interaction between T cell receptors (TCRs), peptides, and human leukocyte antigens (HLAs) underlies antigen-specific T cell immunity. Despite substantial advances in peptide- HLA presentation prediction, accurate modeling of coupled TCR-peptide-HLA recognition remains underdeveloped, limiting applications such as TCR and neoepitope prioritization in cancer and antigen identification in autoimmunity. Here we present StriMap, a unified framework for predicting TCR-peptide-HLA interactions by integrating physicochemical, se uence-context, and structural features at recognition interfaces. StriMap achieves state-of-the-art performance with improved generalizability and enables applications in both cancer and autoimmunity. As a case study in ankylosing spondylitis (AS), we screened 13 million peptides derived from 43,241 bacterial proteins and identified candidate molecular mimics that were experimentally validated to activate T cells expressing an AS-associated TCR. Notably, a top validated peptide was enriched in patients with inflammatory bowel disease (IBD), suggesting potential shared microbial triggers between AS and IBD. Overall, StriMap provides a generalizable framework for rational immunotherapy design and for dissecting antigenic drivers of autoimmunity.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Advanced Science
249 papers in training set
Top 0.4%
17.5%
2
Nature Communications
4913 papers in training set
Top 10%
14.7%
3
Nature Machine Intelligence
61 papers in training set
Top 0.1%
12.7%
4
Frontiers in Immunology
586 papers in training set
Top 2%
4.0%
5
Genome Medicine
154 papers in training set
Top 2%
3.9%
50% of probability mass above
6
Cell Systems
167 papers in training set
Top 3%
3.7%
7
Nucleic Acids Research
1128 papers in training set
Top 5%
3.7%
8
Cell Reports Medicine
140 papers in training set
Top 1%
3.6%
9
Briefings in Bioinformatics
326 papers in training set
Top 2%
2.7%
10
Bioinformatics
1061 papers in training set
Top 6%
2.4%
11
PLOS Computational Biology
1633 papers in training set
Top 14%
2.1%
12
Cell Research
49 papers in training set
Top 1%
1.8%
13
Science Advances
1098 papers in training set
Top 17%
1.7%
14
Cell Genomics
162 papers in training set
Top 3%
1.7%
15
Nature Biotechnology
147 papers in training set
Top 5%
1.3%
16
Patterns
70 papers in training set
Top 2%
0.9%
17
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 40%
0.9%
18
Communications Biology
886 papers in training set
Top 17%
0.9%
19
iScience
1063 papers in training set
Top 25%
0.9%
20
Computational and Structural Biotechnology Journal
216 papers in training set
Top 8%
0.9%
21
Cell Discovery
54 papers in training set
Top 4%
0.9%
22
Nature Biomedical Engineering
42 papers in training set
Top 2%
0.8%
23
eLife
5422 papers in training set
Top 58%
0.7%
24
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 6%
0.7%
25
Cancer Cell
38 papers in training set
Top 2%
0.7%
26
Nature Methods
336 papers in training set
Top 6%
0.7%