Back

COMBO-RATE: An experimentally validated bioinformatic tool to identify promiscuous HLA restrictions

Nevarez-Mejia, J.; Trevizani, R.; Abawi, A.; Johansson, E.; Sutherland, A.; Grifoni, A.; da Silva Antunes, R.; Sette, A.

2026-04-17 immunology
10.64898/2026.04.14.718521 bioRxiv
Show abstract

Defining HLA restriction of T cell epitopes is essential for understanding immune responses in infectious disease, autoimmunity, and vaccine design. Current bioinformatic programs, including the IEDB RATE tool, enable inference of single-HLA restrictions from immune response data of HLA-typed donors. However, T cell epitopes are frequently presented by multiple HLA alleles, a phenomenon termed promiscuous restriction, limiting the utility of single-allele approaches. To address this limitation, we developed COMBO-RATE, an extension of RATE that systematically evaluates combinations of HLA alleles to identify multi-allelic restriction patterns. Analysis of three independent datasets spanning distinct antigen systems and different epitope discovery strategies revealed that promiscuous restriction is a near-universal feature of immunodominant epitopes. Focusing on the 43 immunodominant CD4 T cell epitopes identified in a B. pertussis genome-wide screen, COMBO-RATE outperformed conventional RATE, identifying restrictions for 32 of 43 epitopes, compared to 24 by RATE alone, and uncovered 49 additional allele restrictions, including 30 unique alleles. Experimental validation using single-HLA transfected cell lines and antigen presentation assays confirmed COMBO-RATE-inferred restrictions, demonstrating that a single epitope can be independently presented by distinct HLA alleles. Overall, COMBO-RATE provides a robust and scalable framework for defining complete HLA restriction profiles from existing population response data, with important implications for the design of vaccines requiring broad HLA coverage across genetically diverse populations. This pipeline is available as both a Python package and a user-friendly web application.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 3%
10.1%
2
Cell Systems
167 papers in training set
Top 1%
10.1%
3
PLOS Computational Biology
1633 papers in training set
Top 4%
8.4%
4
Nature Communications
4913 papers in training set
Top 29%
6.3%
5
Genome Medicine
154 papers in training set
Top 1%
4.9%
6
Frontiers in Immunology
586 papers in training set
Top 2%
4.3%
7
mAbs
28 papers in training set
Top 0.1%
3.7%
8
eLife
5422 papers in training set
Top 29%
3.1%
50% of probability mass above
9
Cell Reports
1338 papers in training set
Top 18%
2.9%
10
PLOS ONE
4510 papers in training set
Top 48%
2.1%
11
Nucleic Acids Research
1128 papers in training set
Top 9%
1.9%
12
Nature Computational Science
50 papers in training set
Top 0.5%
1.9%
13
PLOS Pathogens
721 papers in training set
Top 5%
1.9%
14
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.8%
15
Scientific Reports
3102 papers in training set
Top 58%
1.7%
16
iScience
1063 papers in training set
Top 15%
1.7%
17
Science Advances
1098 papers in training set
Top 21%
1.3%
18
Cell Reports Medicine
140 papers in training set
Top 5%
1.3%
19
Nature Immunology
71 papers in training set
Top 1%
1.2%
20
Computational and Structural Biotechnology Journal
216 papers in training set
Top 7%
1.1%
21
Cell Genomics
162 papers in training set
Top 5%
1.1%
22
Science Translational Medicine
111 papers in training set
Top 4%
1.0%
23
Nature Methods
336 papers in training set
Top 5%
1.0%
24
Cell Host & Microbe
113 papers in training set
Top 4%
0.9%
25
Vaccine
189 papers in training set
Top 2%
0.9%
26
PLOS Biology
408 papers in training set
Top 16%
0.9%
27
npj Vaccines
62 papers in training set
Top 0.4%
0.9%
28
Bioinformatics Advances
184 papers in training set
Top 4%
0.8%
29
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 43%
0.8%
30
Advanced Science
249 papers in training set
Top 19%
0.7%