Back

CD8scape: an accessible, command-line tool for predicting viral escape from the CD8+ T cell response

Smith, E. W.; Hughes, J.; Robertson, D. L.; Illingworth, C.

2026-04-22 bioinformatics
10.64898/2026.04.20.719634 bioRxiv
Show abstract

The CD8+ T cell response is a critical component of antiviral immunity, particularly in hosts who are immunocompromised or undergoing B cell-depleting therapy, such as rituximab. As viral evolution can lead to escape from CD8+ T cell recognition, tools that predict such escape are increasingly relevant. Here, we present CD8scape, an accessible command-line tool designed to predict viral escape from the CD8+ T cell response based on within-host sequence variation and HLA class I genotype. CD8scape is primarily a Julia wrapper for NetMHCpan v4.2, a neural network-based predictor trained on mass spectrometry-derived peptide presentation data. CD8scape integrates variant data and viral reading frames to identify all overlapping 8-11mer peptides at variant sites in both ancestral and derived states. These peptides are evaluated using NetMHCpan, which outputs eluted ligand (EL) scores as allele-specific percentile ranks to account for differences in MHC binding fastidiousness, and these are passed back to CD8scape itself. For each variant, the best-ranking peptide across all alleles is identified, and a harmonic mean is used to summarize presentation likelihood across the hosts HLA genotype. A fold-change between ancestral and derived harmonic means quantifies the likelihood of immune escape, with values >1 indicating reduced predicted presentation, and therefore a potential escape from the CD8+ T cell response. This is converted to a log2 value of this fold-change so that the metric is symmetric around 0, with positive values representing predicted escape. CD8scape can operate with known HLA genotypes or a representative HLA supertype panel for generalizable predictions. We demonstrate our method by application to within-host SARS-CoV-2 evolution in a rituximab-treated patient and discuss its implications for population-level CD8+ T cell escape.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
17.6%
2
PLOS Computational Biology
1633 papers in training set
Top 4%
8.4%
3
Bioinformatics Advances
184 papers in training set
Top 0.3%
7.2%
4
Cell Systems
167 papers in training set
Top 2%
4.9%
5
Nature Biotechnology
147 papers in training set
Top 2%
4.9%
6
PLOS ONE
4510 papers in training set
Top 35%
4.2%
7
Journal of Proteome Research
215 papers in training set
Top 0.7%
4.0%
50% of probability mass above
8
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
2.7%
9
Genome Medicine
154 papers in training set
Top 3%
2.6%
10
ImmunoInformatics
11 papers in training set
Top 0.1%
2.4%
11
mAbs
28 papers in training set
Top 0.1%
2.4%
12
Nature Methods
336 papers in training set
Top 4%
2.1%
13
NAR Genomics and Bioinformatics
214 papers in training set
Top 1%
2.1%
14
Nucleic Acids Research
1128 papers in training set
Top 9%
2.1%
15
Molecular & Cellular Proteomics
158 papers in training set
Top 0.9%
2.1%
16
Viruses
318 papers in training set
Top 3%
1.8%
17
BMC Bioinformatics
383 papers in training set
Top 4%
1.8%
18
Cell Reports Methods
141 papers in training set
Top 2%
1.7%
19
Nature Communications
4913 papers in training set
Top 53%
1.5%
20
Nature Machine Intelligence
61 papers in training set
Top 2%
1.3%
21
Genome Biology
555 papers in training set
Top 6%
1.2%
22
Frontiers in Immunology
586 papers in training set
Top 5%
1.2%
23
iScience
1063 papers in training set
Top 23%
1.1%
24
eLife
5422 papers in training set
Top 51%
1.0%
25
Journal of Chemical Information and Modeling
207 papers in training set
Top 3%
1.0%
26
Scientific Reports
3102 papers in training set
Top 71%
0.9%
27
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
28
Science Advances
1098 papers in training set
Top 30%
0.7%
29
BMC Genomics
328 papers in training set
Top 6%
0.7%
30
Cell Reports Medicine
140 papers in training set
Top 8%
0.7%