Back

A Multi-Omics Computational Pipeline for Systematic Discovery of Retired Self-Antigens as Cancer Vaccine Targets

Wang, V.; Deng, S.; Aguilar, R.

2026-04-22 genetic and genomic medicine
10.64898/2026.04.20.26351288 medRxiv
Show abstract

BackgroundThe retired antigen hypothesis, introduced by Tuohy and colleagues, proposes that tissue-specific proteins expressed conditionally during early life or reproductive stages, then silenced in normal aging tissue, represent safe and effective cancer vaccine targets when re-expressed in tumors. To date, discovery of retired antigens has relied entirely on hypothesis-driven wet lab work, limiting throughput. MethodsHere we present RADAR (Retired Antigen Discovery and Ranking), a multi-omics computational pipeline implemented on a standard server that systematically identifies retired antigen candidates. RADAR comprises four core discovery layers integrating: 1) The Genotype-Tissue Expression Portal (GTEx) normal tissue expression, 2) TCGA tumor re-expression, 3) DNA methylation, and 4) miRNA regulatory networks, each applied sequentially to identify genes exhibiting the epigenetic and post-transcriptional hallmarks of tissue-specific retirement followed by tumor re-activation. Candidate characterization is further supported by three automated modules: 1) protein-level safety screening via the Human Protein Atlas, 2) molecular subtype enrichment analysis, and 3) cross-cancer confirmation, which execute automatically when the relevant data are available for the selected cancer type. ResultsThe pipeline independently validated known targets including alpha-lactalbumin (LALBA, the basis of the Tuohy Phase 1 triple-negative breast cancer vaccine trial) and anti-Mullerian hormone (AMH), consistent with Tuohys ovarian cancer vaccine program targeting AMHR2, and rediscovered multiple known cancer-testis antigens (MAGEA1, MAGEC1, SSX1) as positive controls. Among 4,664 initial candidates derived from GTEx, the pipeline identified 20 high-confidence retired antigen candidates passing all filters. DCAF4L2, COX7B2, TEX19, and CT83 emerge as the highest-priority novel candidates for experimental validation, demonstrating zero expression in critical somatic organs, strong epigenetic silencing, and significant re-expression across multiple cancer types. ConclusionRADAR provides the first systematic computational framework for retired antigen discovery, offering a reproducible and scalable approach to expanding the cancer immunoprevention pipeline beyond individually characterized targets. The pipeline is fully reproducible, requires no specialized hardware, and is immediately extensible to additional TCGA cancer types.

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
Genome Medicine
154 papers in training set
Top 0.1%
19.0%
2
Cell Genomics
162 papers in training set
Top 0.5%
6.5%
3
Nature Communications
4913 papers in training set
Top 28%
6.5%
4
Nucleic Acids Research
1128 papers in training set
Top 5%
3.6%
5
Bioinformatics
1061 papers in training set
Top 6%
3.1%
6
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.1%
7
iScience
1063 papers in training set
Top 9%
2.4%
8
Cell Systems
167 papers in training set
Top 6%
2.1%
9
Scientific Reports
3102 papers in training set
Top 49%
2.1%
10
Nature Medicine
117 papers in training set
Top 1%
2.1%
50% of probability mass above
11
Frontiers in Bioinformatics
45 papers in training set
Top 0.1%
1.9%
12
Aging Cell
144 papers in training set
Top 2%
1.7%
13
Communications Medicine
85 papers in training set
Top 0.2%
1.7%
14
Journal for ImmunoTherapy of Cancer
64 papers in training set
Top 0.6%
1.7%
15
European Journal of Cancer
10 papers in training set
Top 0.2%
1.7%
16
eBioMedicine
130 papers in training set
Top 1%
1.7%
17
PLOS ONE
4510 papers in training set
Top 54%
1.7%
18
Cell Reports Medicine
140 papers in training set
Top 4%
1.7%
19
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
20
GigaScience
172 papers in training set
Top 2%
1.2%
21
Frontiers in Oncology
95 papers in training set
Top 3%
1.2%
22
PLOS Computational Biology
1633 papers in training set
Top 19%
1.2%
23
Scientific Data
174 papers in training set
Top 2%
1.1%
24
Journal of Translational Medicine
46 papers in training set
Top 2%
1.1%
25
npj Digital Medicine
97 papers in training set
Top 3%
0.9%
26
npj Precision Oncology
48 papers in training set
Top 1%
0.9%
27
BMC Medicine
163 papers in training set
Top 6%
0.9%
28
Alzheimer's & Dementia
143 papers in training set
Top 3%
0.8%
29
Nature Aging
51 papers in training set
Top 2%
0.8%
30
Biomedicines
66 papers in training set
Top 3%
0.8%