Back

Self-reported health history from 70,724 individuals reveals novel HLA associations with allergy and other frequently underreported conditions

Boquett, J. A.; Lin, S. Y.-T.; House, J. S.; Ahn, K.; Suseno, R.; BakenRa, A.; Guthrie, K.; Wright, M.; Motsinger-Reif, A.; Maiers, M.; Hollenbach, J. A.

2026-02-19 genetic and genomic medicine
10.64898/2026.02.18.26346586 medRxiv
Show abstract

BackgroundVariation in the HLA loci, located on human chromosome 6p, has been associated with hundreds of diseases and conditions. However, high levels of polymorphism that characterize the HLA system, coupled with generally modest effect sizes for most phenotypes, necessitate relatively large sample sizes to power association studies; meanwhile, high resolution HLA genotyping remains relatively resource intensive. These constraints limit identification of novel associations. While phenome-wide association studies (PheWAS) in the context of large registries with available electronic health records (EHR) have revealed new insights into the role of HLA in disease, many common health conditions are poorly represented in EHR due to the temporal nature of their occurrence or general underreporting. Further, these studies have generally been conducted with HLA genotyping data imputed from microarrays, rather than direct measurement of high-resolution genotypes. ObjectiveTo overcome these limitations and reveal novel HLA associations we undertook a PheWAS in many previously understudied health conditions. MethodsWe queried over 300 hundred conditions, diseases and traits from 70,724 subjects registered with NMDP with available high-resolution HLA genotyping (HLA-A, HLA-B, HLA-C, HLA-DRB1, and HLA-DQB1). After stratifying according to ancestry, we performed a logistic regression analysis adjusting for sex and age for HLA-phenotype association. ResultsWe identified 48 significant HLA associations across ancestry groups, confirming several known associations and uncovered fifteen novel associations. Most novel associations pertained to common infectious or allergic phenotypes that often go under-reported in the EHR. Of particular translational importance, we identified a previously undetected yet very strong association between HLA-DRB1*04:01 and sensitivity to cefaclor, a specific class of cephalosporin (OR = 3.74, p-value 5.10E-28). Molecular docking simulations predict cefaclor binding in the P4 pocket of HLA-DRB1*04:01, with substantially greater affinity than non-associated antibiotics, including other cephalosporins. This pharmacogenomic signal highlights an opportunity for risk stratification and targeted prevention of adverse drug reactions. Other novel associations found, such as susceptibility to genital warts (HPV) and allergic rhinitis, reveals new insights into the role of specific HLA alleles in immune-mediated disease. The vast majority of these novel associations were replicated in the independent All of Us cohort, confirming the validity of this approach. ConclusionCollectively, our findings demonstrate the value of integrating population-scale, high-resolution HLA genotypes with phenotyping beyond the EHR to reveal immunogenetic influences on common health outcomes. They also point to immediate translational avenues - particularly for drug hypersensitivity - while motivating future functional studies and prospective clinical validation to refine mechanistic understanding and clinical utility.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Genome Medicine
154 papers in training set
Top 0.1%
28.7%
2
Nature Communications
4913 papers in training set
Top 21%
8.7%
3
Nature Genetics
240 papers in training set
Top 1.0%
7.4%
4
Journal of Clinical Investigation
164 papers in training set
Top 0.6%
4.5%
5
The American Journal of Human Genetics
206 papers in training set
Top 1%
3.7%
50% of probability mass above
6
Scientific Reports
3102 papers in training set
Top 52%
2.0%
7
BMC Medical Genomics
36 papers in training set
Top 0.3%
2.0%
8
eBioMedicine
130 papers in training set
Top 1.0%
1.8%
9
Journal of Allergy and Clinical Immunology
25 papers in training set
Top 0.3%
1.8%
10
The Journal of Infectious Diseases
182 papers in training set
Top 2%
1.8%
11
Cell Reports Medicine
140 papers in training set
Top 3%
1.8%
12
Clinical Infectious Diseases
231 papers in training set
Top 3%
1.7%
13
Med
38 papers in training set
Top 0.3%
1.5%
14
Cell Genomics
162 papers in training set
Top 4%
1.5%
15
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 37%
1.3%
16
eLife
5422 papers in training set
Top 48%
1.3%
17
Communications Biology
886 papers in training set
Top 18%
0.9%
18
Science Translational Medicine
111 papers in training set
Top 5%
0.9%
19
Circulation
66 papers in training set
Top 2%
0.8%
20
Communications Medicine
85 papers in training set
Top 0.8%
0.8%
21
Nature Medicine
117 papers in training set
Top 4%
0.8%
22
npj Digital Medicine
97 papers in training set
Top 3%
0.8%
23
The Lancet Rheumatology
11 papers in training set
Top 0.2%
0.8%
24
PLOS Computational Biology
1633 papers in training set
Top 24%
0.8%
25
Nature Immunology
71 papers in training set
Top 2%
0.8%
26
Nature Human Behaviour
85 papers in training set
Top 4%
0.8%
27
Science Advances
1098 papers in training set
Top 30%
0.7%
28
PLOS Genetics
756 papers in training set
Top 15%
0.7%
29
European Respiratory Journal
54 papers in training set
Top 2%
0.7%
30
Brain
154 papers in training set
Top 5%
0.7%