Back

Minority populations exhibit distinct clinical and genetic features of celiac disease in the United States

Karnati, H.; Ying, W.; Long, X.; Touma, M.-J.; Smith, I.; Lewis, S. K.; Xing, C.; Burstein, E.; Bolze, A.; Green, P. H. R.; Alkalay, M. J.; Kong, X.-F.

2024-12-24 gastroenterology
10.1101/2024.12.20.24319436 medRxiv
Show abstract

Celiac disease (CeD) is a heterogeneous autoimmune disorder influenced by genetic, environmental, and socioeconomic factors. However, little is known about clinical manifestations and genetic risks in minority populations. Using data from the All of Us Research Program, we analyzed 3,040 CeD patients, referred to as the AoU-CeD cohort, to identify clinical and genetic differences across racial and ethnic groups in the United States. CeD prevalence was highest among White individuals (1.08%) and significantly lower among Hispanic (0.36%) and Black (0.16%) populations. The majority of CeD patients were female (78.4%) and diagnosed between the ages of 18 and 64. Minority groups reported poorer physical and mental quality of life (QoL) and higher levels of pain. Ancestry-specific patterns emerged in CeD-associated conditions, with minorities more likely to report diarrhea and non-infectious gastroenteritis but less likely to have osteoporosis, hypothyroidism, chronic fatigue, or a family history of CeD. Compared to previously reported data showing that over 90% of CeD patients carry the HLA-DQ2.5 haplotype, genetic analysis revealed that only 49% of patients in the AoU-CeD cohort carried the high-risk HLA-DQ2.5 haplotype. Additionally, 16.5% lacked known HLA-DQ risk haplotypes, suggesting potential diagnostic or reporting inaccuracies. Minority groups exhibited higher rates of atypical symptoms, lower frequencies of the DQ2.5 haplotype, and distinct distributions of HLA-DQ genotypes. A long haplotype block spanning HLA-A1, B8, C7 and HLA-DQ2.5 was found in Europeans but absent in other ancestries. A genome-wide association study (GWAS) using over 11 million variants from whole-genome sequencing data identified 1,651 significant single-nucleotide polymorphisms (SNPs), primarily within the MHC locus, with the strongest signals observed predominantly among individuals of European ancestry. A predictive model incorporating HLA-DQ genotype, family history, and clinical features achieved 83% accuracy for identifying seropositive CeD. These results highlight the importance of ancestry-specific clinical presentations and genetic features in CeD.

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
American Journal of Gastroenterology
15 papers in training set
Top 0.1%
8.5%
2
Science Translational Medicine
111 papers in training set
Top 0.1%
8.5%
3
Gastroenterology
40 papers in training set
Top 0.2%
8.3%
4
Diabetes
53 papers in training set
Top 0.1%
6.4%
5
The Journal of Clinical Endocrinology & Metabolism
35 papers in training set
Top 0.3%
4.9%
6
Inflammatory Bowel Diseases
15 papers in training set
Top 0.1%
3.6%
7
Cell Reports Medicine
140 papers in training set
Top 2%
3.3%
8
Viruses
318 papers in training set
Top 2%
3.1%
9
Gut
36 papers in training set
Top 0.3%
2.9%
10
eBioMedicine
130 papers in training set
Top 0.5%
2.6%
50% of probability mass above
11
Frontiers in Immunology
586 papers in training set
Top 3%
2.5%
12
JCI Insight
241 papers in training set
Top 2%
2.1%
13
Journal of Allergy and Clinical Immunology
25 papers in training set
Top 0.3%
2.1%
14
Clinical Pharmacology & Therapeutics
25 papers in training set
Top 0.3%
2.1%
15
Clinical and Experimental Immunology
12 papers in training set
Top 0.1%
1.8%
16
Nature Communications
4913 papers in training set
Top 51%
1.7%
17
Scientific Reports
3102 papers in training set
Top 57%
1.7%
18
eLife
5422 papers in training set
Top 42%
1.7%
19
PLOS Neglected Tropical Diseases
378 papers in training set
Top 3%
1.5%
20
PLOS ONE
4510 papers in training set
Top 56%
1.5%
21
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 4%
1.3%
22
Diabetes Care
12 papers in training set
Top 0.2%
1.2%
23
Nature Genetics
240 papers in training set
Top 6%
1.0%
24
Cell Reports
1338 papers in training set
Top 33%
0.8%
25
mBio
750 papers in training set
Top 12%
0.7%
26
Frontiers in Pharmacology
100 papers in training set
Top 5%
0.6%
27
Arthritis & Rheumatology
33 papers in training set
Top 0.6%
0.6%
28
The FASEB Journal
175 papers in training set
Top 4%
0.6%
29
Immunology & Cell Biology
11 papers in training set
Top 0.4%
0.6%
30
The American Journal of Human Genetics
206 papers in training set
Top 4%
0.6%