Back

Genetic Profiling of Autoimmune Diseases and Exploring Clusters Through Polygenic Risk Score Analysis Using Cohort Data from the UK Biobank

Saurabh, R.; Wohlers, I.; Moeller, M.; Busch, H.

2026-05-13 genetic and genomic medicine
10.64898/2026.05.09.26352677 medRxiv
Show abstract

Autoimmune diseases result from immune responses against self-antigens but exhibit marked phenotypic diversity shaped by genetic and environmental factors. Genome-wide association studies (GWAS) have identified susceptibility loci that inform polygenic scores (PGS) for risk prediction. This study integrates phenotypic and genetic data from the UK Biobank(UKB) to characterize disease overlap, genetic heterogeneity, and shared biological mechanisms across autoimmune conditions. Comorbidity patterns were further assessed using patient records from UKB and the TriNetX(TNX). Phenotypic data from 502,371 UKB participants were used to evaluate diagnostic overlap, with a subset of 104,544 individuals analyzed for PGS distributions. Significant variants were identified using genome-wide thresholds, allele frequency, and predicted impact, and shared genes were subsequently mapped to pathways using Hallmark gene sets. Comorbidity across rare and common autoimmune diseases was assessed in the UKB and TNX using ICD-10 codes, focusing on White individuals (71,069,654 in TNX; 502,371 in UKB). Odds ratios for 15 diseases were estimated, and cross-cohort comparisons evaluated reproducibility and cohort-specific differences. PGS analyses revealed both shared and distinct genetic architectures, indicating partial genetic overlap and supporting poly-autoimmunity. Integration of common, rare and impactful variants identified both known and novel gene associations, while pathway analysis highlighted systemic and tissue-specific immune dysregulation. Cross-dataset comparisons confirmed consistent comorbidity patterns but underscored the impact of dataset-specific factors, emphasizing the need for standardized approaches in autoimmune disease research.

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
The Lancet Rheumatology
11 papers in training set
Top 0.1%
8.4%
2
Nature Communications
4913 papers in training set
Top 29%
6.3%
3
Cell Genomics
162 papers in training set
Top 0.5%
6.3%
4
Journal of Allergy and Clinical Immunology
25 papers in training set
Top 0.1%
6.3%
5
Clinical Immunology
21 papers in training set
Top 0.1%
4.8%
6
Frontiers in Immunology
586 papers in training set
Top 2%
4.3%
7
JCI Insight
241 papers in training set
Top 1%
3.8%
8
eLife
5422 papers in training set
Top 26%
3.6%
9
Genome Medicine
154 papers in training set
Top 2%
3.6%
10
Brain, Behavior, and Immunity
105 papers in training set
Top 0.7%
3.6%
50% of probability mass above
11
Rheumatology
21 papers in training set
Top 0.2%
2.6%
12
Brain
154 papers in training set
Top 3%
1.8%
13
Arthritis & Rheumatology
33 papers in training set
Top 0.3%
1.7%
14
Frontiers in Genetics
197 papers in training set
Top 5%
1.7%
15
Journal of Clinical Investigation
164 papers in training set
Top 3%
1.7%
16
Scientific Reports
3102 papers in training set
Top 58%
1.7%
17
The American Journal of Human Genetics
206 papers in training set
Top 2%
1.7%
18
BMC Medical Genomics
36 papers in training set
Top 0.5%
1.7%
19
Annals of the Rheumatic Diseases
32 papers in training set
Top 0.4%
1.7%
20
iScience
1063 papers in training set
Top 19%
1.3%
21
Gut
36 papers in training set
Top 0.6%
1.2%
22
Communications Biology
886 papers in training set
Top 16%
1.1%
23
Cell Reports Medicine
140 papers in training set
Top 7%
0.9%
24
EBioMedicine
39 papers in training set
Top 0.8%
0.9%
25
Human Molecular Genetics
130 papers in training set
Top 3%
0.8%
26
Cell
370 papers in training set
Top 16%
0.8%
27
Human Genetics and Genomics Advances
70 papers in training set
Top 0.7%
0.8%
28
Genes & Immunity
11 papers in training set
Top 0.1%
0.7%
29
Journal of Investigative Dermatology
42 papers in training set
Top 0.5%
0.7%
30
Nature Immunology
71 papers in training set
Top 2%
0.7%