Back

Identifying genetic regulations on immune cell type proportions and their impacts on autoimmune diseases

Lin, C.; Shen, J.; Sun, J.; Xie, Y.; Xu, L.; Lin, Y.; Hu, J.; Zhao, H.

2026-03-01 genetics
10.64898/2026.02.26.708418 bioRxiv
Show abstract

Genetic regulation of immune cell composition plays a crucial role in the etiology of complex diseases, yet remains poorly understood. We propose a unified analytical framework that integrates genome-wide association studies (GWAS) of cell type proportions with cell-type-wide association studies (cWAS) to systematically characterize both the genetic regulation of immune cell composition and its downstream effects on disease risk. Using single-cell RNA sequencing data from the OneK1K cohort, we conducted a GWAS of immune cell-type proportions with a depth-weighted quasi-binomial model designed for bounded, overdispersed traits. We identified 47 genome-wide significant loci influencing eight fine-labeled immune cell subtypes. Leveraging these identified genetic effects, we further imputed genetically regulated proportions (GRPs) using polygenic risk score (PRS)-based imputation and assessed their associations with complex diseases through cWAS. We identified five significant cell type-disease associations, including two with type 1 diabetes, two with Crohns disease, and one with ulcerative colitis. Together, our results demonstrate that cell type proportions observed in scRNA-seq can reveal regulatory loci and offer insights into how genetic variations regulate immune cell type proportions to affect disease risk. Although we focused on immune single-cell data, our framework is applicable to other tissues or cellular compositions as scRNA-seq datasets expand. Author SummaryGenome-wide association studies (GWASs) have uncovered many disease-associated signals, yet most lie in noncoding regions and are difficult to interpret. Mapping GWAS signals to the relevant cell types is therefore important for better understanding the biological mechanisms that drive disease. A major challenge is that observed gene expression and measured cell-type proportions can be influenced by environmental factors and disease status. In contrast, genotypes are less affected by these factors, making them more reliable for interpreting factors of diseases. Moreover, the cell-type proportions are bounded and often skewed, so standard GWAS models that rely on Gaussian assumptions may lose power. To address this, we developed a quasi-binomial approach that better matches the data and improves discovery while controlling false positives. In real data, our method identified more genetic loci associated with cell-type proportions than a traditional linear model. To further investigate how genetic variation regulates immune cell composition to influence disease risk, we integrated our results with disease GWAS summary statistics to identify immune cell types that may contribute to disease susceptibility. Together, our results link disease-associated GWAS signals to specific immune cell types and provide insights into the cellular mechanisms that may underlie these diseases.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
PLOS Genetics
756 papers in training set
Top 0.2%
23.0%
2
Cell Genomics
162 papers in training set
Top 0.4%
7.3%
3
The American Journal of Human Genetics
206 papers in training set
Top 0.7%
7.0%
4
Nature Genetics
240 papers in training set
Top 1%
6.5%
5
eLife
5422 papers in training set
Top 19%
4.4%
6
PLOS Computational Biology
1633 papers in training set
Top 8%
4.1%
50% of probability mass above
7
Human Genetics and Genomics Advances
70 papers in training set
Top 0.1%
3.7%
8
Human Molecular Genetics
130 papers in training set
Top 0.9%
2.9%
9
Frontiers in Genetics
197 papers in training set
Top 3%
2.9%
10
Genetics
225 papers in training set
Top 2%
2.1%
11
Genome Biology
555 papers in training set
Top 3%
2.1%
12
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 30%
1.8%
13
Genome Medicine
154 papers in training set
Top 4%
1.8%
14
Frontiers in Immunology
586 papers in training set
Top 4%
1.7%
15
Genome Research
409 papers in training set
Top 2%
1.7%
16
Nature Communications
4913 papers in training set
Top 53%
1.5%
17
Bioinformatics
1061 papers in training set
Top 7%
1.5%
18
iScience
1063 papers in training set
Top 17%
1.5%
19
Science Translational Medicine
111 papers in training set
Top 3%
1.4%
20
Cell Reports
1338 papers in training set
Top 28%
1.3%
21
Genetic Epidemiology
46 papers in training set
Top 0.6%
1.0%
22
BMC Genomics
328 papers in training set
Top 5%
0.8%
23
Cell
370 papers in training set
Top 16%
0.8%
24
Communications Biology
886 papers in training set
Top 23%
0.8%
25
Genes & Immunity
11 papers in training set
Top 0.2%
0.7%
26
Science
429 papers in training set
Top 21%
0.7%
27
PLOS ONE
4510 papers in training set
Top 71%
0.7%
28
Immunity
58 papers in training set
Top 5%
0.7%
29
Nucleic Acids Research
1128 papers in training set
Top 21%
0.5%
30
Nature Biotechnology
147 papers in training set
Top 9%
0.5%