Back

Reconstructing N-Glycan Profiles of Individual Glycoproteins from the Total Blood Plasma N-Glycome: Implications for Immunoglobulin G N-Glycosylation GWAS

Soplenkova, A.; Maslov, D.; Timoshchuk, A.; Kifer, D.; Cvetko, A.; Georges, M.; Steves, C. J.; Menni, C.; Sharapov, S.; Lauc, G.; Aulchenko, Y. S.

2026-02-02 genetics
10.64898/2026.01.30.702519 bioRxiv
Show abstract

The genetic regulation of the plasma N-glycome variation in human populations is not fully characterized, partly due to the limited sample size in glyco-genetics studies. Here, we aimed to demonstrate that protein-specific N-glycan profiles, like those of immunoglobulin G (IgG), can be accurately reconstructed from the total plasma N-glycome (TPNG), enabling us to find new regulators of this complex process re-analysing existing datasets. By testing multiple linear and non-linear machine learning approaches we built a model to reconstruct IgG N-glycans from TPNG data, training on the TwinsUK cohort and validating on CEDAR. We reconstruct GWAS summary statistics for IgG N-glycans by applying the trained linear model to plasma glycan GWAS summary statistics, i.e., as GWAS of linear combinations of plasma glycan traits. The majority of the identified loci had been implicated in IgG N-glycosylation GWAS. Additionally, we found four new loci and suggested the role of FCRLA, KDELR2, HHEX, and TCF3 in the regulation of IgG N-glycosylation. In conclusion, we showed that our method enables the creation of protein-specific N-glycome datasets, allowing for powerful meta-analyses without the need to profile new samples.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 2%
14.5%
2
Frontiers in Genetics
197 papers in training set
Top 0.3%
10.2%
3
Scientific Reports
3102 papers in training set
Top 12%
7.2%
4
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.9%
5
iScience
1063 papers in training set
Top 3%
4.0%
6
Bioinformatics
1061 papers in training set
Top 5%
3.6%
7
eLife
5422 papers in training set
Top 30%
2.9%
8
BMC Medical Genomics
36 papers in training set
Top 0.3%
2.4%
9
Genome Medicine
154 papers in training set
Top 3%
2.1%
50% of probability mass above
10
Cell Genomics
162 papers in training set
Top 2%
2.1%
11
Nature Communications
4913 papers in training set
Top 46%
2.1%
12
Human Molecular Genetics
130 papers in training set
Top 1%
1.9%
13
Frontiers in Immunology
586 papers in training set
Top 4%
1.9%
14
BMC Genomics
328 papers in training set
Top 2%
1.9%
15
Genetic Epidemiology
46 papers in training set
Top 0.4%
1.8%
16
Human Genetics and Genomics Advances
70 papers in training set
Top 0.2%
1.8%
17
Journal of Genetics and Genomics
36 papers in training set
Top 0.9%
1.7%
18
EMBO Molecular Medicine
85 papers in training set
Top 2%
1.7%
19
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
20
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 4%
1.7%
21
PLOS Genetics
756 papers in training set
Top 9%
1.7%
22
PLOS ONE
4510 papers in training set
Top 56%
1.5%
23
BMC Bioinformatics
383 papers in training set
Top 5%
1.3%
24
Communications Biology
886 papers in training set
Top 17%
1.0%
25
Cell Reports
1338 papers in training set
Top 30%
0.9%
26
Genome Biology
555 papers in training set
Top 6%
0.9%
27
Genes & Immunity
11 papers in training set
Top 0.1%
0.9%
28
Analytical Chemistry
205 papers in training set
Top 2%
0.8%
29
Life Science Alliance
263 papers in training set
Top 2%
0.8%
30
Molecular & Cellular Proteomics
158 papers in training set
Top 2%
0.8%