Back

Landscape of blood group antigens and alleles in the Indian population from whole genome sequences

Rophina, M.; Bhoyar, R. C.; Imran, M.; Senthivel, V.; Divakar, M. K.; Mishra, A.; Jolly, B.; Sivasubbu, S.; Scaria, V.

2023-09-26 genetic and genomic medicine
10.1101/2023.09.26.23296145 medRxiv
Show abstract

Blood group antigens are genetically inherited macromolecular structures which form the underlying factor for inter individual variations in human blood. Currently there exists over 390 human blood group antigens corresponding to 44 blood group systems and 2 erythroid specific transcription factors. Distribution of these blood group antigens have been found to differ significantly among various ethnic populations. To date, there is a lack of comprehensive research that offers extensive blood group profiles for the Indian population. Whole genome sequence data (hg38) of 1029 self-declared healthy Indian individuals generated as a part of the pilot phase IndiGen programme were used for the analysis. Variants spanning the genes of 44 blood group systems and two transcription factors KLF1, GATA1 were fetched and annotated for their functional consequences. Our study reports a total of 40712 blood group related variants of which 695 were identified as non-synonymous variants in the coding region. Of the total non-synonymous variants, 105 were found to have a known blood phenotype. A total of 24 variants belonging to 12 blood groups were predicted to be deleterious by more than three computational tools. Our study was also able to identify a few rare blood phenotypes including Au(a-b+), Js(a+b+), Di(a+b-), In(a+b-) and KANNO-. This study is the first to use genomic data to understand the blood group antigen profiles of the Indian population, and it also systematically compares these profiles with those of other global populations. Key pointsO_LIAccurate characterization of the genomic landscape of known and rare blood group alleles and antigens in the Indian population using the whole genome sequencing data of 1029 self-declared healthy individuals C_LIO_LIUnderstanding the distinct similarities and differences in blood group genotypes and phenotypes across diverse global populations through systematic comparison of genomic datasets. C_LI Graphical abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=151 SRC="FIGDIR/small/23296145v1_ufig1.gif" ALT="Figure 1"> View larger version (33K): org.highwire.dtl.DTLVardef@3dea45org.highwire.dtl.DTLVardef@df929borg.highwire.dtl.DTLVardef@121e5corg.highwire.dtl.DTLVardef@1874efb_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 1 journal accounts for 50% of the predicted probability mass.

1
Frontiers in Genetics
197 papers in training set
Top 0.1%
51.7%
50% of probability mass above
2
Human Genomics
21 papers in training set
Top 0.1%
4.0%
3
BMC Medical Genomics
36 papers in training set
Top 0.2%
3.6%
4
Scientific Reports
3102 papers in training set
Top 44%
2.7%
5
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.9%
6
Genomics
60 papers in training set
Top 0.8%
1.9%
7
Genes & Immunity
11 papers in training set
Top 0.1%
1.9%
8
PLOS ONE
4510 papers in training set
Top 50%
1.9%
9
Briefings in Bioinformatics
326 papers in training set
Top 3%
1.9%
10
Nature Communications
4913 papers in training set
Top 50%
1.8%
11
BMC Genomics
328 papers in training set
Top 2%
1.8%
12
Frontiers in Immunology
586 papers in training set
Top 4%
1.7%
13
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 4%
1.2%
14
Genome Medicine
154 papers in training set
Top 6%
0.9%
15
Genes
126 papers in training set
Top 2%
0.9%
16
Nucleic Acids Research
1128 papers in training set
Top 17%
0.8%
17
PLOS Computational Biology
1633 papers in training set
Top 23%
0.8%
18
GigaScience
172 papers in training set
Top 3%
0.7%
19
Frontiers in Ecology and Evolution
60 papers in training set
Top 4%
0.7%
20
Methods
29 papers in training set
Top 0.6%
0.7%
21
Frontiers in Pediatrics
29 papers in training set
Top 0.9%
0.7%
22
Molecular Biology Reports
19 papers in training set
Top 0.6%
0.7%
23
BMC Bioinformatics
383 papers in training set
Top 7%
0.7%
24
Biology
43 papers in training set
Top 3%
0.7%
25
iScience
1063 papers in training set
Top 37%
0.6%
26
Frontiers in Human Neuroscience
67 papers in training set
Top 3%
0.6%
27
Human Genetics and Genomics Advances
70 papers in training set
Top 1%
0.6%