Back

The UAE Genome Program: Unique Genetic Insights from 43,608 Individuals

Mousa, M. M.; Olbrich, M. M.; Wohlers, I. I.; Al aamri, A. A.; Alsuwaidi, A. H.; Marzouka, N. a.-d.; Alnaqbi, H. H.; Alameri, M. S.; Ruta, D. D.; Alazazi, J. J.; Magalhaes, T. T.; Mafofo, J. J.; Quilez, J. J.; Allam, M. M.; Mohamad, M. S.; Drou, N. N.; Idaghdour, Y. Y.; Hamoudi, R. R.; Tay, G. G.; Ibrahim, S. S.; Alkaabi, F. F.; Al Mannaei, A. A.; Alsafar, H. H.

2025-09-14 genetic and genomic medicine
10.1101/2025.09.12.25334546 medRxiv
Show abstract

Here, we present a comprehensive genomic characterization of a cohort of 43,608 Emirati genomes sequenced as part of the Emirati Genome Program (EGP). This study identified more than 421 million single-nucleotide variants and indels and more than 600 million copy-number and structural variants. Small variants had 756 million molecular effects annotated. Of 7.7 million polymorphic variants having an allele frequency (AF) of more than 5% in EGP, 1,348 have a predicted deleterious effect on a protein. Characterization with respect to global variation shows that EGP represents a genetic continuum encompassing the range of African, Asian, and European populations. It is best described by two Arabian, an Eurasian, and an African component, with the predominant Arabian component linked to mitochondrial haplogroups J and T that are commonly attributed to the Middle East. Various aneuploidies of sex chromosomes were detected in 93 individuals overall, and aneuploidy of chromosome 21 was identified in 41 individuals. Median inbreeding coefficient and cumulative runs of homozygosity (ROHs) lengths were increased due to extensive consanguinity, were largest in the groups with Arabian main ancestry components, and were higher than reported for Qatar. Families were identified based on genetic relatedness and classified into 264 families with unrelated parents and 247 families with third- and fourth-degree consanguineous parents. Representative consanguineous pedigrees of families in EGP were outlined. Cumulative ROHs were affected by the main ancestry component and significantly increased in offspring of consanguineous parents, with a pronounced difference between 3rd and 4th-degree relatedness. Investigation of cumulative AFs of variants causing Mendelian diseases highlighted genes related to alpha- and beta-thalassemia (HBB, HBA2) and showed a high burden of variants causing severe recessive diseases, metabolic and retinal disorders, and hearing loss. In summary, EGP represents a landmark effort in characterizing the genetic diversity of the Emirati population, leveraging the largest Middle Eastern cohort reported to date.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Genome Medicine
154 papers in training set
Top 0.3%
12.8%
2
Human Genomics
21 papers in training set
Top 0.1%
10.2%
3
Scientific Reports
3102 papers in training set
Top 9%
8.5%
4
Frontiers in Genetics
197 papers in training set
Top 0.4%
8.5%
5
Human Molecular Genetics
130 papers in training set
Top 0.3%
6.4%
6
Nature Communications
4913 papers in training set
Top 32%
4.9%
50% of probability mass above
7
Genes
126 papers in training set
Top 0.2%
4.0%
8
The American Journal of Human Genetics
206 papers in training set
Top 1%
4.0%
9
Genomics
60 papers in training set
Top 0.3%
3.6%
10
Nucleic Acids Research
1128 papers in training set
Top 7%
3.1%
11
Human Genetics
25 papers in training set
Top 0.1%
2.8%
12
European Journal of Human Genetics
49 papers in training set
Top 0.4%
2.6%
13
PLOS ONE
4510 papers in training set
Top 48%
2.1%
14
Cell Genomics
162 papers in training set
Top 2%
2.1%
15
Human Mutation
29 papers in training set
Top 0.3%
1.8%
16
Communications Biology
886 papers in training set
Top 11%
1.5%
17
International Journal of Molecular Sciences
453 papers in training set
Top 9%
1.3%
18
Genetics in Medicine Open
10 papers in training set
Top 0.1%
0.9%
19
PLOS Genetics
756 papers in training set
Top 15%
0.8%
20
npj Genomic Medicine
33 papers in training set
Top 0.9%
0.8%
21
Orphanet Journal of Rare Diseases
18 papers in training set
Top 0.7%
0.8%
22
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 44%
0.8%
23
Journal of Clinical Immunology
11 papers in training set
Top 0.1%
0.7%
24
Gigabyte
60 papers in training set
Top 2%
0.7%
25
Journal of Advanced Research
15 papers in training set
Top 1%
0.7%
26
BMC Genomics
328 papers in training set
Top 7%
0.7%
27
Journal of Personalized Medicine
28 papers in training set
Top 2%
0.7%
28
Nature Medicine
117 papers in training set
Top 6%
0.5%