Expanding CIRdb, a comprehensive catalog of whole-exome sequencing data of Canary Islanders

Diaz-de Usera, A.; Rubio-Rodriguez, L. A.; Munoz-Barrera, A.; Lorenzo-Salazar, J. M.; Guillen-Guio, B.; Jaspez, D.; Corrales, A.; Marcelino-Rodriguez, I.; Rodriguez-Perez, M. d. C.; Cabrera-de Leon, A.; Gonzalez-Montelongo, R.; Cruz-Guerrero, R.; Carracedo, A.; Flores, C.

2025-11-27 genetic and genomic medicine
10.1101/2025.11.24.25340885 medRxiv

Within the intricate European genetic diversity landscape, Canary Islanders exhibit a unique genetic admixture, comprising European (EUR), North African (NAF), and sub-Saharan African (SSA) ancestries. This study aimed to comprehensively characterize the full spectrum of small genetic variation among 920 unrelated donors from this population based on whole-exome sequencing data to further develop CIRdb as the Canary Islanders-specific reference catalog of genetic variation. We combined this with SNP array data and whole-genome sequencing for specific analyses, revealing a total of 387,555 variants, of which 15.1% were previously unreported. Notably, 74.4% of these variants were classified as rare (frequency <0.5%), including up to 40% of singletons. We also identified and curated a set of 2,068 variants prioritized as putatively pathogenic. Intriguingly, the novel pathogenic variants exhibited enrichment in respiratory, cardiovascular, and metabolic disorders. Genetic differentiation patterns clustered individuals from the smallest islands separately, providing fine-grained insights into within-archipelago differentiation. A scan of local genetic ancestry deviations across the genome revealed an EUR ancestry enrichment around the 17q21.31 inversion, widely recognized for positive selection and associated with pleiotropic effects across pulmonary, infectious, and immunological diseases. Our results also provided evidence of a selective sweep shared by Canary Islanders and the NAF population around the Prune Exopolyphosphatase 1 gene, which is associated with body mass index, cardiovascular health, and metabolic traits. Taken together, CIRdb presents a valuable resource of exome-wide genetic variation in a population at the edge of Southwestern European genetic diversity.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1. Genome Medicine | 154 papers in training set | Top 0.1% | 23.4%
2. Cell Genomics | 162 papers in training set | Top 0.1% | 20.2%
3. Nature Communications | 4913 papers in training set | Top 9% | 15.3%
(50% of probability mass above)
4. Human Genomics | 21 papers in training set | Top 0.1% | 6.6%
5. The American Journal of Human Genetics | 206 papers in training set | Top 2% | 2.5%
6. Cell | 370 papers in training set | Top 9% | 2.2%
7. Genome Biology | 555 papers in training set | Top 4% | 1.8%
8. Nature Genetics | 240 papers in training set | Top 4% | 1.7%
9. Genomics | 60 papers in training set | Top 1% | 1.5%
10. Nature | 575 papers in training set | Top 12% | 1.4%
11. Scientific Reports | 3102 papers in training set | Top 63% | 1.4%
12. Frontiers in Genetics | 197 papers in training set | Top 6% | 1.3%
13. Nucleic Acids Research | 1128 papers in training set | Top 13% | 1.3%
14. Human Molecular Genetics | 130 papers in training set | Top 2% | 1.3%
15. Communications Biology | 886 papers in training set | Top 13% | 1.3%
16. European Journal of Human Genetics | 49 papers in training set | Top 0.9% | 1.0%
17. Briefings in Bioinformatics | 326 papers in training set | Top 6% | 0.8%
18. eLife | 5422 papers in training set | Top 56% | 0.8%
19. Human Genetics | 25 papers in training set | Top 0.4% | 0.8%
20. Proceedings of the National Academy of Sciences | 2130 papers in training set | Top 44% | 0.7%
21. BMC Medical Genomics | 36 papers in training set | Top 2% | 0.7%
22. Nature Human Behaviour | 85 papers in training set | Top 5% | 0.5%
23. Human Genetics and Genomics Advances | 70 papers in training set | Top 1% | 0.5%
24. Nature Medicine | 117 papers in training set | Top 6% | 0.5%
25. National Science Review | 22 papers in training set | Top 3% | 0.5%
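
The 50% cutoff marked in the list can be checked with a short cumulative-sum calculation. The sketch below is only an illustration: it assumes the cutoff is the smallest rank at which the running total of predicted probabilities reaches 50%, which may not be exactly how the site computes it. The function name `ranks_to_reach` is hypothetical, and the input values are the percentages of the four top-ranked journals listed above.

```python
from itertools import accumulate


def ranks_to_reach(probabilities, threshold):
    """Return the smallest number of top-ranked entries whose cumulative
    probability reaches `threshold`, or None if it is never reached.

    Assumption: `probabilities` is already sorted from highest to lowest rank.
    """
    for rank, running_total in enumerate(accumulate(probabilities), start=1):
        if running_total >= threshold:
            return rank
    return None


# Predicted probabilities (in percent) of the four top-ranked journals.
top_probabilities = [23.4, 20.2, 15.3, 6.6]

print(ranks_to_reach(top_probabilities, 50.0))  # 3
```

The first two journals sum to 43.6%, below the threshold; adding the third brings the total to 58.9%, so three journals are needed to pass 50%.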