Back

Power is a major confounder in the analysis of cross-ancestry 'portability' in human eQTLs

Gibbs, P. M.; Beasley, I. J.; Del Azodi, C. B.; McCarthy, D. J.; Gallego Romero, I.

2026-02-27 genomics
10.64898/2026.02.26.708346 bioRxiv
Show abstract

The phenotypic effects of germline variants are often mediated through gene regulation. Expression quantitative trait loci (eQTLs) are genetic variants associated with changes in gene expression. Understanding how eQTLs vary across populations is essential for characterising the genetic and regulatory drivers of trait diversity. Meta-analysing eQTL studies from multiple populations enables more robust detection of eQTLs and can reveal regulatory mechanisms shaped by population-specific environmental or ancestry-related factors. However, across the multi-ancestry eQTL literature, a wide range of methods have been used to quantify eQTL portability across ancestry groups. Because different studies employ different portability metrics, it is challenging to form a coherent view of the regulatory landscape across populations. In this work, we analyse eQTL summary statistics from ten datasets matched on tissue type and sequencing technology. We compare portability metrics used previously and show that they can yield markedly different patterns of apparent regulatory conservation or divergence. We then examine the statistical determinants of portability across metrics and demonstrate that sample size, minor allele frequency, and linkage disequilibrium are major drivers of the observed differences in eQTL portability across studies. These findings highlight that differences in statistical power stemming from factors such as population size and allele frequency must be accounted for when evaluating eQTL portability. To address this issue, we introduce a new approach designed to correct for these factors when calling eQTL portability. Finally, we show that empirical Bayes multivariate adaptive shrinkage provides a powerful framework for meta-analysing multiple eQTL studies, with the ability to pool signals across populations to produce more robust effect-size estimates within each population.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
The American Journal of Human Genetics
206 papers in training set
Top 0.1%
37.2%
2
PLOS Genetics
756 papers in training set
Top 2%
8.3%
3
eLife
5422 papers in training set
Top 18%
4.8%
50% of probability mass above
4
GENETICS
189 papers in training set
Top 0.1%
4.8%
5
Genome Research
409 papers in training set
Top 0.7%
4.2%
6
Genome Biology
555 papers in training set
Top 2%
4.2%
7
Cell Genomics
162 papers in training set
Top 1%
3.9%
8
Frontiers in Genetics
197 papers in training set
Top 2%
3.5%
9
Genetics
225 papers in training set
Top 1%
3.5%
10
Genome Biology and Evolution
280 papers in training set
Top 0.8%
2.0%
11
Nature Communications
4913 papers in training set
Top 49%
1.9%
12
Scientific Reports
3102 papers in training set
Top 59%
1.7%
13
G3 Genes|Genomes|Genetics
351 papers in training set
Top 1%
1.6%
14
Human Genetics and Genomics Advances
70 papers in training set
Top 0.4%
1.3%
15
PLOS Computational Biology
1633 papers in training set
Top 20%
1.2%
16
Nucleic Acids Research
1128 papers in training set
Top 14%
1.2%
17
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
1.2%
18
Molecular Biology and Evolution
488 papers in training set
Top 4%
0.9%
19
Genetic Epidemiology
46 papers in training set
Top 0.7%
0.9%
20
PLOS ONE
4510 papers in training set
Top 64%
0.9%
21
Bioinformatics
1061 papers in training set
Top 9%
0.9%
22
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 5%
0.8%
23
American Journal of Biological Anthropology
11 papers in training set
Top 0.3%
0.7%
24
Bioinformatics Advances
184 papers in training set
Top 5%
0.6%