Back

Flexible and efficient count-distribution and mixed-model methods for eQTL mapping with quasar

Pullin, J. M.; Wallace, C.

2025-07-17 genetic and genomic medicine
10.1101/2025.07.17.25331702
Show abstract

Identifying genetic variants that affect gene expression, expression quantitative trait loci (eQTLs), is a major focus of modern genomics. Today, various methods exist for eQTL mapping, each using different statistical and methodological approaches. However, it is unclear which approaches lead to better performance, and challenges, particularly scalability as datasets continue to increase in size, remain. Here, we introduce quasar, a flexible and efficient C++ software program for eQTL mapping. Compared to existing eQTL mapping methods, quasar implements a wider variety of statistical models, including the linear model, Poisson and negative binomial generalised linear models, linear mixed model and Poisson and negative binomial generalised linear mixed models. Methodologically, we introduce and implement a simple, analytic approximation to the score test variance in mixed models. Furthermore, we highlight that difficulties with accurately estimating the negative binomial dispersion parameter, previously identified in the context of RNA-seq differential expression analysis, also apply to eQTL mapping. Therefore, quasar implements the Cox-Reid adjusted profile likelihood which enables unbiased estimation of the negative binomial dispersion parameter. We assess quasars performance and compare it to three existing eQTL mapping methods: apex, jaxQTL and tensorQTL, on the OneK1K dataset. We demonstrate that quasars output agrees with established methods where their models aligns but that quasar is at least 30% and up to 25 times faster. We exploit the range of models implemented in quasar to compare statistical models for eQTL mapping without confounding by implementation. We find that: count-based models have higher power, mixed models do not show better performance in a dataset without substantial relatedness, and the adjusted profile likelihood improves Type 1 error control when using the negative binomial distribution. Additionally, we investigate the relative performance of Poisson and negative binomial mixed models and the use of different approaches for gene-level FDR control. Overall, quasar provides a performant and versatile program for eQTL mapping and we nominate the negative binomial GLM model, incorporating adjusted profile likelihood dispersion estimation, as the statistical model with the best performance.

Matching journals

1
Bioinformatics
Oxford University Press (OUP) · based on 24 published papers
#1
182× avg
2
The American Journal of Human Genetics
Elsevier BV · based on 77 published papers
Top 0.7%
32× avg
3
Nature Communications
Springer Science and Business Media LLC · based on 483 published papers
Top 6%
4.8× avg
4
Nature Genetics
Springer Science and Business Media LLC · based on 72 published papers
Top 3%
9.7× avg
5
Genome Biology
Springer Science and Business Media LLC · based on 14 published papers
#1
93× avg
6
Scientific Reports
Springer Science and Business Media LLC · based on 701 published papers
Top 41%
4.6%
7
Human Genetics and Genomics Advances
Elsevier BV · based on 39 published papers
Top 0.2%
33× avg
8
Cell Genomics
Elsevier BV · based on 34 published papers
Top 0.6%
27× avg
9
BMC Genomics
Springer Science and Business Media LLC · based on 15 published papers
#1
70× avg
10
Genome Medicine
Springer Science and Business Media LLC · based on 56 published papers
Top 3%
9.1× avg
11
PLOS Genetics
Public Library of Science (PLoS) · based on 39 published papers
Top 0.9%
19× avg
12
Frontiers in Genetics
Frontiers Media SA · based on 32 published papers
Top 1%
14× avg
13
Proceedings of the National Academy of Sciences
Proceedings of the National Academy of Sciences · based on 100 published papers
Top 5%
4.8× avg
14
Briefings in Bioinformatics
Oxford University Press (OUP) · based on 11 published papers
Top 0.2%
59× avg
15
Genes
MDPI AG · based on 21 published papers
Top 1%
21× avg
16
Human Molecular Genetics
Oxford University Press (OUP) · based on 28 published papers
Top 3%
10× avg
17
iScience
Elsevier BV · based on 74 published papers
Top 5%
5.1× avg
18
npj Genomic Medicine
Springer Science and Business Media LLC · based on 18 published papers
Top 2%
14× avg
19
Human Genomics
Springer Science and Business Media LLC · based on 13 published papers
Top 2%
17× avg
20
PLOS ONE
Public Library of Science (PLoS) · based on 1737 published papers
Top 94%
0.8%
21
Human Genetics
Springer Science and Business Media LLC · based on 14 published papers
Top 1%
17× avg
22
PLOS Computational Biology
Public Library of Science (PLoS) · based on 141 published papers
Top 9%
1.6× avg
23
Nature
Springer Science and Business Media LLC · based on 58 published papers
Top 9%
2.7× avg
24
European Journal of Human Genetics
Springer Science and Business Media LLC · based on 25 published papers
Top 3%
8.3× avg
25
Science Advances
American Association for the Advancement of Science (AAAS) · based on 52 published papers
Top 6%
3.7× avg
26
Communications Biology
Springer Science and Business Media LLC · based on 36 published papers
Top 5%
4.2× avg