Back

Case-base-control designs

Elhezzani, N. S.; Bergsma, W.; Weale, M.

2019-08-02 genetics
10.1101/723452 bioRxiv
Show abstract

Most genome-wide association studies (GWASs) use randomly selected samples from the population (hereafter bases) as the control set. This approach is successful when the trait of interest is rare; otherwise, a loss in the statistical power to detect disease-associated variants is expected. To address this, a proposal to combine the three sample types, cases, controls and bases is introduced, for instances when the disease under study is prevalent. This is done by modelling the bases as a mixture of multinomial logistic functions of cases and controls, according to the disease prevalence. The maximum likelihood method is used to estimate the underlying parameters using the EM algorithm. Three classical tests of association; score, Walds, and likelihood ratio tests are derived and their power of detecting genetic associations under different designs is compared. Simulations show that combining the three samples can increase the power to detect disease-associated variants, though a very large base sample set can compensate for the lack of controls.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Genetic Epidemiology
46 papers in training set
Top 0.1%
32.7%
2
PLOS ONE
4510 papers in training set
Top 19%
10.0%
3
Scientific Reports
3102 papers in training set
Top 14%
6.8%
4
Frontiers in Genetics
197 papers in training set
Top 0.9%
6.3%
50% of probability mass above
5
Bioinformatics
1061 papers in training set
Top 4%
6.3%
6
BMC Bioinformatics
383 papers in training set
Top 3%
3.6%
7
European Journal of Human Genetics
49 papers in training set
Top 0.3%
3.6%
8
PLOS Computational Biology
1633 papers in training set
Top 14%
2.1%
9
Human Brain Mapping
295 papers in training set
Top 3%
1.7%
10
The American Journal of Human Genetics
206 papers in training set
Top 2%
1.7%
11
Biology
43 papers in training set
Top 0.8%
1.7%
12
International Journal of Epidemiology
74 papers in training set
Top 2%
1.3%
13
Genetics
225 papers in training set
Top 3%
1.3%
14
Human Molecular Genetics
130 papers in training set
Top 2%
1.3%
15
Applied Sciences
24 papers in training set
Top 0.5%
1.2%
16
PLOS Genetics
756 papers in training set
Top 11%
1.2%
17
G3 Genes|Genomes|Genetics
351 papers in training set
Top 2%
0.9%
18
Statistics in Medicine
34 papers in training set
Top 0.3%
0.8%
19
Physical Biology
43 papers in training set
Top 2%
0.8%
20
Communications Biology
886 papers in training set
Top 21%
0.8%
21
Genes
126 papers in training set
Top 3%
0.8%
22
Nature Communications
4913 papers in training set
Top 63%
0.7%
23
Peer Community Journal
254 papers in training set
Top 4%
0.7%
24
Biometrics
22 papers in training set
Top 0.3%
0.6%
25
Briefings in Bioinformatics
326 papers in training set
Top 8%
0.6%