Back

Validity and Interpretation of Two-Sample Mendelian Randomization with Binary Traits

Wu, Z.; Wang, J.

2026-02-18 genetics
10.1101/2024.06.09.598150 bioRxiv
Show abstract

BackgroundTwo-sample Mendelian randomization (MR) is widely applied to binary exposures and outcomes. Yet standard MR models rely on linear effect assumptions that are difficult to interpret for binary traits. Although liability-based interpretations have been suggested, it remains unclear whether conventional summary-data MR is formally justified in this setting or what causal parameter it identifies. MethodsWe develop a liability-threshold framework in which binary traits arise from underlying continuous liabilities. We derive explicit relationships between genome-wide association study (GWAS) coefficients obtained from logistic or linear regression on binary traits and marginal genetic associations on the liability scale. Under small genetic effects, typical for complex traits, observed-scale GWAS coefficients are approximately proportional to liability-scale associations. ResultsThis proportionality implies that standard two-sample MR methods remain statistically coherent for binary traits. MR applied to binary exposures or outcomes estimates a scaled causal effect between underlying liabilities rather than an effect on the observed binary scale. The scaling factor depends primarily on trait prevalence and is directly computable. Simulations and UK Biobank analyses confirm that, after rescaling, MR using binary traits recovers liability-scale causal effects consistent with analyses based on continuous traits. ConclusionsWe provide a formal statistical justification for summary-data MR with binary traits and clarify the causal parameter being estimated. These results support routine MR practice for binary exposures and outcomes while enabling coherent interpretation of effect sizes. Key MessagesO_LIThe interpretation of two-sample MR with binary exposures or outcomes is often unclear because GWAS analyses are performed on the observed binary scale. C_LIO_LIUnder a liability threshold framework with small genetic effects, GWAS coefficients from logistic or linear regression on binary traits are approximately proportional to genetic associations on an underlying continuous liability scale. C_LIO_LIConsequently, conventional summary-data MR applied to binary or ordinal traits remains valid and estimates a scaled causal effect between liabilities, requiring no modification of existing methods. C_LI

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
International Journal of Epidemiology
74 papers in training set
Top 0.1%
33.4%
2
Genetic Epidemiology
46 papers in training set
Top 0.1%
8.5%
3
PLOS ONE
4510 papers in training set
Top 24%
7.3%
4
American Journal of Epidemiology
57 papers in training set
Top 0.1%
6.4%
50% of probability mass above
5
Human Molecular Genetics
130 papers in training set
Top 0.6%
4.2%
6
Bioinformatics
1061 papers in training set
Top 5%
4.0%
7
The American Journal of Human Genetics
206 papers in training set
Top 1%
4.0%
8
PLOS Genetics
756 papers in training set
Top 4%
3.6%
9
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.8%
3.3%
10
BMC Bioinformatics
383 papers in training set
Top 4%
1.9%
11
BMC Medicine
163 papers in training set
Top 3%
1.8%
12
European Journal of Epidemiology
40 papers in training set
Top 0.3%
1.7%
13
Nature Communications
4913 papers in training set
Top 51%
1.7%
14
PLOS Medicine
98 papers in training set
Top 2%
1.7%
15
PLOS Computational Biology
1633 papers in training set
Top 16%
1.7%
16
European Journal of Human Genetics
49 papers in training set
Top 0.8%
1.3%
17
BMC Medical Research Methodology
43 papers in training set
Top 1%
0.9%
18
Scientific Reports
3102 papers in training set
Top 70%
0.9%
19
Genetics in Medicine
69 papers in training set
Top 0.9%
0.9%
20
Psychological Medicine
74 papers in training set
Top 2%
0.8%
21
Trials
25 papers in training set
Top 2%
0.8%
22
GENETICS
189 papers in training set
Top 2%
0.7%
23
Statistics in Medicine
34 papers in training set
Top 0.5%
0.5%
24
Clinical Pharmacology & Therapeutics
25 papers in training set
Top 1%
0.5%