Rethinking covariate adjustment in psychiatric biomarker research: a framework applied to UK Biobank blood samples
Shin, M.; Crouse, J. J.; Hickie, I. B.; Wray, N. R.; Albinana, C.
Show abstract
ImportanceBlood-based biomarkers hold promise for psychiatric diagnosis and prognosis, yet clinical translation is constrained by poor reproducibility. Psychiatric biomarker studies are typically small, and demographic, behavioral, and temporal covariates often go undetected or cannot be adequately modeled. This may lead to residual confounding and unstable associations. ObservationsLeveraging UK Biobank data (N=~500,000), we systematically quantified how technical, demographic, behavioral, and temporal covariates influence 29 blood biomarkers commonly measured in research studies in psychiatry. Variance analyses showed substantial differences across biomarkers. Technical factors explained 1-6% and demographic factors explained 5-15% of the variance, with pronounced age-by-sex interactions for lipids and sex hormones. Behavioral covariates, particularly body mass index (BMI) and smoking, strongly influenced inflammatory markers. Temporal factors introduced systematic confounding. Chronotype was associated with blood collection time, multiple biomarkers exhibited marked diurnal rhythms (including testosterone, triglycerides, and immune markers), and inflammatory markers showed seasonal peaks in winter. In association analysis of biomarkers with major depression, bipolar disorder and schizophrenia, covariate adjustments attenuated or eliminated a substantial proportion of the biomarker-disorder associations, with BMI emerging as the dominant confounder. These findings demonstrate that such confounding structures exist and can be characterized in large cohorts, though specific biomarker-disorder relationships require validation in clinical samples. Conclusions and RelevancePoor reproducibility of biomarkers may not only stem from insufficient biological signal but also from inconsistent handling of confounders. We propose a systematic framework distinguishing technical factors (to be removed), demographic factors (addressed through adjustment or stratification), temporal factors (ideally controlled at design stages), and behavioral factors (requiring explicit causal reasoning). Associations robust to multiple adjustment strategies should be prioritized for clinical biomarker development. Standardized collection protocols, comprehensive covariate measurement, and transparent reporting across models are essential to improve reproducibility and identify biomarkers that reflect genuine illness-related pathophysiology.
Matching journals
The top 8 journals account for 50% of the predicted probability mass.