Epidemiology — Latest Matching Preprints

1

Widespread genetic effect heterogeneity impacts bias and power in nonlinear Mendelian randomization

Wang, J.; Morrison, J.

2026-04-20 epidemiology 10.64898/2026.04.17.26351133 medRxiv

Top 0.2%

2.1%

Show abstract

1Mendelian randomization (MR) uses genetic variants as instrumental variables to infer causal relationships between complex traits. Standard MR can be used to estimate an average causal effect at the population level, and typically assumes a linear exposure-outcome relationship. Recently, several methods for estimating nonlinear effects have been developed. However, many have been found to produce spurious empirical findings when subjected to negative control analyses. We propose that this poor performance may be attributable to heterogeneity in variant-exposure associations. We demonstrate that heterogeneous genetic effects on exposure lead to biased estimates, poor coverage, and inflated type I error in control function and stratification-based methods. In contrast, two-stage least squares (TSLS) methods are robust to such heterogeneity, but suffer from low precision and low power in some circumstances. We show that a statistical test for heterogeneity can be used to guide the choice of nonlinear MR methods. Using UK Biobank data, we reassess the causal effects of BMI, vitamin D, and alcohol consumption on blood pressure, lipid, C-reactive protein, and age (negative control). We find strong evidence of heterogeneity for all three exposures, and also recapitulate previous results that control function and stratification-based methods are prone to false positives. Finally, using nonparametric TSLS, we identify evidence of nonlinear causal effects of BMI on HDL cholesterol, triglycerides, and C-reactive protein; however, specific estimates of the shape of these relationships are imprecise. Altogether, our results suggest that common nonlinear MR methods are unreliable in the presence of realistic levels of heterogeneity, and that more methodological development is required before practically useful nonlinear MR is feasible.

2

Addition of Bupropion or Varenicline to Nicotine Replacement Therapy After Acute Coronary Syndrome: A Propensity-Matched Real-World Analysis

Qadeer, A.; Gohar, N.; Maniyar, P.; Shafi, N.; Juarez, L. M.; Mortada, I.; Pack, Q. R.; Jneid, H.; Gaalema, D. E.

2026-04-23 cardiovascular medicine 10.64898/2026.04.21.26351432 medRxiv

Top 0.2%

2.0%

Show abstract

Introduction: Smoking cessation after acute coronary syndrome (ACS) is a Class I recommendation, yet prescription pharmacotherapy use remains low and its real-world cardiovascular effectiveness when added to nicotine replacement therapy (NRT) is poorly characterized. Methods: We conducted a retrospective cohort study using the TriNetX US Collaborative Network (67 healthcare organizations). Adults hospitalized with ACS who received NRT within one month, serving as a proxy for active smoking status, were identified. Two co-primary propensity-matched (1:1, 50 covariates, caliper 0.10 SD) comparisons evaluated bupropion + NRT and varenicline + NRT individually versus NRT alone; a supportive analysis evaluated combined pharmacotherapy versus NRT alone. All-cause mortality was the primary endpoint. Secondary outcomes included MACE, heart failure exacerbations, major bleeding, TIA/stroke, emergency rehospitalizations, and cardiac rehabilitation utilization, assessed at 6 months and 1 year via Kaplan-Meier analysis. Hazard ratios (HRs) greater than 1.0 indicate higher hazard in the NRT-only group. Results: After matching, the combined analysis comprised 8,574 pairs, the bupropion analysis 4,654 pairs, and the varenicline analysis 2,126 pairs. At 1 year, the combined pharmacotherapy group had significantly lower all-cause mortality (HR 1.26, 95% CI 1.16-1.37), MACE (HR 1.16, 95% CI 1.12-1.21), heart failure exacerbations (HR 1.16, 95% CI 1.08-1.25), major bleeding (HR 1.18, 95% CI 1.08-1.28), and greater cardiac rehabilitation utilization (HR 0.82, 95% CI 0.74-0.92; all p < 0.001). TIA/stroke did not differ significantly. Six-month results were consistent. Both varenicline and bupropion individually showed lower mortality and MACE. A urinary tract infection falsification endpoint showed no between-group differences, supporting matching validity. The pharmacotherapy group had higher rates of new-onset depression, driven predominantly by bupropion recipients. Conclusions: In this propensity-matched real-world analysis, adding prescription smoking cessation pharmacotherapy to NRT after ACS was associated with lower mortality and fewer adverse cardiovascular events, supporting broader integration into post-ACS care pathways.

3

Modeling the impact of adherence to U.S. isolation and masking guidance on SARS-CoV-2 transmission in office workplaces in 2021-2022

Garcia Quesada, M.; Wallrafen-Sam, K.; Kiti, M. C.; Ahmed, F.; Aguolu, O. G.; Ahmed, N.; Omer, S. B.; Lopman, B. A.; Jenness, S. M.

2026-04-21 epidemiology 10.64898/2026.04.14.26350639 medRxiv

Top 0.2%

2.0%

Show abstract

Non-pharmaceutical interventions (NPIs) have been important for controlling SARS-CoV-2 transmission, particularly before and during initial vaccine rollout. During the pandemic, the US Centers for Disease Control and Prevention issued isolation and masking guidance in case of COVID-19-like illness, a positive SARS-CoV-2 test, or known exposure to SARS-CoV-2. However, the impact of this guidance on mitigating transmission in office workplaces is unclear. We used a network-based mathematical model to estimate the impact of this guidance on SARS-CoV-2 transmission among office workers and their communities. The model represented social contacts in the home, office, and community. We used data from the CorporateMix study to parametrize social contacts among office workers and calibrated the model to represent the COVID-19 epidemic in Georgia, USA from January 2021 through August 2022. In the reference scenario (58% adherence to guidance among office workers and the broader population), workplace transmission accounted for a small fraction of total infections. Reducing adherence among office workers to 0% increased workplace transmissions by 27.1% and increasing adherence to 75% reduced workplace transmission by 7.0%. Increasing adherence to 75% among office workers had minimal impact on symptomatic cases and deaths; increasing it among the broader population was more effective in reducing office worker cases and deaths. In our model, moderate adherence to recommended NPIs in workplaces was effective in reducing transmission, but increasing adherence had limited benefit given workplaces that have low contact intensity and hybrid work arrangements. These results underscore the public health benefits of community-wide adoption of recommended NPIs.

4

Demystifying Clone-Censor-Weight Method in Target Trial Emulation: A Real-World Study of HPV Vaccination Strategies

Lin, T.; Li, Y.; Huang, Z.; Gui, T. T.; Wang, W.; Guo, Y.

2026-04-22 health informatics 10.64898/2026.04.21.26351413 medRxiv

Top 0.2%

1.9%

Show abstract

Target trial emulation (TTE) offers a principled way to estimate treatment effects using real-world observational data, but analyses of time-varying treatment strategies remain vulnerable to immortal time bias. The clone-censor-weight (CCW) approach is increasingly used to address this problem, yet key aspects of its causal interpretation and implementation remain unclear. In this work, we emulate a target trial using electronic health records (EHRs) to compare completion of a 3-dose 9-valent human papillomavirus vaccination (HPV) series within 12 months versus remaining partially vaccinated among vaccine initiators. We link CCW to the classic potential outcome framework in causal inference, evaluate the role of different weighting mechanisms, and account for within-subject correlation induced by cloning using cluster-robust variance estimation. Our study provides practical guidance for applying CCW in real-world comparative effectiveness studies to address immortal time bias and supports more rigorous and interpretable treatment effect estimation in TTE.

5

Improving Care by FAster risk-STratification through use of high sensitivity point-of-care troponin in patients presenting with possible acute coronary syndrome in the EmeRgency department (ICare-FASTER): a stepped-wedge cluster randomized trial

Than, M.; Pickering, J. W.; Joyce, L. R.; Buchan, V. A.; Florkowski, C. M.; Mills, N. L.; Hamill, L.; Prystowsky, J.; Harger, S.; Reed, M.; Bayless, J.; Feberwee, A.; Attenburrow, T.; Norman, T.; Welfare, O.; Heiden, T.; Kavsak, P.; Jaffe, A. S.; apple, f.; Peacock, W. F.; Cullen, L.; Aldous, S.; Richards, A. M.; Lacey, C.; Troughton, R.; Frampton, C.; Body, R.; Mueller, C.; Lord, S. J.; George, P. M.; Devlin, G.

2026-04-23 cardiovascular medicine 10.64898/2026.04.21.26351433 medRxiv

Top 0.3%

1.7%

Show abstract

BACKGROUND Point-of-care (POC) high-sensitivity cardiac troponin (hs-cTn) testing has the potential to expedite decision-making and reduce emergency department (ED) length of stay for patients presenting with possible myocardial infarction (MI) by ensuring that results are consistently available when looked for by clinicians. We assessed the real-life effectiveness and safety of implementing POC hs-cTn testing in the ED. METHODS We conducted a pragmatic, stepped-wedge cluster randomized trial. The control arm was usual care with an accelerated diagnostic pathway utilizing a single-sample rule-out step with a central laboratory hs-cTn assay. The intervention arm used the same pathway with a POC hs-cTnI. The primary effectiveness outcome was ED length of stay assessed using a generalized linear mixed model, and the safety outcome was 30-day MI or cardiac death. RESULTS Six sites participated with 59,980 ED presentations (44,747 individuals, 61{+/-}19 years, 49.5% female) from February 2023 to January 2025, in which 31,392 presentations were during the intervention arm. After adjustment for co-variates associated with length of stay, the intervention reduced length of stay by 13% (95% confidence intervals [CI], 9 to 16%. P<0.001), corresponding to a reduction of 47 minutes (95%CI, 33 to 61 minutes) from a mean length of stay in the control arm of 376 minutes. The 30-day MI or cardiac death rate was similar in the control and intervention arms (0.39% and 0.39% respectively, P=0.54). CONCLUSIONS Implementation of whole-blood hs-cTnI testing at the POC into an accelerated diagnostic pathway was safe and reduced length of stay in the ED compared with laboratory testing.

6

Mediation analysis in longitudinal data: an unbiased estimator for cumulative indirect effect

Li, Y.; Cabral, H.; Tripodis, Y.; Ma, J.; Levy, D.; Joehanes, R.; Liu, C.; Lee, J.

2026-04-20 epidemiology 10.64898/2026.04.18.26351189 medRxiv

Top 0.3%

1.7%

Show abstract

Mediation analysis quantifies how an exposure affects an outcome through an intermediate variable. We extend mediation analysis to capture the cumulative effects of longitudinal predictors on longitudinal outcomes. Our proposed model examines how mediators transmit the effects of the current and previous exposure on the current outcome. We construct a least-squared estimator for cumulative indirect effect (CIE) and used three approaches (exact form, delta method, and bootstrap procedure) to estimate its standard error (SE). The estimator of CIE is unbiased with no unmeasured confounding and independent model errors between mediator model and outcome model at all time points, as shown in statistical inference and in simulations. While three SE estimates are numerically similar, bootstrap procedure is recommended due to its simplicity in implementation. We apply this method to Framingham Heart Study offspring cohort to assess if DNA methylation mediates the association of alcohol consumption with systolic blood pressure over two time points. We identify two CpGs (cg05130679 and cg05465916) as mediators and construct a composite DNA methylation score from 11 CpGs, which mediates for 39% of the cumulative effect. In conclusion, we propose an unbiased estimator for CIE. Future studies will investigate the missingness in mediators and outcomes.

7

The #aware.hiv Europe study: protocol for a stepped-wedge cluster randomised trial of a multimodal hospital-based implementation strategy targeting HIV indicator condition-guided testing in European hospitals.

Vliegenthart-Jongbloed, K. J.; Bunea, O.-M.; Fijołek, F.; Razzolini, I. P.; Barber, T. J.; Bernardino, J. I.; Nozza, S.; Psomas, C. K.; De Scheerder, M.-A.; Vasylyev, M.; Voit, F. M.; Jordans, C. C. E.; Willemsen, R.; van Wingerden, M. D.; Bienkowski, C.; Miron, V. D.; Felder, A.-K.; Hanssen, B.; Hontelez, J.; Li, Y.; Stutterheim, S.; Skrzat, A.; Sandulescu, O.; Rokx, C.; #aware.hiv Europe,

2026-04-20 hiv aids 10.64898/2026.04.17.26351141 medRxiv

Top 0.5%

0.8%

Show abstract

IntroductionAcross Europe, many people with HIV are diagnosed late despite repeated contact with hospital services for HIV indicator conditions. These conditions flag a possible underlying HIV infection for which HIV testing is recommended. They provide an opportunity to identify people with HIV, yet implementation of indicator condition based testing remains insufficient in hospital practice. The #aware.hiv Europe study was developed to address this gap by embedding HIV teams into routine care to normalise HIV testing. Methods and analysis#aware.hiv Europe is a stepped-wedge cluster randomised trial in 30 hospitals across ten European countries. Five clusters of 6 hospitals each will sequentially transition from control to implementation periods when local HIV teams led by an infectious diseases specialist will be installed. Intervention activities include hospital-wide peer audit and feedback on missed testing opportunities, targeted education, stigma reduction activities, and strengthening of linkage to HIV prevention and care. Patients with predefined HIV indicator conditions are identified using International Classification of Diseases, 10th Revision (ICD-10) diagnosis codes, confirmed through manual review. The primary outcome is the change in HIV testing rate among patients with confirmed HIV indicator conditions. Secondary outcomes include HIV case detection, cascades of diagnosis, care and prevention, variation in testing practices, healthcare professional knowledge and stigma, and implementation outcomes. Analyses will use mixed effects regression models accounting for clustering and time within the stepped-wedge design. Ethics and disseminationThe study has ethical approval in all hospitals to use routinely collected clinical data under exemption from informed consent for patient level data. Results will be disseminated through peer reviewed publications, conferences, and collaboration with clinical and community partners with the goal to inform HIV testing policies. Trial registrationClinicalTrials.gov NCT06900829. https://clinicaltrials.gov/study/NCT06900829 Strengths and limitations of this study+ Large, multinational, real-world, stepped-wedge, cluster randomized trial design. + Primary outcome derived from routinely collected clinical data, using a GDPR- and GCP-compliant approach with exemption from informed consent. + Hospital-wide intervention targeting care professionals, delivered through proactive expert HIV teams across departments powered to conclude on hard HIV care cascade clinical endpoints and stigma reducing interventions. + Implementation science design informed by established frameworks (CFIR and RE-AIM) to strengthen cross-continental generalisability. - Variation in healthcare systems and baseline testing practices across countries may contribute to heterogeneity in implementation and outcomes. - Despite standardised SOPs, local clinical judgement influences the assessment of HIV indicator conditions.

8

Racioethnic Disparities in Risk of Cardiometabolic Risk Factors and Cardiovascular Disease among Women Treated for Breast Cancer: The Pathways Heart Study

Yao, S.; Zimbalist, A.; Sheng, H.; Fiorica, P.; Cheng, R.; Medicino, L.; Omilian, A.; Zhu, Q.; Roh, J.; Laurent, C.; Lee, V.; Ergas, I.; Iribarren, C.; Rana, J.; Nguyen-Huynh, M.; Rillamas-Sun, E.; Hershman, D.; Ambrosone, C.; Kushi, L.; Greenlee, H.; Kwan, M.

2026-04-24 epidemiology 10.64898/2026.04.23.26351612 medRxiv

Top 0.5%

0.8%

Show abstract

Background: Few studies have examined racioethnic disparities in cardiovascular disease (CVD) in women after breast cancer treatment, who are at higher risk due to cardiotoxic cancer treatment. Methods: Based on the Pathways Heart Study of women with a history of breast cancer, this analysis examines the association between cardiometabolic risk factors (hypertension, diabetes, and dyslipidemia) and CVD events with self-reported race and ethnicity, as well as genetic similarity. Multivariable logistic and Cox proportional hazards regression models were used to test race and ethnicity and genetic similarity with prevalent and incident cardiometabolic risk factors and CVD events. Results: Of the 4,071 patients in this analysis, non-Hispanic Black (NHB), Asian, and Hispanic women were more likely to have prevalent and incident diabetes than non-Hispanic White (NHW) women. Analysis of genetic similarity revealed results consistent with self-reported race and ethnicity. For CVD risk, NHB women were more likely to develop heart failure and cardiomyopathy than NHW women. In contrast, Hispanic women were at lower risk of any incident CVD, serious CVD, arrhythmia, heart failure or cardiomyopathy, and ischemic heart disease, which was consistent with the associations found with Native American ancestry. Conclusions: This is the largest multi-ethnic study of disparities in CVD health in breast cancer survivors, demonstrating corroborating findings between self-reported race and ethnicity and genetic similarity. The results highlight disparities in cardiometabolic risk factors and CVD among breast cancer survivors that warrant more research and clinical attention in these distinct, high-risk populations.

9

Antenatal Screening for Sexually Transmitted Infections to Reduce Preterm Birth or Low Birthweight (Philani Ndiphile Study): A Randomized Three-Group Trial

Babalola, C. M.; Medina-Marino, A.; Mdingi, M. M.; Wilson, M. L.; Mukomana, F.; Muzny, C. A.; Taylor, C. M.; Gigi, R. M.; Jung, H.; Low, N.; Peters, R. P.; Klausner, J. D.

2026-04-21 sexual and reproductive health 10.64898/2026.04.15.26350805 medRxiv

Top 0.5%

0.7%

Show abstract

BackgroundChlamydia trachomatis, Neisseria gonorrhoeae, and Trichomonas vaginalis are curable sexually transmitted infections (STIs) associated with adverse birth outcomes. Most infections are asymptomatic. Whether antenatal STI screening improves birth outcomes remains uncertain. MethodsIn a randomized three-group trial in South Africa, pregnant women aged 18 years or older were assigned before 27 weeks gestation to: (1) screening and treatment for Chlamydia trachomatis, Neisseria gonorrhoeae, and Trichomonas vaginalis at enrollment, with tests-of-cure (One-Time Screening); (2) screening and treatment at enrollment, repeated at 30 to 34 weeks (Two-Time Screening); or (3) Standard-of-Care (Syndromic management). The primary outcome was a composite of preterm birth (<37 weeks gestation) or low birthweight (<2500 g), analyzed in the modified intention-to-treat population of participants with live births. Components of the composite outcome were evaluated individually as the main secondary outcomes. The study was registered with ClinicalTrials.gov, NCT04446611. FindingsOf 2247 enrolled participants, 1910 had live births. The composite outcome occurred in 22{middle dot}9% of the One-Time Screening group (risk ratio [RR] 0{middle dot}99; 95% confidence interval [CI] 0{middle dot}81-1{middle dot}21), 20{middle dot}6% of the Two-Time Screening group (RR 0{middle dot}89; 95% CI 0{middle dot}72-1{middle dot}09), compared with 23{middle dot}2% of the Standard-of-Care group. Preterm birth occurred in 18{middle dot}9% of the One-Time Screening group (RR 1{middle dot}00; 95% CI 0{middle dot}80-1{middle dot}26), 14{middle dot}5% of the Two-Time Screening group (RR 0{middle dot}77; 95% CI 0{middle dot}60-0{middle dot}99), and 18{middle dot}8% of the Standard-of-Care group. Low birthweight occurred in 14{middle dot}1% of the One-Time Screening group (RR 1{middle dot}10; 95% CI 0{middle dot}83-1{middle dot}46), 12{middle dot}9% of the Two-Time Screening group (RR 1{middle dot}01; 95% CI 0{middle dot}76-1{middle dot}35), and 12{middle dot}8% of the Standard-of-Care group. InterpretationNeither screening strategy for Chlamydia trachomatis, Neisseria gonorrhoeae, and Trichomonas vaginalis reduced the primary composite outcome of preterm birth or low birthweight, or low birthweight alone. The Two-Time antenatal STI screening strategy, however, reduced preterm birth by 23%.

10

Interventions to improve retention in HIV care: a systematic review and network meta-analysis of randomised controlled trials

Rehman, N.; Guyatt, G.; JinJin, M.; Silva, L. K.; Gu, J.; Munir, M.; Sadagari, R.; Li, M.; Xie, D.; Rajkumar, S.; Lijiao, Y.; Najmabadi, E.; Dhanam, V.; Mertz, D.; Jones, A.

2026-04-20 hiv aids 10.64898/2026.04.18.26351146 medRxiv

Top 0.6%

0.7%

Show abstract

BackgroundSustained retention in care supports continuous access to antiretroviral therapy, routine clinical monitoring, and long-term viral suppression. ObjectiveTo compare the effectiveness of interventions for improving retention in care among people living with HIV (PLHIV). DesignSystematic review and network meta-analysis Data sourcesPubMed, Embase, CINAHL, PsycINFO, Web of Science, and the Cochrane Library from 1995 to December 2024. Eligibility criteriaRandomised controlled trials (RCTs) evaluating interventions to improve retention in care, viral load suppression, or quality of life (QoL) among PLHIV, compared with standard of care (SoC) or other interventions. Data extraction and synthesisPairs of reviewers independently screened studies, extracted data, and assessed risk of bias using ROBUST-RCT. We conducted a fixed-effect frequentist network meta-analysis and rated interventions categories relative to SoC based on effect estimates effects and the certainty of evidence.. Dichotomous outcomes were summarized as odds ratios (ORs) with 95% confidence intervals (CIs), and continuous outcomes as mean differences (MDs) with 95% CI. ResultsEighty-four trials enrolling 107 137 PLHIV evaluated 13 intervention categories. For retention in care, five interventions supported by moderate or high certainty evidence proved superior to SoC: multi-month dispensing (OR 2.02, 95% CI 1.32 to 3.09), task shifting (OR 1.94, 95% CI 1.42 to 2.66), differentiated service delivery (OR 1.47, 95% CI 1.22 to 1.76), behavioural counselling (OR 1.36, 95% CI 1.21 to 1.54), and supportive interventions (OR 1.31, 95% CI 1.11 to 1.55). For viral load suppression, two interventions supported by moderate or high certainty evidence proved superior to SoC: task shifting (OR 2.07, 95% CI 1.25 to 3.43) and behavioural counselling (OR 1.34, 95% CI 1.11 to 1.67). Across outcomes, no intervention demonstrated convincing superiority over other active interventions. ConclusionsAmong 13 intervention categories, only a subset provided moderate or high-certainty evidence of superiority to the standard of care, and no superiority to other interventions. Persistent evidence gaps for key populations, diverse settings, and long-term outcomes support the need for context-sensitive and patient-centred interventions. RegistrationPROSPERO CRD42024589177 Strengths and limitations of this study[tpltrtarr] This systematic review followed Cochrane methods and was reported in accordance with PRISMA-NMA guidelines. [tpltrtarr]The network meta-analysis integrated direct and indirect evidence to compare multiple intervention categories within a single framework. [tpltrtarr]Risk of bias and certainty of evidence were assessed using ROBUST-RCT and the GRADE approach for network meta-analysis, respectively. [tpltrtarr]Some networks were sparse, and limited representation of key populations and long-term follow-up constrained the strength and generalisability of inferences.

11

AI/ML-based prediction of TB treatment failure: A systematic review and meta-analysis

Kamulegeya, R.; Nabatanzi, R.; Semugenze, D.; Mugala, F.; Takuwa, M.; Nasinghe, E.; Musinguzi, D.; Namiiro, S.; Katumba, A.; Ssengooba, W.; Nakatumba-Nabende, J.; Kivunike, F. N.; Kateete, D. P.

2026-04-22 infectious diseases 10.64898/2026.04.16.26350453 medRxiv

Top 0.6%

0.7%

Show abstract

BackgroundTuberculosis (TB) remains a leading cause of infectious disease mortality worldwide, and treatment failure contributes to ongoing transmission, drug resistance, and poor clinical outcomes. Artificial intelligence and machine learning approaches have attracted growing interest for predicting tuberculosis treatment outcomes, but the literature is heterogeneous and lacks a comprehensive synthesis. MethodsWe conducted a systematic review and meta-analysis of studies that developed or validated machine learning models to predict TB treatment failure. We searched PubMed/MEDLINE and Embase from January 2000 to October 2025. Studies were eligible if they developed, validated, or implemented an artificial intelligence or machine learning model for the prediction of TB treatment failure or a closely related poor outcome in patients receiving anti-TB treatment. Risk of bias was assessed using the Prediction model Risk Of Bias Assessment Tool. Random-effects meta-analysis was performed to pool area under the curve values, with subgroup analyses and meta-regression to explore heterogeneity. ResultsThirty-four studies were included in the systematic review, of which 19 reported area under the curve values suitable for meta-analysis (total participants, 100,790). Studies were published between 2014 and 2025, with 91% published from 2019 onward. Tree-based methods were the most common algorithm family (52.9%), and multimodal models integrating three or more data types were used in 41.2% of studies. The pooled area under the curve was 0.836 (95% confidence interval 0.799-0.868), with substantial heterogeneity (I{superscript 2} = 97.9%). In subgroup analyses, studies including HIV-positive participants showed lower discrimination (pooled area under the curve 0.748) compared to those excluding them (0.924). Only eight studies (23.5%) performed external validation, and only one study (2.9%) was rated as low risk of bias overall, primarily due to methodological concerns in the analysis domain. Eggers test suggested publication bias (p = 0.024). Major evidence gaps included underrepresentation of high-burden countries, HIV-affected populations, social determinants, pediatric TB, and extrapulmonary disease. ConclusionsMachine learning models for predicting TB treatment failure show promising discrimination but are not yet ready for routine clinical implementation. Performance varies substantially across populations and settings, and methodological limitations, including inadequate validation, poor calibration assessment, and high risk of bias, limit confidence in current estimates. Future research should prioritize rigorous external validation, calibration assessment, and development in underrepresented populations, particularly HIV-affected and high-burden settings. Author SummaryTB kills over a million people annually. While curable, treatment failure remains common and drives ongoing transmission and drug resistance. Researchers increasingly use artificial intelligence and machine learning to predict which patients will fail treatment, but it is unclear if these models are ready for clinical use. We reviewed 34 studies including nearly 1.1 million participants from 22 countries. On average, models correctly distinguished patients who would fail treatment from those who would not 84% of the time, a performance generally considered good. However, this average hid enormous variation. Models developed in populations including HIV-positive people performed substantially worse, suggesting prediction is harder with HIV co-infection. Worryingly, only one study used high-quality methods; 97% had serious flaws in handling missing data, checking calibration, or testing in new populations. Only eight studies validated their models in different settings. To conclude, we found that machine learning is promising in predicting TB treatment failure, but it is not ready for clinical use. Researchers should prioritize validation in high-burden settings, include social determinants, and improve methodological rigor before these tools can help patients.

12

Missed Opportunities for Stroke Prevention in Hypertensive Patients: A Retrospective Case-Control Study

Yang, H.; Liu, Y.; Kim, C.; Huang, C.; Sawano, M.; Young, P.; McPadden, J.; Anderson, M.; Burrows, J. S.; Krumholz, H. M.; Brush, J. E.; Lu, Y.

2026-04-22 cardiovascular medicine 10.64898/2026.04.21.26351407 medRxiv

Top 0.8%

0.5%

Show abstract

BackgroundHypertension is the leading modifiable risk factor for ischemic stroke, yet the adequacy of preventative hypertension care in routine clinical practice remains suboptimal. Whether gaps in hypertension management represent missed opportunities for stroke prevention remains unclear. ObjectiveTo evaluate the association between hypertension care delivery and the risk of incident ischemic stroke. MethodsWe conducted a retrospective, matched, nested case-control study among adults with hypertension using electronic health record data from a large regional health system (2010-2024). Patients with a first-ever ischemic stroke were matched 1:2 to controls on age, sex, race and ethnicity, and calendar time. Three care metrics were assessed during follow-up: (1) outpatient visits with blood pressure (BP) measurement per year; (2) number of antihypertensive medication ingredients; and (3) medication intensification score. Conditional logistic regression estimated adjusted odds ratios (aORs). ResultsThe study included 13,476 cases and 26,952 matched controls (N = 40,428). Mean (SD) age was 64.8 (12.2) years, 54.1% were female, and mean follow-up was 2,497 (1,308) days. Cases had fewer BP visits per year (median, 2.50 vs. 3.01; p < 0.001), similar number of medication ingredients (2.00 vs 2.00), and lower treatment intensification scores (-0.211 vs - 0.125). In adjusted models, >5 BP visits per year was associated with lower stroke odds (aOR, 0.55; 95% CI, 0.51-0.59) compared with [≤]1 visit. Use of 2-3 medication ingredients (vs 0) was also associated with reduced stroke odds (aOR, 0.80; 95% CI, 0.75-0.86), whereas >3 ingredients was not significant. The highest quartile of treatment intensification showed the strongest association (aOR, 0.47; 95% CI, 0.44-0.51). Findings were consistent across subgroup and sensitivity analyses, including strata defined by baseline SBP and follow-up SBP. ConclusionsGreater engagement in hypertension care was associated with lower odds of ischemic stroke, suggesting that gaps in routine management may represent missed opportunities for prevention.

13

Comparative fine-mapping of breast cancer susceptibility loci using summary statistics methods and multinomial regression

O'Mahony, D. G.; Beasley, J.; Zanti, M.; Dennis, J.; Dutta, D.; Kraft, P.; Kristensen, V.; Chenevix-Trench, G.; Easton, D. F.; Michailidou, K.

2026-04-22 epidemiology 10.64898/2026.04.21.26351364 medRxiv

Top 0.8%

0.5%

Show abstract

Summary statistics fine-mapping methods offer advantages over classical methods, including avoiding data-sharing constraints and improved modelling of correlated variables and sparse effects. However, its performance has not been comprehensively evaluated in breast cancer using real-world data. Previous multinomial stepwise regression (MNR) fine-mapping analyses for breast cancer identified 196 credible sets. Here, we apply summary statistics fine-mapping, compare methods, and assess parameters influencing performance. Using summary statistics from the Breast Cancer Association Consortium, we compared finiMOM, SuSiE, and FINEMAP to published MNR results across 129 regions. Performance was assessed by recall using in-sample and out-of-sample LD. Discordant credible sets were examined for technical factors, and target genes were defined using the INQUISIT pipeline. SuSiE showed the closest agreement with MNR. Results varied across regions depending on the assumed number of causal variants (L), with higher values reducing recall and no single L maximising performance. At optimal L per region, SuSiE identified 8,192 CCVs in 244 credible sets, with recall of 88%, 86%, and 72% for overall, ER-positive, and ER-negative breast cancer. Thirty MNR sets were missed. Discordance was partially explained by allele flips, imputation quality, and array heterogeneity. Fifty-two MNR-identified genes, including BRCA2, WNT7B and CREBBP were not recovered, while additional candidate genes were identified. Using out-of-sample LD reduced recall by 3% but identified novel variants. Fine-mapping results vary across methods, and no single approach is sufficient. The choice of L strongly influences results, and combining analytical approaches with functional validation can improve causal variant identification.

14

Robustly Quantifying Uncertainty in International Avian Influenza A(H5N1) Infection Fatality Ratios

Gada, L.; Afuleni, M. K.; Noble, M.; House, T.; Finnie, T.

2026-04-23 public and global health 10.64898/2026.04.22.26351373 medRxiv

Top 0.8%

0.5%

Show abstract

Knowing the mortality rates associated with infection by a pathogen is essential for effective preparedness and response. Here, harnessing the flexibility of a Bayesian approach, we produce an estimate of the Infection Fatality Ratio (IFR) for A(H5N1) conditional on explicit assumptions, and quantify the uncertainty thereof. We also apply the method to first-wave COVID-19 data up to March 2020, demonstrating the estimates that could be obtained were the model available then. Our analysis uses World Development Indicators (WDI) from the World Bank, the A(H5N1) WHO confirmed cases and deaths tracker by country (2003-2024), and COVID-19 cases and deaths data from John Hopkins University (January and February 2020). Since infectious disease dynamics are typically influenced by local socio-economic factors rather than political borders, individual countries are placed within clusters of countries sharing similar WDIs relevant to respiratory viral diseases, with clusters derived by performing Hierarchical Clustering. To estimate the IFR, we fit a Negative Binomial Bayesian Hierarchical Model for A(H5N1) and COVID-19 separately. We explicitly modelled key unobserved parameters with informative priors from expert opinion and literature. By modelling underreporting, our analysis suggests lower fatality (15.3%) compared to WHO's Case Fatality Ratio estimate (54%) on lab-confirmed cases. However, credible intervals are wide ([0.5%, 64.2%] 95% CrI). Therefore, good preparedness for a potential A(H5N1) pandemic implies adopting scenario planning under our central estimate, as well as for IFRs as high as 70%. Our approach also returns a COVID-19 IFR estimate of 2.8% with [2.5%, 3.1%] 95% CrI which is consistent with literature.

15

Peer support boosted Hepatitis C treatment access among marginalised populations in England: A Bayesian causal factor analysis.

Schmidt, C.; Samartsidis, P.; Seaman, S.; Emmanouil, B.; Foster, G.; Reid, L.; Smith, S.; De Angelis, D.

2026-04-22 health policy 10.64898/2026.04.20.26351261 medRxiv

Top 0.9%

0.4%

Show abstract

To minimise health disparities, equitable access to medical treatment is paramount. In a pioneering intervention, National Health Service Englands Hepatitis C virus (HCV) programme has implemented country-wide peer support to boost treatment access. Peer support workers (peers) are individuals with relevant lived experience, who promote testing and treatment in marginalised populations underserved by traditional health services. We evaluated the English peers intervention, exploiting its staggered rollout and rich surveillance data between June 2016 and May 2021. Peers increased HCV cases identified by 13{middle dot}9% (95% credible interval (95% CrI) [5{middle dot}3, 21{middle dot}7]), sustained viral responses by 8{middle dot}0% (95% CrI [-4{middle dot}4, 18{middle dot}6]), and drug services referrals by 8{middle dot}8% (95% CrI [-12{middle dot}5, 22{middle dot}6]). The interventions effectiveness was magnified during the first COVID-19 lockdown and individuals supported by peers typically belonged to populations with poor treatment access. Our findings indicate that peers can boost equity in treatment access on a national scale.

16

Impact of Azithromycin Administration at Hospital Discharge on Antimicrobial Resistance and Enteropathogen Carriage 3 Months Following Treatment

Mogeni, P.; Ochieng, J. B.; Kariuki, K.; Rwigi, D.; Atlas, H. E.; Tickell, K. D.; Aluoch, L. R.; Sonye, C.; Apondi, E.; Ambila, L.; Diakhate, M. M.; Singa, B. O.; Liu, J.; Platts-Mills, J. A.; Saidi, Q.; Denno, D. M.; Fang, F. C.; Walson, J. L.; Houpt, E. R.; Pavlinac, P. B.

2026-04-20 epidemiology 10.64898/2026.04.17.26351054 medRxiv

Top 1.0%

0.4%

Show abstract

BackgroundThe Toto Bora trial tested whether a course of azithromycin reduced rates of re-hospitalization or death in the 6 months following hospitalization among Kenyan children. We hypothesized that azithromycin would reduce enteric bacteria and increase carriage of macrolide resistance in the subsequent 3 months. MethodsKenyan children (1-59 months) hospitalized and subsequently discharged for non-traumatic conditions provided fecal samples before and 3 months after randomization to a 5-day course of azithromycin or placebo. Quantitative PCR identified enteropathogens and AMR-conferring genes in fecal samples. Generalized estimating equations assessed the impact of the randomization arm on pathogen and resistance gene detection, accounting for baseline presence and site. ResultsAmong 1,393 baseline stools, 12.4% had at least one bacterial enteropathogen, 94.7% had at least one macrolide-resistance gene, and 92.6% had at least one beta-lactamase-resistance gene identified. At month 3, children randomized to azithromycin had a 6.1% higher likelihood of carrying a macrolide resistance gene compared to placebo (adjusted prevalence ratio [aPR], 1.06; 95% CI, 1.04-1.08; P<0.001). Specifically, azithromycin randomization was associated with a higher relative prevalence of erm(B) (aPR, 1.09 [95% CI, 1.04-1.15]; P=0.001), erm(C) (aPR, 1.23 [95% CI, 1.14-1.31]; P<0.001), msr(A) (aPR, 1.14 [95% CI, 1.04-1.25]; P=0.007), and msr(D) (aPR, 1.07 [95% CI, 1.03-1.11]; P=0.001). There was no difference in overall bacterial pathogen prevalence (18.9% vs 17.3%) between randomization arms, but a slightly lower proportion of children had Shigella after randomization in the azithromycin arm (3% vs. 5%, aPR, 0.79 [95% CI, 0.62, 1.01]; P=0.063). InterpretationAzithromycin at hospital discharge was associated with higher carriage of macrolide-resistance-conferring genes in the post-discharge period compared with placebo, without significant declines in enteric pathogen carriage other than modest changes to Shigella. The potential benefits and risks of empiric azithromycin need to be considered, as children are increasingly exposed to this broad-spectrum antibiotic.

17

Fentanyl Purity and Overdose Decline: A Reexamination of Geographic Trends

Dasgupta, N.; Sibley, A. L.; Gildner, P.; Gora Combs, K.; Post, L. A.; Tobias, S.; Kral, A. H.; Pacula, R. L.

2026-04-24 epidemiology 10.64898/2026.04.23.26351605 medRxiv

Top 1%

0.3%

Show abstract

Drug overdose deaths in the United States reached record levels during the fentanyl era before recently declining. A plausible hypothesis is that a sudden drop in fentanyl purity beginning in 2023 caused the downturn in overdose mortality. We evaluated this hypothesis by replicating a published analysis with regional overdose data, using models that account for time trends and autocorrelation, and negative control indicators to test for spurious correlation. When fentanyl purity was rising, the national purity series did not track overdose increases in most regions and showed only a modest association in the West. When both purity and mortality later declined, the observed associations were also seen with unrelated macroeconomic indicators that shared the same time pattern. National fentanyl purity alone does not provide a sufficient explanation for recent overdose declines.

18

Diagnostic Delays Drive Transmission in Dense Cities: Modeling the Waiting-Window Effect and Its Mitigation

Bahig, S.; Oughton, M.; Vandesompele, J.; Brukner, I.

2026-04-22 epidemiology 10.64898/2026.04.20.26350946 medRxiv

Top 1%

0.3%

Show abstract

In dense urban settings, delays between diagnostic sampling and effective isolation can sustain transmission during peak infectiousness. We define a waiting-window transmission externality arising when infectious individuals remain mobile while awaiting results, formalized as E = N{middle dot}P{middle dot}TR{middle dot}D, where N is daily testing volume, P test positivity, TR transmission during the waiting period, and D turnaround time. Using Monte Carlo simulation and a susceptible-infectious-recovered (SIR) framework, we quantify excess infections per 1,000 tests/day under multiple diagnostic workflows. A surge scenario incorporates positive coupling between TR and D ({rho} = 0.45), reflecting co-occurrence of laboratory saturation and elevated contacts during system stress. Under centralized 48-hour workflows, excess infections reach [~]80 at P = 10% and [~]401 at P = 50%, increasing to [~]628 under surge conditions. In contrast, near-patient rapid testing and home sampling reduce this to [~]5 and [~]25-26, respectively. Workflows that eliminate the waiting window--either through immediate isolation at sampling or through home-based PCR that returns results at the point of collection--effectively collapse the transmission term. These findings identify diagnostic delay as a modifiable driver of epidemic dynamics. Operational redesign of testing workflows, including decentralized sampling and home-based molecular diagnostics, offers a scalable pathway to improve epidemic controllability and reduce inequities in dense urban environments.

19

Effects of Mindfulness-Based Interventions on Executive Function in Children and Adolescents: A Systematic Review and Meta-Analysis

Li, N.

2026-04-20 psychiatry and clinical psychology 10.64898/2026.04.18.26351184 medRxiv

Top 1%

0.3%

Show abstract

BackgroundMindfulness-based interventions (MBIs) have been increasingly adopted in educational settings to support cognitive development in youth. Executive function (EF)--encompassing inhibitory control, working memory, and cognitive flexibility--is a plausible target of MBI given its reliance on attention regulation. However, prior reviews have yielded mixed conclusions, partly due to inconsistent construct definitions and the pooling of heterogeneous outcome measures. ObjectivesTo (1) estimate the pooled effect of MBI on EF in youth aged 3-18 years using only construct-validated, direct EF measures, (2) examine potential moderators including age group, EF domain, and risk of bias, and (3) test dose-response relationships via meta-regression on intervention duration. MethodsWe searched PubMed, PsycINFO, CINAHL, Scopus, and Web of Science from inception to March 2026, supplemented by reference-list searches from two existing systematic reviews and a scoping review. Only English-language publications were eligible. Eligible studies were randomised controlled trials (RCTs) or quasi-RCTs of MBI (excluding yoga-only interventions) in typically developing youth, with at least one direct behavioural or computerised EF outcome. Risk of bias was assessed using Cochrane RoB 2. Hedges g was computed for each study, and pooled using a DerSimonian-Laird random-effects model. Subgroup analyses by age group, EF domain, and risk of bias were conducted, alongside leave-one-out sensitivity analyses, Eggers regression test, trim-and-fill, and Knapp-Hartung-adjusted meta-regression on intervention duration. Evidence certainty was rated using GRADE. ResultsThirteen RCTs (nine school-age, four preschool; total N = 1,560) met inclusion criteria. The pooled effect was g = 0.365 (95% CI 0.264 to 0.465; p < .00001), with negligible heterogeneity (I2 = 0.0%; Q = 6.76, p = .87). Effects were consistent across age groups (school-age g = 0.389; preschool g = 0.318) and EF domains (inhibitory control, working memory, cognitive flexibility; pbetween = .60). Meta-regression on intervention duration (4-20 weeks) was non-significant (p = .79). The effect was robust in leave-one-out analyses, in the low risk-of-bias subgroup (g = 0.361; k = 8), and after trim-and-fill adjustment (g = 0.354). The 95% prediction interval (0.252 to 0.477) was entirely positive. GRADE certainty was rated MODERATE, downgraded once for risk of bias. ConclusionsMBIs appear to produce a small, statistically significant improvement in EF in youth aged 3-18 years, with moderate certainty of evidence per the GRADE framework. The effect is consistent across preschool and school-age samples and across EF domains, with no significant dose-response relationship within the 4-20 week range studied. Emerging mediation evidence suggests that EF improvement may serve as an important pathway through which MBI supports emotion regulation, though this requires replication. Further large-scale, pre-registered RCTs with active control conditions and longitudinal follow-up are warranted.

20

Menopause in the All of Us Research Program: A Descriptive Summary of Electronic Health Record and Survey Response across Sociodemographic Characteristics

Staples, J. W.; White, S. L.; Giacalone, A.; Pozdeyev, N.; Sammel, M. D.; Stranger, B. E.; Valencia, C. I.; Santoro, N.; Hendricks, A. E.

2026-04-25 sexual and reproductive health 10.64898/2026.04.17.26351129 medRxiv

Top 1%

0.2%

Show abstract

Objective. Menopause is a significant physiological transition with implications for health outcomes (e.g., cardiometabolic), yet gaps remain in understanding the menopause transition, including how menopause timing and type influence health outcomes. Large-scale cohort studies in midlife (age~40-60) females, including the All of Us Research Program (AoURP), provide opportunities to study menopause across diverse populations and data modalities. We characterized menopause-related data in AoURP, focusing on age distributions and concordance between EHR diagnosis codes and self-reported survey responses. Methods. We analyzed menopause-related survey, EHR diagnostic code, and genomic data among ~396,000 participants in AoURP with female sex. We summarized menopause data across modalities, overlap between survey, EHR, and genomic data, and age distributions overall and across sociodemographic characteristics. Results. Among ~396,000 females, surveys captured ~193,000 menopause observations, nearly seven times more than structured EHR diagnoses (~28,000), suggesting under- ascertainement in EHR data. Nearly all females (~99%) with an EHR menopause diagnosis also reported menopause in the survey. Approximately 22,000 participants had intersected EHR, survey, and genomic menopause-related data. Survey-based age patterns matched expectations, with participants <40 years predominantly reporting pre-menopausal status and those >60 years predominantly reporting post-menopausal status. A small subset (N{approx}1,700; 4%) (age>70 years) reported no menopause, suggesting response or recall bias. EHR menopause codes were concentrated after age>45 years, with a notable spike at age 65. Modest differences in survey-based menopause age distributions were observed by sociodemographic characteristics (e.g., race, ancestry). Conclusions. These findings inform sampling strategies, power calculations, phenotype definition, and study design for menopause research using AoURP.