Epidemiology
○ Ovid Technologies (Wolters Kluwer Health)
Preprints posted in the last 30 days, ranked by how well they match Epidemiology's content profile, based on 26 papers previously published here. The average preprint has a 0.02% match score for this journal, so anything above that is already an above-average fit.
Irlmeier, R.; Jin, Z.; Ye, F.
Show abstract
Background Simon two-stage designs for binary endpoints and their time-to-event analogues, including the Kwak and Jung method, rely on a fixed null benchmark. Their Type I error control is valid only when that benchmark is correctly specified. In practice, historical benchmarks are often inconsistent due to small samples, population heterogeneity, changing eligibility criteria, and evolving standards of care. Even modest misspecifications can substantially inflate the Type I error rate, leading to costly advancement of ineffective treatments. Methods We propose the Interval-Null Robust (INR) two-stage design framework that accounts for uncertainty in the historical null benchmark. We define the null hypothesis as a plausible range of clinically uninteresting values: p[isin][p0L, p0U] for binary endpoints and {lambda}[isin][{lambda}0L, {lambda}0U] (or equivalent survival probabilities) for time-to-event endpoints. Type I error is controlled uniformly over the full null interval: sup{theta}[isin]{theta}0 Pr{theta}(Go) [≤] . Under the monotonicity of the Go probability, the supremum occurs at the least favorable null configuration - p0U and {lambda}0L - but the design is not reduced to a point-null formulation. The interval defines the uncertainty set for error control and is used in selecting among feasible designs through robust criteria such as worst-case regret or minimal average expected sample size. Results Across representative planning scenarios for both endpoint types, classic designs calibrated to a single benchmark exhibit substantial Type I error inflation when the true null parameter exceeds the assumed planning value. INR designs maintain the nominal Type I error rate across the full null interval, directly addressing this vulnerability to benchmark misspecification. The robustness-efficiency trade-off can be managed through design constraints and robust optimization criteria while preserving uniform Type I error control. Conclusions INR two-stage designs offer a transparent framework for addressing historical control uncertainty in single-arm Phase II trials. By replacing reliance on a fixed benchmark assumption with a more realistic interval of clinically plausible null values, INR design reduces the risk of false-positive Go-decisions caused by benchmark misspecification. INR applies to both binary and time-to-event endpoints and is implemented in the open-source INRDesign R package and accompanying interactive Shiny app.
Hagan, J.
Show abstract
Background. Cross-validation (CV) is widely used to estimate predictive performance, but can overestimate performance when applied at the observation level to repeated-measures data. When continuous predictor variables are measured repeatedly within subjects and the binary outcome is defined at the subject level, naive observation-level CV introduces data leakage through within-subject dependence, producing optimistically biased estimates of the area under the receiver operating characteristic curve (AUROC). The magnitude of this bias and the performance of alternative partitioning strategies have not been formally characterized for this data structure. Methods. Three CV strategies were compared for estimating subject-level AUROC in ridge logistic regression models: naive observation-level 10-fold CV, subject-level 10-fold CV, and leave-one-cluster-out (LOCO) CV. The framework was applied to a motivating clinical dataset of daily oxygenation measures and retinopathy of prematurity outcomes among 101 extremely low birth weight infants. A factorial simulation study was conducted across 162 parameter combinations varying cluster count (20-150), intraclass correlation (0.1-0.5), within-cluster autocorrelation (0.2-0.8), and outcome prevalence (10-35%), with 500 simulated datasets per condition (76,389 valid datasets total). Results. In the motivating dataset, naive CV produced optimism of +0.078 AUROC units for severe ROP prediction (15 events, 101 subjects) and +0.031 for any ROP prediction (48 events). Subject-level 10-fold CV closely approximated LOCO (deviation [≤] 0.015). In the simulation, naive CV optimism ranged from +0.039 to +0.204 across all conditions, increasing monotonically with higher ICC, higher autocorrelation, fewer clusters, and lower event rates. Subject-level 10-fold CV was essentially unbiased relative to LOCO across all 162 conditions (mean absolute deviation = 0.002). Conclusions. Naive observation-level CV meaningfully overestimates discriminative performance in the repeated-measures binary outcome setting and should not be used. Subject-level CV partitioning effectively eliminates this bias. Accordingly, subject-level partitioning should be considered essential, not optional, when validating prediction models using repeated-measures data with subject-level outcomes.
Owusu-Boaitey, N.; Meyer, M. J.; Herrera-Esposito, D.; Bottcher, L.; Lukz, M.; Cook, S.; Stoto, M. A.; Kraemer, J. D.
Show abstract
Seroprevalence surveys reveal the extent of humoral immunity against pathogens such as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), and under some circumstances represent cumulative incidence of prior infection. However, antibody waning - or seroreversion - biases these estimates by reducing assay sensitivity in a time-varying manner. Because assay sensitivity decays over time, naively using serosurveys can substantially bias estimates of SARS-CoV-2 cumulative incidence and fatality rates. The Bayesian assay-specific, time-varying sensitivity adjustment developed in this paper can reliably correct for this bias and account for the delay between infection and serosurvey. In seroprevalence studies conducted in the United States in 2020, adjusting for time-varying sensitivity increased cumulative incidence by up to 1.4-fold, with an adjustment of 1.08 for a national study. Our estimates contrast with a previously published 2-fold adjustment that did not account for assay design. This suggests that previous analyses overestimated cumulative incidence by applying seroreversion corrections that did not account for assay-specific effects, or underestimated cumulative incidence by not applying seroreversion corrections. These biases imply fatality rate underestimation and overestimation, respectively. Our model provides a framework for design-specific time-varying sensitivity corrections in seroprevalence surveys for other pathogens.
Kleper, S. L.; Melamed, R. D.
Show abstract
Machine learning models for causal inference aim to adjust for confounding factors that are associated with both an exposure and an outcome, creating a spurious biased association. But, these methods are rarely empirically evaluated to assess their success in mitigating such bias. Recent advances in knowledge representation, including both foundation models and knowledge graphs, could enrich these models, but rigorous evaluations are needed in order to assess their potential. Here, we ask whether enriching existing causal inference models with knowledge representations from foundation models can improve confounding control. Rather than using semi-simulated data to address this question, we focus on examples of real confounding: we emulate target randomized active comparator trials that are subject to confounding by indication. Our results can guide researchers aiming to develop or apply methods for discovering causal effects from observational data.
Beer, S.; Simpkin, A. J.; Eldeeb, S. Y.; Zar, H. J.; Stein, D. J.; Dunn, E. C.; Smith, A. D. A. C.
Show abstract
Background: In prospective cohort studies, where an exposure is collected repeatedly, interest often lies in determining whether the timing of that exposure has a differential effect on a later outcome. The Structured Life Course Modeling Approach (SLCMA), where users select between temporal hypotheses of exposure specified a priori, provides one way to analyse such longitudinal data. However, few studies using SLCMA consider the effect of time-varying covariates (TVC) which may impact associations. Methods: We present a modified version of the SLCMA - called direct and mediated effects (DME)-SLCMA - which corrects for TVC. We first develop the DME-SLCMA method, test it through simulation, and apply it to psychosocial data from the Drakenstein Child Health Study (DCHS, n=336) to investigate relationships between maternal psychopathology, TVC of socioeconomic status, and offspring depressive symptoms. Results: We found that, on average, offspring depressive symptoms score increased by 3.9% (95% CI: 1.0%-6.9%, p = 0.039) for each unit of maternal psychopathology (SRQ) at 48 months whilst adjusting for time-varying socioeconomic status (at 18, 30, 42 and 54 months). Our simulations identified several realistic scenarios where selections ignoring TVC - with TVC mediated exposure effects present - were prone to be incorrect, including our DCHS example. Conclusion: DME-SLCMA is a robust new approach for life course modelling in the presence of time-varying covariates. We recommend adjusting for TVC whenever possible, and, when not possible, our simulation study identified that scenarios where mediated effects are comparable, or greater, in magnitude to direct effects are most prone to confounding.
Obeng-Gyasi, E.
Show abstract
Background: Mixture epidemiology deploys sophisticated estimators, Bayesian kernel machine regression with causal mediation analysis (BKMR-CMA), quantile G-computation (QGC), and parametric G-computation, alongside conventional regression. Comparative evaluations have assumed additive, non-mediated data-generating processes, leaving conditions under which estimator choice determines causal validity uncharacterized. Methods: We developed a simulation framework using military-relevant exposure distributions (metals, per- and polyfluoroalkyl substances [PFAS], polychlorinated biphenyls [PCBs]) and allostatic load (AL) across three deployment tiers, with parameters drawn from military occupational health and contamination literature. Four data-generating processes were specified as directed acyclic graphs: direct effects with confounding (M1), full mediation through AL (M2), synergistic AL-exposure interaction (M3), and collider structure (M4). We evaluated ordinary least squares (OLS), QGC, G-computation, and BKMR-CMA on bias, root mean squared error, and 95% confidence interval coverage across 500 Monte Carlo replications at n = 500 and n = 1,000. Results: No estimator dominated across all mechanisms. Under M1, OLS and G-computation produced near-identical modest positive bias; BKMR-CMA achieved lower root mean squared error through kernel shrinkage. Under M2, BKMR-CMA exhibited severe positive bias for AL (mean bias = +0.579 SD units; coverage = 32.8%). Under M3, BKMR-CMA was the only estimator achieving nominal 95% coverage for AL (95.2%), while regression-based approaches fell to 83.6%. Under M4, G-computation produced persistent bias and near-zero coverage for lead, reflecting structural non-identification. Conclusions: Estimator validity is fundamentally mechanism-dependent. Researchers should base estimator choice on explicit causal assumptions about whether AL functions as confounder, mediator, moderator, or collider, particularly in military and occupational cohorts. We provide a mechanism-to-estimator mapping for applied researchers.
Jones, L.; Ergas, R.; Tibbs, A.; Russo, E. T.; Norville, J.; Bingay, B.; Brown, C. M.; Reich, N. G.; Pasco, R.
Show abstract
Background Pediatric immunizations for Respiratory Syncytial Virus (RSV), including monoclonal antibodies for infants and vaccines for pregnant people, have become broadly available and can prevent severe RSV outcomes in infants. However, quantifying the impact of RSV immunization in prevention of severe pediatric illness at the population-level is limited by lack of RSV case surveillance data. The Massachusetts Department of Public Health (DPH) conducted a modeling analysis using routine public health surveillance data to estimate the state-level impact of new RSV immunization products on Emergency Department (ED) visits and hospitalizations in Massachusetts for highest risk pediatric groups. Methods A scenario projection tool, called R.Scenario.Vax, was utilized to simulate RSV-associated ED hospital encounters by age group in the context of newly available immunizations. ED visit and hospitalization data from the National Syndromic Surveillance Program (NSSP) during the time period 10/08/2017--10/19/2024 were analyzed, scaled to account for changes in RSV testing practices over time and missing encounter volume in historic data, and utilized to inform model fit of a "typical" RSV season. RSV immunization data from the Massachusetts Immunization Information System (MIIS) for the 2023--2024 and 2024--2025 RSV seasons informed high and moderate pediatric RSV immunization coverage scenarios and their impact was compared to a counterfactual reference scenario of no new immunizations. Median projections were quantitatively and qualitatively compared to observed 2024--2025 season data. Percent reduction in hospital encounters and encounters averted per 10,000 population were calculated for each scenario as compared to the reference. Results Projections for the youngest at-risk age groups showed significantly lower RSV-associated ED visits and hospitalizations during the 2024--2025 season for both high and moderate immunization coverage scenarios. Median projections for infants under 6 months old in the highest coverage scenario, wherein nearly all infants were immunized, showed 72.6% lower ED visits and 73.4% lower hospitalizations when compared to the reference scenario, equating to 262 ED visits and 85 hospitalizations averted per 10,000 population. Conclusions Our results support the use of modeling methods for public health insights and suggest that RSV immunizations for infant populations result in significantly lower RSV-related ED encounters in Massachusetts.
gahan, k.
Show abstract
Abstract Background. Area-level cancer disparities are routinely estimated from public county data in which rates based on small counts (fewer than 16 cases or deaths) are suppressed. Analysts typically drop suppressed counties (complete-case analysis). Because suppression depends on case counts tied to population size and demographic composition, this missingness may be informative, but its effect on the disparity estimate has not, to our knowledge, been quantified. Methods. In a cross-sectional ecological study of 3,143 U.S. counties (analytic sample 3,018 with computable exposure) using one frozen public release of NCI State Cancer Profiles incidence and mortality data and ACS 2018-2022 5-year data, we estimated the most- versus least-deprived ICE(race+income) quintile rate ratio (RR) and rate difference for female breast, stomach, and cervix cancers under four suppression-handling methods: complete-case, available-case, bounding, and model-based small-area estimation. We characterized which counties were erased, and, following the ADEMP framework, ran a Monte Carlo simulation (1,000 replicates per cell; Monte Carlo standard error of bias approximately 0.0025) calibrated to the release to measure bias against a known truth. Analyses were pre-registered. Results. The suppressed fraction rose with rarity: 7.4% of counties for breast, 61.3% for stomach, and 75.7% for cervix incidence. Suppression was concentrated in the most-deprived quintile (cervix, 81.8% suppressed vs 63.8% least-deprived) and overwhelmingly removed rural rather than minority residents (cervix: 81% of the rural but 9% of the minority population erased). For breast (little suppression) the RR was 0.87 (95% CI 0.85-0.89) and identical across methods; for cervix incidence the complete-case RR (1.56) exceeded the model-based estimate (1.50), and for cervix mortality (91% suppressed) complete-case (1.86) exceeded model-based (1.56) by 16% with a wide bounding interval (1.88-2.62). In calibrated simulation, population-weighted complete-case bias was small (less than 2%) at the observed deprivation-county-size correlation and grew with rarity, threshold, and unweighted aggregation; its direction was conditional, becoming positive (over-estimation) as deprived counties became smaller. Conclusions. Complete-case handling of suppressed counties over-estimates rare-cancer area disparities relative to methods that retain them, while silently erasing most of the rural and most-deprived communities the estimate is meant to represent. The effect is negligible for common cancers and grows with rarity. Public-data disparity analyses should report the suppressed fraction and use bounded or model-based estimates by default. Keywords: cancer disparities; small-count suppression; Index of Concentration at the Extremes; informative missingness; small-area estimation; rural health.
Sood, E.; Canter, K.; Arasteh, K.; Kazak, A. E.
Show abstract
Background: Maternal mental health problems are common after prenatal diagnosis of congenital heart disease (CHD), with long-term implications for child and family wellbeing. HEARTPrep is a prenatal psychosocial intervention with three self-paced modules and corresponding telehealth sessions, delivered during pregnancy via mobile app to improve mental health and wellbeing for mothers expecting a baby with CHD. This proof-of-concept study evaluated the feasibility of HEARTPrep and examined maternal mental health and psychosocial functioning throughout participation. Methods: Participants were mothers receiving care for a fetal CHD diagnosis within one health system. Feasibility was assessed via rates of enrollment and completion. Mothers completed 4-item PROMIS questionnaires assessing anxiety, depression, and social isolation and reported self-efficacy and hope on a weekly basis throughout HEARTPrep. Results: Of 34 recruited mothers, 29 (85%) enrolled and two were subsequently not eligible (delivery prior to participation, change in fetal diagnosis), resulting in a final sample of 27 mothers. The majority (n = 22, 81%) completed all three telehealth sessions and Modules 1 (n = 22, 81%) and 2 (n = 19, 70%), with just over half (n = 14, 52%) completing Module 3 prior to delivery. Mean PROMIS depression T-scores decreased from 57.5 to 52.9, and 48% of mothers had a decrease in depression scores exceeding the meaningful change threshold (half standard deviation). The percentage of mothers reporting high self-efficacy increased from 19% to 48%. Conclusions: HEARTPrep is feasible and corresponds with reduced maternal depression and increased self-efficacy, supporting proof-of-concept. A randomized controlled trial is needed to determine whether HEARTPrep improves outcomes compared to a control group.
Agarwal, T.; Namburu, J. R.; Kachroo, P.
Show abstract
Background: Pregnancy loss has important implications for womens health. Although maternal age is a well-established risk factor, the contribution of routinely measured cardiometabolic and behavioral markers at population-scale remains incompletely characterized. Objective: To examine associations between cardiometabolic, nutritional, and behavioral risk markers and pregnancy loss among U.S. women of reproductive age. Methods: We conducted a cross-sectional analysis of 4,842 U.S. women aged 20-44 years with [≥]1 pregnancy using the National Health and Nutrition Examination Survey data (2013-2023). Pregnancy loss was defined as [≥]1 prior miscarriages. Exposures included body mass index, smoking exposure (cotinine), lipid biomarkers, vitamin D and folate, and a composite cardiometabolic-nutritional risk score. Survey-weighted logistic regression estimated adjusted odds ratios (aORs) and 95% confidence intervals, with bootstrap resampling for predictor robustness. Results: The weighted prevalence of pregnancy loss was 23%. Higher odds of pregnancy loss were associated with increasing age (aOR per year=1.02; 95% CI: 1.00-1.04), Non-Hispanic Black race (aOR=1.32; 95% CI: 1.00-1.74), overweight (aOR=1.56; 95% CI: 1.16-2.11), obesity (aOR=2.06; 95% CI: 1.39-3.05), and smoking (aOR=1.58; 95% CI: 1.19-2.10). Adverse lipid profiles, particularly elevated triglycerides (aOR=1.83; 95% CI: 1.16-2.90) and high low-density lipoprotein (aOR=2.97; 95% CI: 1.45-6.61), were independently associated with pregnancy loss. Vitamin D/folate were not stable predictors. Higher composite cardiometabolic-nutritional risk scores were observed among women with pregnancy loss (P=0.026). Conclusion: Pregnancy loss clustered with adverse cardiometabolic and behavioral risk markers in a nationally representative population. These findings highlight pregnancy loss as a marker of broader metabolic vulnerability supporting the need for longitudinal studies and cardiometabolic profiling to inform preconception care and risk stratification.
Tsai, A. C.; Baguma, C.; Ahereza, P.; Ashaba, S.; Ayebare, P.; Bangsberg, D. R.; Comfort, A. B.; Gumisiriza, P.; Juliet, M.; Kananura, J.; Kiconco, A.; Kyokunda, V.; Lukwago, P.; Mushavi, R. S.; Namara, E. B.; Perkins, J. M.; Rasmussen, J. M.; Satinsky, E. N.; Siedner, M. J.; Tweheyo, B. M.; Kakuhikire, B.
Show abstract
BackgroundHIV-related stigma remains a primary barrier to the elimination of the HIV epidemic worldwide. No studies have examined long-term changes in the distribution of stigmatizing attitudes within populations. MethodsWe conducted a whole-population, open cohort study of adults in 8 villages in rural southwestern Uganda, with 5 biennial surveys spanning 2014-2024 (N=1,776 at baseline; 869 participated in all waves). We measured individual negative attitudes toward people with HIV ("public stigma") and perceptions of negative attitudes among others ("perceived stigma") using parallel 15-item scales. We estimated mean stigma scores, computed inequality measures at each wave, and decomposed inequality by sociodemographic characteristics. Leveraging the cohort design, we estimated intraclass correlation coefficients and rank-order stability over time. ResultsBoth public and perceived stigma declined substantially from baseline to endline and became concentrated in an increasingly smaller subgroup of the population. Theil decomposition failed to identify any stratifying variables that explained more than 3% of this variation: nearly all the inequality in HIV stigma occurred within population subgroups rather than between them. In longitudinal analyses, public stigma showed trait-like properties (intraclass correlation coefficient=0.35; 95% CI, 0.31-0.38) and meaningful rank-order stability (baseline-to-endline r=0.41). Perceived stigma showed no rank-order stability, no appreciable between-person variance, and universal convergence to low levels regardless of baseline. ConclusionsBoth public and perceived HIV stigma declined substantially in this rural Ugandan population, but remaining public stigma has become concentrated within a persistent minority. Sociodemographic profiling to target individuals who carry persistently negative attitudes toward people with HIV is unlikely to succeed.
Leonard, S. A.; Dysart, K.; Callahan, A.; Siadat, S.; Zhang, J.; Handley, S. C.; Huybrechts, K. F.; Igbinosa, I.; Bateman, B. T.
Show abstract
Background: Epic Cosmos is a relatively new centralized electronic health record dataset with high potential utility in perinatal epidemiologic research. Objectives: The study objectives were to develop replicable steps to create longitudinal, linked maternal-infant cohorts in Cosmos, assess completeness of key variables, evaluate potential selection bias with restrictions for longitudinal healthcare encounters, and provide an example epidemiologic analysis. Methods: We created maternal-infant cohorts by starting with live births during 2023-2024 recorded in the BirthFact data table and joining with additional data tables as needed. We selected and created variables for perinatal characteristics, common comorbidities, and routinely measured vital signs and laboratory values, and assessed variable completeness. We sequentially restricted the birth cohort for maternal-infant linkage and longitudinal healthcare from first-trimester prenatal care encounter through infant follow-up care within 12 weeks post-discharge from birth hospitalization. Finally, we conducted an example analysis of the association between high systolic blood pressure in the first trimester ([≥]140 mm Hg) and later onset of preeclampsia among those with chronic hypertension. Results: The total linked birth cohort included 2,624,186 pregnancies. Completeness was >90% for most variables assessed but was 77% for racial and ethnic group and 76% for body mass index at delivery. Characteristics of the cohort were similar to those reported for the entire United States birth population based on birth certificate data, including similar regional and racial-ethnic composition. Longitudinal cohort restriction requiring linked records from first trimester prenatal care through infant follow-up care reduced the cohort size to 509,148 pregnancies. However, restriction had minimal effects on cohort characteristics. In the example analysis, high systolic blood pressure was associated with increased risk of preeclampsia among those with chronic hypertension (aRR: 1.26; 95% CI: 1.22, 1.30). Conclusions: This study provides a rigorous and reproducible approach to creating longitudinal, linked maternal-infant cohorts in Epic Cosmos and the analytical findings suggest high data quality and representativeness.
Fonseca-Romero, P.; Smith, T.; Ahmed, S. M.; Jones, A.; Alekhina, N.; Brintz, B. J.; Dien Bard, J.; Chapin, K. C.; Cohen, D. M.; Festekjian, A.; Jackson, J. T.; Kanwar, N.; Larsen, C. D.; Leber, A. L.; Selvarangan, R.; Freedman, S.; Pavia, A. T.; Leung, D. T.
Show abstract
Background: Diarrheal illness in children leads to 3.5 million care visits and 200,000 hospitalizations annually in the US. Viruses are responsible for most pediatric diarrheal cases, yet limited guidance on distinguishing viral from bacterial etiologies complicates clinical decision-making, especially regarding empiric antibiotic use. Methods: We used clinical and qualitative molecular etiologic data from the Implementation of Molecular Diagnostics for Pediatric Acute Gastroenteritis (IMPACT) study to develop prediction models for viral etiology of diarrhea. We used conditional random forests to identify informative clinical and environmental predictors and evaluated model performance using logistic regression and random forests within a 5-fold cross-validation framework. We conducted external validation using the Alberta Provincial Pediatric Enteric Infection Team (APPETITE) dataset. Results: Variables predictive of viral etiology included younger age, non-bloody diarrhea, winter season, and presence of vomiting. External validation showed that an AUC of 0.82 can be achieved with a parsimonious 5-variable model, yielding a sensitivity of 0.92 and specificity of 0.55 Conclusion: Our results suggest that in North American healthcare settings, clinical prediction models can inform decision-making by identifying children with a high probability of viral diarrhea, improving diagnostic clarity, and reducing unnecessary testing and treatment.
Shinozaki, K.; Miura, F.
Show abstract
Background Human challenge trials provide a unique opportunity to quantify pathogen infectivity in terms of the probability of infection given an inoculated dose. However, between-pathogen comparisons are often distorted by individual heterogeneity in host susceptibility and by differences in background immunity across trial populations. We examined how dose-dependent infection risks differ across common respiratory viruses when such heterogeneity is explicitly incorporated. Methods We conducted a systematic review of human challenge trials for four respiratory viruses: respiratory syncytial virus (RSV), influenza virus, rhinovirus, and adenovirus. Using the extracted data, we fitted dose-response models under different distributional assumptions, allowing both continuous susceptibility variation and discrete immune fractions. We compared alternative heterogeneity models and evaluated pathogen-specific dose-response patterns using original and scaled dose metrics. Results All four viruses showed substantial heterogeneity in host susceptibility, and models assuming homogeneous susceptibility were unsupported. RSV and influenza were best described by models with a distinct immune or effectively non-susceptible subgroup, and the estimated immune proportions were approximately 40% and 25%, respectively. In contrast, rhinovirus and adenovirus were better explained by continuously distributed susceptibility, with little evidence of a fully immune subgroup. On a scaled dose axis, rhinovirus and adenovirus showed steeper increases in infection risk with dose than RSV and influenza. Conclusions The structure of susceptibility heterogeneity differs across common respiratory viruses, which in turn shapes dose-dependent infection risks. Incorporating this heterogeneity is essential for valid cross-pathogen comparison and for interpreting human challenge data in epidemiologic and public health contexts.
Shukla, N.; Bartington, S. E.; Hansell, A. L.; Lucas, T. C.
Show abstract
Background: In the absence of high-resolution response data, exposure-response modelling often relies on aggregated low-frequency exposure data, leading to loss of high-resolution information. Mixed Data Sampling (MIDAS) from econometrics offers an alternative but is limited due to its inability to make high-resolution predictions, inflexible likelihoods and penalised nonlinear functions, and limited visualization options. We propose a mixed-frequency Distributed Lag Non-linear Model (mf-DLNM) which can eliminate the need to aggregate exposure data in environmental epidemiology and provide high resolution predictions for time series studies. Methods: We evaluated the inference and predictive performance of the mf-DLNM. To evaluate its ability to estimate exposure-response relationships, we applied mf-DLNM and same-frequency (sf)-DLNM using data from the West Midlands, UK. Additionally, we compared the predictive performance of mf-DLNM with sf-DLNM and MIDAS across nine regions of England. As MIDAS cannot predict at the resolution of the predictor (daily), we compared the predictive performance of mf-DLNM and MIDAS at weekly resolution. To test the model's ability to predict high temporal resolution risk (daily), we compared sf-DLNM (with access to daily mortality counts) with mf-DLNM (with access only to weekly mortality counts). Results: In the West Midlands example, mf-DLNM performed comparably to sf-DLNM in estimating daily risk of temperature on respiratory mortality. Furthermore, mf-DLNM and MIDAS exhibited similar performance for weekly predictions. For high-resolution predictions, mf-DLNM and sf-DLNM showed nearly similar performance, despite mf-DLNM having access only to low-resolution response data. Conclusion: This mixed-frequency approach in environmental epidemiology overcomes the limitations of predicting health risks using aggregated exposure data and provides estimates of high-resolution outcomes in the absence of high-frequency health outcome datasets.
Shaw, S. Y. Y.; Mahar, A.; Bailey, K.; Payne, M.; Kindrachuk, J.; Kelly, C.; Friesen, K. J.; Bernstein, C. N.; Reimer, J.; Becker, M. L.; McClarty, L. M.; Stein, D.; Nickel, N. C.
Show abstract
Objectives: To examine COVID19 vaccine uptake among people diagnosed with sexually transmitted and bloodborne infections (STBBI) and reported methamphetamine users in Manitoba, Canada, during the acute phase of the COVID19 pandemic. Methods: We conducted a retrospective matched cohort study using linked population based administrative healthcare, laboratory, and vaccination databases in Manitoba. Individuals aged 16+ years with laboratory confirmed chlamydia/gonorrhea (CT/NG), syphilis, HIV, and/or documented methamphetamine use during the four years prior to March 1, 2020 were included in eight exposed cohorts. Each cohort was matched to unexposed comparators on age, sex, geographic region, and income quintile. The primary outcome was receipt of 2+ COVID19 vaccine doses between December 1, 2020 and March 31, 2022. Poisson regression models estimated adjusted rate ratios (aRRs) and 95% confidence intervals (95% CIs) for vaccine uptake. Results: Compared with matched comparators, most exposed cohorts were less likely to complete the COVID19 primary vaccine series. Individuals in the Syphilis Only (aRR: 0.87, 95% CI: 0.85 0.90), Syphilis Plus (aRR: 0.84, 95% CI: 0.81 0.86), CT/NG Only (aRR: 0.95, 95% CI: 0.94 0.96), CT/NG Plus (aRR: 0.82, 95% CI: 0.80 0.85), Methamphetamine Only (aRR: 0.78, 95% CI: 0.76 0.80), and Methamphetamine + STBBI cohorts (aRR: 0.74, 95% CI: 0.72 0.77) had significantly lower vaccine uptake. The HIV Only cohort did not differ significantly from matched comparators (aRR: 0.98, 95% CI: 0.95 1.01). Lower uptake was concentrated among individuals living in lower-income areas. Conclusions: People diagnosed with STBBI and methamphetamine users in Manitoba experienced significant inequities in COVID19 vaccine uptake, particularly those with STBBI coinfections and concurrent substance use. Integrated vaccination approaches linked with HIV, harm reduction, and addiction services may improve vaccine equity during future public health emergencies.
Mantena, S. D.; Johnson, A.; Schuetz, N.; Tolas, A.; Montalvo, S.; Delgado-SanMartin, J.; Ramirez Posada, M.; Du, L.; Zhang, S.; Huynh, A. D.; Oppezzo, M.; King, A. C.; Schmiedmayer, P.; Lawrie, A.; Rodriguez, F.; Ashley, E.; Kim, D. S.
Show abstract
Objective: Hispanic/Latinx populations in the U.S. experience higher rates of chronic disease linked to physical inactivity, yet digital health interventions remain largely inaccessible to more than 16 million Hispanic/Latinx adults with limited English proficiency. While large language models (LLMs) offer scalable personalization, their use in non-English behavioral coaching is unexplored. This study introduces MHC-Coach-ES, a Spanish-language LLM fine-tuned on the Transtheoretical Model (TTM) of behavior change. Materials and Methods: We fine-tuned Llama 3-70B-Instruct using a two-stage pipeline. First, the model was adapted to Spanish health and motivational language using a 2.21-million-token corpus. Second, it was instruction-tuned on 3,268 translated human written messages to align the model with the Transtheoretical Model (TTM) of Behavioral Change. We compared MHC-Coach-ES with Llama 3-70B-Instruct and translated human-expert messages using a forced-choice preference survey (N = 77) and blinded expert review (N = 2). Results: Spanish-speaking participants significantly preferred MHC-Coach-ES messages over translated human-expert messages (81% preference, P<0.001). Linguistic analysis showed that MHC-Coach-ES produced more temporally anchored messages than the base model (65% vs. 20%), while maintaining readability. In blinded evaluation, clinical experts rated MHC-Coach-ES higher for alignment with Transtheoretical Model stages than human-expert messages (4.83 vs. 4.38 out of 5). The base model also outperformed translated expert messages across preference and expert ratings. Conclusions: Generative AI can operationalize behavioral science frameworks in Spanish, offering a scalable approach to reducing health disparities. The strong performance of both MHC-Coach-ES and the base model highlights the promise of generative and personalized approaches over translation-based localization for theory-driven behavioral interventions.
Piorkowska, N. J.; Olejnik, A.; Ostromecki, A.; Kuliczkowski, W.; Mysiak, A.; Bil-Lula, I.
Show abstract
Background: Machine-learning models based on circulating biomarkers are increasingly used in cardiovascular research; however, model performance alone provides limited insight into how the predictive signal is distributed across features. We aimed to characterize the biomarker signal architecture of a machine-learning model distinguishing ST-elevation myocardial infarction (STEMI) from non-ST-elevation myocardial infarction (NSTEMI), with a focus on signal concentration, redundancy, and conditional complementarity. Methods: We conducted a structured secondary analysis of a previously established, leakage-controlled machine-learning framework (n = 152 patients). The BIOMARKERS feature-set variant (10 biomarkers) was evaluated using outer-fold cross-validation. Model structure was interrogated using (i) leave-one-biomarker-out analysis, (ii) pairwise leave-two-out analysis with pair-excess estimation, (iii) cumulative ablation of top-ranked biomarkers, and (iv) forward reconstruction of minimal biomarker panels. Uncertainty was assessed using bootstrap resampling across folds. Results: The full biomarker model achieved a mean ROC-AUC approaching 0.94. The predictive signal was highly non-uniform, with MMP-2 showing the largest single-feature contribution (mean {Delta}AUC {approx} 0.16). Pairwise analysis identified conditional complementarity between selected non-lipid biomarkers, particularly MMP-2 and EMMPRIN (pair {Delta}AUC {approx} 0.26; positive excess over single-feature effects), whereas lipid-related markers formed a highly correlated and largely redundant sub-cluster. Cumulative ablation demonstrated rapid performance collapse following removal of top-ranked biomarkers, consistent with structural signal concentration. Forward panel analysis showed that a compact subset of biomarkers (three features) achieved performance within ~0.01 ROC-AUC of the full model, indicating the presence of a minimal high-yield panel. Bootstrap confidence intervals suggested that small performance differences should be interpreted with caution. Conclusions: Predictive performance in this biomarker-based model arises from a structured and unevenly distributed signal architecture, characterized by a dominant core biomarker, conditionally complementary contributors, and a redundant lipid cluster. These findings highlight the importance of evaluating model structure, not only aggregate performance, and suggest that biomarker-based machine-learning systems may benefit from architecture-aware interpretation and simplification strategies.
Borges, M. C.; Urquijo, H.; Yang, Q.; van der Graaf, A.; McBride, N.; Haug, E. B.; Soares, A. G.; Clayton, G. C.; Bond, T. A.; Al Arab, M.; Horn, J.; Thomas, L.; Bhatta, L.; Asvold, B. O.; Magnus, M. C.; Evans, D. M.; Burden, C.; Birchenall, K.; Brumpton, B.; Gaunt, T. R.; Hart, E. C.; Kutalik, Z.; Lawlor, D. A.
Show abstract
Background and Aims Hypertension during pregnancy is a major cause of maternal and neonatal morbidity and mortality, yet the efficacy and safety of antihypertensive treatments in this setting remain uncertain. We evaluated the effects of antihypertensive drug targets on adverse pregnancy-related outcomes using genetic variants to instrument target perturbation. Methods We performed drug target Mendelian randomization to mimic pharmacological perturbation of targets from six commonly used antihypertensive drug classes, using data from up to 671,922 pregnant women. Genetic variants near drug target genes associated with systolic or diastolic blood pressure were selected as instruments. We estimated effects of target modulation on six primary and eight secondary pregnancy outcomes. Results Genetically instrumented downregulation of blood pressure through beta-blocker (BB) and calcium-channel blocker (CCB) targets, particularly ADRB1 and CACNB2, was associated with a reduced risk of hypertensive disorders of pregnancy, including preeclampsia. For example, CACNB2-instrumented lowering corresponded to a 7% (95% CI: 5-9%) reduction in preeclampsia risk per 1 mmHg decrease in blood pressure. For most other targets, estimates were directionally consistent but imprecise. Across additional outcomes, effects varied by target, with suggestive evidence for reduced risks of miscarriage, preterm birth, small-for-gestational-age birth, and labour induction, although these estimates were accompanied by substantial uncertainty. Conclusions These findings support a protective effect of BB and CCB targets on hypertensive disorders of pregnancy and highlight potential target-specific differences in safety. This work illustrates the value of Mendelian randomization in addressing clinical uncertainties where robust trial evidence is limited.
Payne, J. Y.; Rhodes, S.; Shoag, J.; Rothberg, M.; Le, P.; Cullen, J.; Hartman, H.
Show abstract
Background: Prostate cancer survival varies by stage at diagnosis, and Black men experience a disproportionate burden of advanced disease. We examined whether neighborhood deprivation, measured by Area Deprivation Index (ADI), contributes to racial differences in metastatic presentation. Methods: We conducted a population-based study of men diagnosed with prostate cancer in the Ohio Cancer Incidence Surveillance System from 1996 to 2016. The primary endpoint was distant-stage disease at diagnosis. Generalized additive models assessed nonlinear associations of ADI and diagnosis year with metastatic risk. Inverse probability of treatment weighting (IPTW) models estimated odds ratios comparing Black with White men after sequential adjustment for diagnosis year, age, insurance, and ADI. Results: Among 135,095 men, 18,690 were Black and 116,405 were White. Distant-stage disease occurred in 7.0% of Black men and 5.0% of White men. Black men had higher median ADI (60.9 vs. 47.3). Medicaid-insured men had the highest unadjusted odds of metastatic presentation (OR, 4.68; 95% CI, 4.13-5.31), exceeding uninsured men (OR, 2.91; 95% CI, 2.54-3.34). In IPTW models without age adjustment, the odds ratio decreased from 1.54 to 1.24 after adding insurance and ADI. In age-adjusted IPTW models, the odds ratio decreased from 1.79 to 1.41 after adding insurance and ADI. Generalized additive models showed increasing metastatic risk at higher ADI values and after 2008. Conclusions: Neighborhood deprivation and insurance-related access explained part, but not all, of the excess odds of metastatic diagnosis among Black men. Impact: Integrating ADI into cancer surveillance may improve identification of populations at risk for late-stage diagnosis.