Back

FACETS

Canadian Science Publishing

Preprints posted in the last 7 days, ranked by how well they match FACETS's content profile, based on 11 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.

1
Keeping human in the loop: A three-phase generative AI workflow for research integrity in data-intensive science.A methodological case study using elite Ethiopian distance-running data

Galko, P.; Yisamaw, A.; Haugen, T.; Seiler, S.

2026-05-29 sports medicine 10.64898/2026.05.29.26354013 medRxiv
Top 0.2%
0.8%
Show abstract

Background: Generative AI tools can support data-intensive research by writing code, drafting prose, searching analytical possibilities, and stress-testing claims. They can also produce false citations, drift between statistical specifications, and lose continuity across long investigations. This paper describes a practical workflow for using AI systems in empirical research while keeping discovery, verification, and accountability inspectable. Methods: We developed and applied a three-phase human-AI workflow to a case study of 14 elite Ethiopian distance runners. The dataset contained 22,605 GPS-segments collected across 97 consecutive days in late 2025, supplemented by venue and athlete metadata collected in the field. Phase 1 used an autonomous data-exploration tool to pre-filter the hypothesis space across five seeded research questions. Phase 2 used an AI system under direct human guidance to construct candidate findings into numerical claims, verification scripts, and draft text. Phase 3 used an independent AI system in an adversarial role to stress-test methods, statistics, prose, figures, and citations. The workflow was informed by Pearl's distinction between association, intervention, and counterfactual reasoning, with human judgement retained for research direction, interpretation, and final claims. Results: The workflow produced three empirical analyses and a documented correction process. The analyses estimated an altitude-to-sea-level pace correction of +0.10 min/km per 1,000 m at matched heart rate, showed why pooled altitude-surface regression was not identifiable within this venue system, documented method-dependence in heart-rate-based intensity classification, characterised within-venue route variation as a 64/36 path-fixed-to-trail-variable split with the Sululta label resolving into two functionally distinct sub-venues, and reframed the cohort's training through a 3x3x3 prescription lattice grounded in Ethiopian coaching practice. The adversarial phase identified several hallucinated citations, a terminology error between HC1 and cluster-robust standard errors, and several inconsistencies between prose, figures, and computed results. Verification scripts re-derived nearly all numerical claims from the cleaned lap-level data. Conclusions: The case study shows how researchers can organise AI-assisted empirical work so that candidate discovery, claim construction, independent stress-testing, and final accountability remain separated. The workflow did not remove the need for domain expertise or human judgement. Its value was in making the route from candidate finding to manuscript claim explicit, reproducible, and open to challenge. Trial registration: Not applicable.

2
Development and validation of a multiplexed quantitative PCR assay for clinical detection and surveillance of Oropouche virus

Stachler, E.; McMahon, K.; Gopal, N.; Knoll, H.; Baillargeon, K. R.; Mora, A. C.; Wondrash, H. A.; Sullivan, E. M.; Rush, S.; Gratalo, D.; Ozonoff, A.; Sabeti, P. C.; Springer, M.

2026-05-28 infectious diseases 10.64898/2026.05.26.26354109 medRxiv
Top 1%
0.2%
Show abstract

Background Oropouche virus (OROV) is an emerging vector-borne virus with rapidly expanding geographic range, increasing case counts, and growing evidence of severe outcomes including neuroinvasive disease and vertical transmission. Because OROV infection presents with nonspecific febrile illness that overlaps clinically with other viruses including dengue, zika, and chikungunya, accurate molecular diagnostics are essential for patient care and surveillance. Yet existing assays rely on single genomic targets and are vulnerable to detection failure as the virus evolves and reassorts. Methodology/Principal Findings To support diagnostic capacity, we developed and clinically validated a multiplexed qPCR assay targeting three regions of the OROV S segment, incorporating redundancy to preserve sensitivity across viral diversity while enabling robust clinical interpretation. The multiplex also includes an assay targeting RNaseP as an internal sample control to ensure adequate sample processing. We evaluated assay performance using both historical and contemporary OROV strains and validated the assay on contrived serum, plasma, and cerebrospinal fluid samples, assessing linearity, limit of detection (LOD), accuracy, specificity, precision, and sample stability. The assay met or exceeded all predefined acceptance criteria for clinical testing and achieved an LOD as low as 6 copies per reaction for contemporary outbreak strains. We further implemented a logic-based interpretation matrix that reduced false-positive risk while maintaining sensitivity near the analytical LOD. Conclusions/Significance Our assay sensitively and specifically detects OROV RNA in serum, plasma, and cerebrospinal fluid while incorporating safeguards against viral evolution and reassortment. The assay has been approved for use by CLIA at Nexus Medical Labs in 49 U.S. states, expanding access to timely OROV diagnostics in the United States and providing a durable framework for molecular detection of reassorting, rapidly evolving viruses as OROV continues to spread into new regions.

3
Detection of Anti-H5 Antibodies in People with Exposure to Wild Birds in Northern Canada

Wallace, H. L.; Hiebert, M.; Hunter, M.; Halbrook, M.; Harrigan, R. J.; Bogoch, I. I.; Rimoin, A. W.; Shaw, S. Y.; Larcombe, L.; Orr, P. H.; Kindrachuk, J.

2026-05-26 infectious diseases 10.64898/2026.05.24.26353994 medRxiv
Top 2%
0.1%
Show abstract

Using a commercially available H5 serology assay, we identified a 7.4% (n=5/68) anti-H5 seroreactivity rate among hunters in Northern Canada. All participants reported close contact with wild birds.

4
Cleaner Air for Lower Cardiometabolic Risk: protocol for a double-blind, randomized, sham-controlled trial of HEPA filtration in adults with prediabetes.

Wittkopp, S.; Asachi, P.; Kazatsker, F.; Aleman, J. O.; Gordon, T.; Brook, R.; Thorpe, L.; Newman, J. D.

2026-06-01 endocrinology 10.64898/2026.05.29.26354420 medRxiv
Top 2%
0.1%
Show abstract

Introduction Air pollution is a leading driver of cardiovascular disease with a growing body of literature implicating this in worse glucose homeostasis. Increases in fine particulate matter air pollution (PM2.5) are associated with increased blood glucose and hemoglobin A1c across the glycemic spectrum from normoglycemia to prediabetes to all forms of diabetes. Despite strong evidence for positive associations of PM2.5 with dysglycemia, it remains unknown if reducing air pollution exposure through air filtration can effect improvements in glucose. This study aims to test the hypothesis that short-term, in-home air pollution reduction using high efficiency particulate air (HEPA) filtration will improve blood sugar in adults with prediabetes. Methods and analysis This trial is a randomized, double-blind, sham-controlled trial of the effects of lowering air pollution exposure using HEPA filtration on cardiometabolic health in adults with prediabetes living in the New York City area. Participants will be randomly assigned to use bedroom air cleaners, or sham air cleaners, while measuring PM2.5 continuously for 1 month. The primary outcomes will be continuous glucose monitoring metrics measured before and after HEPA air filtration. Exploratory outcomes will include insulin resistance measures, serum biomarkers and transcriptomics measured before and after HEPA intervention. We will quantify effects of HEPA filtration with models using treatment arm (true versus sham filtration) as the independent variable. Secondary analyses will model continuous measures of PM2.5 as the independent variable. Ethics and Dissemination This study has undergone peer review; and the work was supported by Grant 2023-0214 from the Doris Duke Foundation, who had no other role in study design or implementation. The study was registered in ClinicalTrials.gov (NCT05994937) prior to recruitment. Clinical Trials Clinical Trials NCT05994937; https://clinicaltrials.gov/study/NCT05994937

5
Heterogeneity in susceptibility among humans to common respiratory viral infections

Shinozaki, K.; Miura, F.

2026-06-01 infectious diseases 10.64898/2026.05.29.26353692 medRxiv
Top 3%
0.1%
Show abstract

Background Human challenge trials provide a unique opportunity to quantify pathogen infectivity in terms of the probability of infection given an inoculated dose. However, between-pathogen comparisons are often distorted by individual heterogeneity in host susceptibility and by differences in background immunity across trial populations. We examined how dose-dependent infection risks differ across common respiratory viruses when such heterogeneity is explicitly incorporated. Methods We conducted a systematic review of human challenge trials for four respiratory viruses: respiratory syncytial virus (RSV), influenza virus, rhinovirus, and adenovirus. Using the extracted data, we fitted dose-response models under different distributional assumptions, allowing both continuous susceptibility variation and discrete immune fractions. We compared alternative heterogeneity models and evaluated pathogen-specific dose-response patterns using original and scaled dose metrics. Results All four viruses showed substantial heterogeneity in host susceptibility, and models assuming homogeneous susceptibility were unsupported. RSV and influenza were best described by models with a distinct immune or effectively non-susceptible subgroup, and the estimated immune proportions were approximately 40% and 25%, respectively. In contrast, rhinovirus and adenovirus were better explained by continuously distributed susceptibility, with little evidence of a fully immune subgroup. On a scaled dose axis, rhinovirus and adenovirus showed steeper increases in infection risk with dose than RSV and influenza. Conclusions The structure of susceptibility heterogeneity differs across common respiratory viruses, which in turn shapes dose-dependent infection risks. Incorporating this heterogeneity is essential for valid cross-pathogen comparison and for interpreting human challenge data in epidemiologic and public health contexts.

6
Cross-Sectional Measures of Periodontal Severity: Distortion from Severity-Dependent Tooth Loss

McCormick, K. M.; Amarasena, N.; Guzzo, G.; Nath, S.; Jamieson, L.

2026-05-30 dentistry and oral medicine 10.64898/2026.05.27.26354277 medRxiv
Top 3%
0.0%
Show abstract

Aim: Cross-sectional summaries of periodontitis based on clinical attachment loss (CAL) are, by definition, conditioned on surviving teeth. Because the most severely affected teeth are more likely to have been lost, these measures may underestimate cumulative disease burden and show an artificial flattening (attenuation) of severity with age. We hypothesised that measures more sensitive to severe attachment loss would show greater attenuation at older ages than measures defined across a broader range of sites. Materials and Methods: Using nationally representative data from adults aged 30+ years in NHANES 2009-2014, we examined age-specific trajectories across multiple continuous measures of periodontal severity and assessed whether divergence between measures followed the pattern predicted under severity-dependent tooth loss. Results: The proportion of observable sites declined from 93% at ages 30-34 to 68% at 80+ years, establishing the structural basis for the divergence observed across severity measures. All severity measures showed nonlinear attenuation with age, with distortion increasing with severity threshold. Higher-threshold measures exhibited the greatest attenuation, while lower-threshold measures showed more stable trajectories. Conclusions: Cross-sectional summaries of periodontitis reflect disease among surviving teeth rather than cumulative damage across teeth originally at risk. Attenuation at older ages is consistent with depletion of the most severely affected teeth rather than biological slowing. Distortion varies by measure, with higher-threshold and mean-based indices most affected, whereas the CAL 3+ mm threshold provides a more stable basis for age comparisons.

7
Changes in Frequency of Resuscitation Among the Oldest Old Following Japans End-of-Life Care Guideline Revision: A Population-Level Interrupted Time-Series Analysis Using National Open Claims Data

Sakai, M.; Nakayama, T.

2026-05-30 health policy 10.64898/2026.05.28.26354307 medRxiv
Top 3%
0.0%
Show abstract

Resuscitation in the oldest old at the end of life is associated with potential harm, raising concerns about misalignment with patients goals of care. This study aimed to elucidate changes in the use of resuscitation among the oldest old in Japan following the revision of the national guideline on end-of-life care which explicitly incorporates the concept of advance care planning. We conducted a repeated cross-sectional study using the National Database of Health Insurance Claims Open Data, including adults aged [≥]85 years, from April 2014 to March 2024. The annual number of resuscitation procedures per 100,000 individuals aged [≥]85 years was used as the measure of frequency. Resuscitation included closed-chest cardiopulmonary resuscitation (CPR) and endotracheal intubation. Interrupted time series analysis was used to examine changes following the 2018 revision of the national end-of-life care guideline. The frequencies of CPR and endotracheal intubation declined before 2018 (CPR: age 85-89, -68.4 [-87.9 to -48.8]; age [≥]90, -106.7 [-131.5 to -82.0]; intubation: age 85-89, -57.5 [-71.8 to -43.2]; age [≥]90, -69.5 [-80.7 to -58.3]), but the decline attenuated thereafter (CPR: age 85-89, +56.2 [28.0 to 84.5]; age [≥]90, +84.1 [50.7 to 117.6]; intubation: age 85-89, +36.6 [8.5 to 64.7]; age [≥]90, +38.3 [23.8 to 52.8]). These findings provide insight into the changes in resuscitation trends following policy interventions supporting end-of-life decision-making. Further studies are needed to better understand the mechanisms underlying this change.

8
Compatibility of National Food Composition Databases with USDA FoodData Central: A Seven-Country LLM-Based Analysis

Nakagawa, S.; Yamamoto, A.

2026-06-01 nutrition 10.64898/2026.05.23.26353942 medRxiv
Top 3%
0.0%
Show abstract

To evaluate the international interoperability of food composition databases, we assessed the compatibility of seven national food composition tables with USDA FoodData Central (FDC) using the LLM-based matching method reported previously (Nakagawa and Yamamoto, 2026). Databases from four English-speaking countries (Canada, United Kingdom, Australia, and New Zealand), South Korea, and Japan were compared with 8,158 USDA FDC entries (SR Legacy and Foundation Foods, excluding Survey/FNDDS). Match rates varied by country (62.0-89.7%) and food category. After excluding six USDA categories unsuitable for cross-national comparison, 45.2% of the remaining 6,290 entries were not matched by any country. Canada showed the highest concordance, reflecting shared North American food supply. Japan and South Korea showed similar low coverage for vegetables and spices. These findings suggest that while USDA FDC represents a practical foundation for a globally comprehensive food composition database given its breadth, systematic incorporation of country-specific foods and classification schemes will be necessary to achieve true international interoperability.

9
Estimating Lifetime Periodontal Burden Under Informative Tooth Loss

McCormick, K. M.; Amarasena, N.; Guzzo, G.

2026-05-30 dentistry and oral medicine 10.64898/2026.05.27.26354300 medRxiv
Top 3%
0.0%
Show abstract

Background: Periodontitis is defined by cumulative, irreversible tissue destruction, yet population-based measurement typically relies on cross-sectional indicators derived from retained teeth. Destruction that occurred earlier in life, particularly disease severe enough to result in tooth loss, is structurally excluded from these measures, potentially leading to systematic underestimation of lifetime periodontal burden. Objective: To develop and evaluate a measurement framework that estimates lifetime periodontal burden from cross-sectional data by explicitly incorporating informative tooth loss under etiological uncertainty. Methods: Data were drawn from 10,324 adults aged [≥]30 years participating in the 20090-2016 National Health and Nutrition Examination Survey (NHANES) who completed full-mouth periodontal examination and glycated hemoglobin (HbA1c) testing. Lifetime periodontal burden was estimated by combining observed clinical attachment loss in retained teeth with probabilistic contributions from missing teeth, using three alternative age-stratified attribution schedules derived from epidemiological studies of periodontal extraction. Performance was compared with conventional measures of periodontal severity and extent using distributional analyses, correlations with HbA1c, discrimination of diabetes status, and relative importance analysis. Age-adjusted models were treated as sensitivity analyses. Results: Estimated lifetime periodontal burden exhibited strong, monotonic age gradients across glycemic categories, in contrast to more attenuated patterns observed for severity and extent. Across attribution schedules, lifetime burden showed stronger correlations with HbA1c ({rho} = 0.30-0.32) than conventional measures. In multivariable models including all indices, lifetime burden retained an independent association with HbA1c, whereas severity and extent contributed little unique information. Discriminative performance for diabetes status was consistently higher for lifetime burden than for conventional measures and remained stable across attribution schedules. Conclusions: Lifetime periodontal burden can be estimated from cross-sectional data by explicitly modelling informative tooth loss rather than restricting measurement to retained teeth. Incorporating historical tissue loss under uncertainty yields a more coherent representation of cumulative periodontal destruction than snapshot-based measures and provides a methodological basis for life-course-oriented periodontal epidemiology.

10
Using Bayesian Evidence Synthesis to estimate the number of sex workers in the United Kingdom

Long, H.; Gada, L.; Murray, L.; Laurence, T.; Hayward, A.; Finnie, T.

2026-05-26 public and global health 10.64898/2026.05.21.26353767 medRxiv
Top 3%
0.0%
Show abstract

Sex work is diverse and includes a broad range of people and settings. Over the last thirty years, a large proportion of public health emergencies of international concern (PHEIC) have involved infections transmitted through sexual or close contact and in sexual networks (WHO 2024). Sex workers can face increased disadvantage in relation to these public health emergencies. Given the significant health inequalities sex workers can face, they should be eligible to receive targeted and tailored health support to reduce health protection risks (Hester 2019; Jeal and Salisbury 2004a). However, they are often not explicitly eligible for targeted and tailored support due to a lack of information on incidence, prevalence of disease, and even more basic data such as reliable estimates of the number of sex workers in the UK. Accordingly, the aim of this paper is to determine a population size estimate, with uncertainty, that is more robust than those currently available. In this study, we apply Bayesian Evidence Synthesis to bring together historic estimation efforts with recent ONS National Population Estimates and Genito-Urinary Medicine Clinics Attendance Data (GUMCAD) from the UK Health Security Agency (UKHSA). A key feature of our model is the embedding of uncertainty from each input study in model priors, hence propagating it through to our final estimate. The Bayesian evidence synthesis model estimated a total of 84,000 sex workers in the United Kingdom (95% credible interval: 49,000-130,000), representing 0.121% of the current UK population.

11
Increasing frequency of secondary dengue infections in sequential outbreaks (2016-2024). Clinical impact and diagnostic challenges.

Espindola, S. L.; Pereson, M. J.; Lema, J. M.; Kachuk, A.; Carballo, G.; Aloisi, N.; Badano, M. N.; Miretti, M.; Di Lello, F. A.; Bare, P. C.

2026-06-01 infectious diseases 10.64898/2026.05.29.26354405 medRxiv
Top 4%
0.0%
Show abstract

Successive dengue virus (DENV) outbreaks can progressively reshape population immunity influencing disease expression and diagnostic performance. Objectives The aim was to evaluate the impact of secondary infections across sequential outbreaks on clinical severity, serotype dynamics and diagnostic concordance. Methods This retrospective study analyzed 976 febrile-stage samples from three sequential outbreaks in Misiones, Argentina. For serotyping and clinical analyses, 869 viremic samples confirmed by at least one direct method were included (2016: n=512; 2019: n=148; 2024: n=209). Additionally, 318 samples, including 107 non-viremic cases, were used to compare NS1 rapid diagnostic tests (NS1 Ag) and RT-PCR. Viral serotyping and clinical and laboratory markers of disease severity were evaluated. Results Secondary infections increased from 31.05% (2016) to 43.24% (2019) and 53.87% (2024) (p<0.0010). Serotype distribution shifted from DENV-1 predominance in 2016 (95.12%), DENV-1/DENV-4 co-circulation in 2019 (60.71%/39.29%), and DENV-2 predominance in 2024 (97.60%). Secondary infections were associated with more severe disease manifestations, particularly in 2024, with higher hematocrit (p=0.0120) and hemoglobin (p=0.0080), lower white blood cells (p=0.020) and platelet counts (p=0.0030), and elevated AST (p=0.0007) and ALT (p=0.0130). Concordance between NS1 Ag and RT-PCR was lower in secondary infections (k=0.457 vs k=0.759, p=0.0013). Conclusions The rising frequency of secondary infections may affect both clinical severity and diagnostic performance during outbreaks. The clinical impact was more evident in 2024, likely associated with the introduction of a new serotype. These findings highlight the need for optimized surveillance and diagnostic strategies to improve case detection and patient management during epidemics.

12
Inferring Sexual Network Bridging Using Genomics: A Simulation Study

Kline, M. C.; Helekal, D.; Oliveira Roster, K. I.; Grad, Y.

2026-05-26 infectious diseases 10.64898/2026.05.24.26353967 medRxiv
Top 4%
0.0%
Show abstract

The dynamics of sexually transmitted infections involve interconnected transmission networks, including men who have sex with men and heterosexual populations. Understanding the extent of bridging between these networks can inform surveillance, guide interventions, and aid in the interpretation of their impact, but methods for quantifying bridging have been lacking. Here, we addressed whether pathogen genomics tools, successfully used to reconstruct transmission in other contexts, could accurately infer sexual network bridging. Based on simulations of gonorrhea spread, we evaluated phylodynamic bridging metrics inferred by ancestral state reconstruction under a range of sampling schemes, from comprehensive to sparse. These metrics differentiated sexual network structures even with biased sampling schemes, but accuracy depended on the sampling scheme and density: phylodynamic bridging estimates using sequences from all detected infections for one network configuration were on average 6.9% above the true value, whereas estimates from 5% of infections in symptomatic men with many partners were on average >1000% above the true value. These results suggest routine overestimation of bridging from unadjusted inferences from genomics data and provide context for interpreting existing genomic surveillance data and targeted studies.

13
Optical coherence tomography as a biomarker for frontotemporal dementia: a systematic review & meta-analysis

Wang, E.; Kohli, A.; Taha, H. B.

2026-05-27 neurology 10.64898/2026.05.19.26353366 medRxiv
Top 5%
0.0%
Show abstract

Background: Frontotemporal dementia (FTD) lacks widely accessible disease-specific biomarkers. Optical coherence tomography (OCT) and OCT angiography (OCTA) may provide non-invasive measures of retinal changes associated with neurodegeneration. We conducted a systematic review and meta-analysis evaluating retinal biomarkers in FTD compared with Alzheimer disease (AD) and controls. Methods: A systematic search of PubMed and Embase was conducted through April 25, 2026 according to PRISMA guidelines. Studies evaluating OCT/OCTA biomarkers in FTD with comparator groups were included. Inverse weighted random-effects models, publication bias assessments, and meta-regressions were performed. Results: Ten studies involving 139 individuals with FTD, 87 with AD, 29 with mild cognitive impairment, 14 with TDP-43 proteinopathy, 5 with tauopathy, and 255 controls were included in the systematic review; five studies were eligible for meta-analysis. Compared with AD, individuals with FTD demonstrated significantly thinner retinal nerve fiber layer (RNFL) thickness (SMD = -0.61, 95% CI -0.98, -0.24). Compared with controls, individuals with FTD exhibited significantly thinner ganglion cell layer-inner plexiform layer (GCL-IPL) thickness (SMD = -0.55, 95% CI -1.02, -0.08), whereas pooled analyses across multiple retinal biomarkers were non-significant (SMD = -0.19, 95% CI -0.52, 0.14). RNFL thickness correlated negatively with female % in FTD and positively with age in both AD and controls. Conclusions: Individuals with FTD exhibit lower RNFL thickness than AD and lower GCL-IPL thickness than controls, suggesting retinal alterations may reflect neurodegeneration. However, larger longitudinal studies with standardized OCT/OCTA protocols are needed to determine the diagnostic and prognostic utility of retinal biomarkers in FTD

14
Vaginal Antisepsis for Major Gynecologic Surgeries Using Chlorhexidine Gluconate versus Povidone Iodine: A Systematic Review and Meta-Analysis

Dias, Y.; Gebrekidan, F.; Lowder, J.; Sutcliffe, S.; Yaeger, L.

2026-05-27 obstetrics and gynecology 10.64898/2026.05.26.26353429 medRxiv
Top 5%
0.0%
Show abstract

ABSTRACT OBJECTIVE: We performed a systematic review and meta-analysis (SRMA) of post-surgical outcomes, comparing chlorhexidine gluconate (CHG) versus povidone iodine (PI) for vaginal antisepsis of major gynecologic procedures. DATA SOURCES: Ovid Medline, Embase, Scopus, Embase, Cochrane, and Clinicaltrials.gov were searched between 1986 and December 2023, for studies comparing CHG with PI for vaginal antisepsis of major gynecologic operations. STUDY ELIGIBILITY CRITERIA: We included Randomized Controlled Trials (RCTs) and non-RCTs comparing CHG to PI for vaginal antisepsis of major gynecologic operations. The primary outcome was surgical site infections (SSIs) and the secondary outcome was urinary tract infections (UTIs) and vaginal irritation. METHODS: Summary estimates were calculated by fixed effects models when I2 [&le;] 25% and by random effects models when I2 > 25%. Statistical analysis was performed using RevMan 5.4.1. The protocol for this systematic review was registered on PROSPERO (ID CRD42022378101). RESULTS: Nine studies met the inclusion criteria, four of which were randomized controlled trials (RCTs). 9538 patients were included, 4300 (45%) of whom were allocated to CHG and 5238 (55%) to PI. No statistically significant difference in SSI incidence was found for vaginal antisepsis with CHG versus PI in pooled analyses (n= 9538 patients; RR 1.20; 95% CI 0.92-1.57; I2 =0%). In contrast, a significantly higher risk of UTIs was observed for vaginal antisepsis with CHG than with PI (n=6061 patients; RR 1.48 95% CI 1.03-2.14; I2 = 0%). CONCLUSION: In our SRMA, there were no significant differences in SSI risk when either CHG or PI was utilized for antiseptic vaginal preparation. Interestingly, vaginal antisepsis with PI was associated with a lower incidence of post-operative UTIs following major gynecologic surgery. Our findings support current guidelines that form of vaginal antisepsis can be used for SSI prevention. They also suggest that PI may result in fewer postoperative UTIs but further randomized studies are needed to support these findings. Key words: surgical site infection, surgical wound infection, urinary tract infection, urogynecologic surgery, Chlorhexidine, Povidone Iodine, surgical antiseptic,

15
An ECG foundation model for generalizable cardiac function prediction across the lifespan

Yang, Y.; Peracchio, L.; Mayourian, J.; Miller, T.; La Cava, W.

2026-05-27 health informatics 10.64898/2026.05.26.26354128 medRxiv
Top 5%
0.0%
Show abstract

Background Artificial intelligence-enhanced electrocardiography (AI-ECG) enables scalable, low-cost cardiac dysfunction screening, but existing models are annotation-intensive and predominantly adult-derived, leaving paediatric generalizability uncertain. Paediatric cohorts exhibit highly variable cardiac morphology and function compared to adults, which may be useful for learning generalizable AI-ECG models. Methods We pretrained ECG-Fyler on a predominantly paediatric, all-age cohort at Boston Children's Hospital (1992-2023), annotated with a cardiology-specific coding system (Fyler codes), and evaluated it on assessments from echocardiography (echo) and cardiac magnetic resonance (CMR) studies. We validated on an external adult cohort from Columbia University Irving Medical Center. Performance was benchmarked against several AI-ECG foundation models by AUROC across age groups, lesion types, and limited-data scenarios. Findings The pretraining cohort comprised 782,138 ECGs from 255,271 patients (median age: 10.9 years, IQR: [2.8-16.8]). Internal evaluation included 178,495 ECG-echo pairs (median age: 10.9 [3.7-17.0]) and 8,584 ECG-CMR pairs (median age: 20.7 [15.6-29.6]). External validation included 82,543 ECG-echo pairs from adults (median age: 64.0 [52.0-74.0]). ECG-Fyler improved AUROC across biventricular dysfunction and dilation tasks, with the largest gains in low-data settings. In internal validation, ECG-Fyler detected low left ventricular ejection fraction (LVEF [&le;] 40%) from only 100 fine-tuning samples (AUROC: 0.80, 95% CI: [0.78-0.80]), outperforming other models (AUROC < 0.65) and improving with additional fine-tuning (AUROC: 0.94 [0.93-0.94]). Similar improvements were observed for CMR-derived LVEF, RVEF, and ventricular dilation. In external validation on adults, ECG-Fyler exhibited an AUROC of 0.83 (CI: [0.82-0.85]) for LVEF [&le;] 40%. After fine-tuning on less than 10% of external data, LVEF [&le;] 45% performance (AUROC: 0.87 [0.86-0.88]) outperformed a fully trained, site-specific prior model (AUROC: 0.85 [0.84-0.87]). Interpretation Pretraining on richly annotated, paediatric-dominant ECGs yields models that transfer efficiently across institutions and ages, supporting AI-ECG screening and triage when labels or imaging access are limited. Funding National Institutes of Health (R01LM012973); Kostin Innovation Fund, Boston Children's Hospital

16
Patient Versus Prediction-Level Evaluation of a Dynamic Clinical Prediction Model of Sepsis

Tuttle, M.; Maas, C. C. H. M.; An, J.; Wessler, B. S.; Harvey, W. F.; Selker, H. P.; van Klaveren, D.; Kent, D. M.

2026-05-27 health systems and quality improvement 10.64898/2026.05.26.26354141 medRxiv
Top 5%
0.0%
Show abstract

The Epic Sepsis Model version 2 (ESMv2) is a prediction model embedded into the electronic medical record used to warn clinicians which hospitalized patients are at risk for sepsis. We conducted a retrospective cohort study of 31,951 hospitalizations of 25,760 patients to compare analyses conducted at the commonly used patient-level (where a maximum prediction prior to the onset of sepsis is used to measure performance) vs novel prediction-level (where each prediction is used to measure performance). Sepsis, defined by the Sepsis 3 criteria occurred during 1,049 hospitalizations (3.3%). Patient-level analyses suggested excellent discrimination AUC 0.86; [IQR 0.85, 0.87], whereas prediction-level analyses demonstrated lower performance AUC 0.62; [IQR 0.57, 0.65]. Low estimates of the positive predictive value (14.5% at the patient level vs 4% at the prediction level) imply a high number of false alerts. Common evaluation approaches may overstate the performance of dynamic prediction models and mislead clinical decision-making.

17
Distinguishing Age-specific Patterns in Comorbidities of Obstructive Sleep Apnea Using Real-World Data

Goodman, M. O.; Alex, R. M.; Sands, S. A.; Azarbarzin, A.; Batool-anwar, S.; Pavlova, M. K.; Epstein, L. J.; Redline, S.; Cade, B. E.

2026-05-28 epidemiology 10.64898/2026.05.20.26352336 medRxiv
Top 5%
0.0%
Show abstract

Obstructive sleep apnea (OSA) is associated with a wide range of comorbidities, but the extent to which these follow predictable, age-dependent patterns is not well understood. Identifying such patterns could provide insight into OSA heterogeneity and its links to physiological measures of OSA. We trained age-dependent topic models (ATM) on longitudinal electronic health records from 36,426 patients with OSA in the Mass General Brigham Biobank. ATM organizes incident diagnoses into distinct comorbidity "topics," whose age-specific disease loadings represent predictive patterns linking related diagnoses across the life course. We applied the trained model to compute individual-level topic scores in independent data: a cohort of 11,689 OSA cases and 22,695 matched controls, and a cohort of 6,220 patients with polysomnography (PSG)-derived physiological measures. We identified 19 distinct age-dependent comorbidity profiles, all significantly associated with OSA case status (FDR-adjusted p<0.05). Topics reflected recognizable clusters including metabolic, neuropsychiatric, and immune-mediated conditions, and several were distinguished by age-of-onset of key comorbidities, such as early- vs late-onset asthma. Seventeen of the 19 topics were significantly associated with at least one of 13 PSG-derived physiological measures, including associations between cardiometabolic topics and the apnea-hypopnea index, sleep apnea specific hypoxic burden, and respiratory event-specific heart rate burden. These findings indicate that age-dependent comorbidity patterns distinguish meaningful OSA subtypes with differing prognoses and endophenotype associations. ATM offers insight into complex OSA comorbidity and suggests that age-informed, topic-based stratification may improve individualized risk assessment, interpretation of PSG findings, and targeting of clinical interventions.

18
Morphological feature remodeling of intracranial arteries in the context of inflammation and HIV-associated cognitive impairment

Hoang, N.; Yang, H.; Uddin, M. N.; Zhong, J.; Faiyaz, A.; Singh, M. V.; Boodoo, Z. D.; Sutton, K. R.; Wang, H. Z.; Sahin, B.; Khan, M. W.; Weber, M. T.; Yuan, C.; Chen, L.; Schifitto, G.

2026-05-27 hiv aids 10.64898/2026.05.19.26353071 medRxiv
Top 5%
0.0%
Show abstract

Background: Despite the success of combination antiretroviral therapy (cART), vascular comorbidities, including cerebrovascular disease, are more prominent in people living with HIV (PLWH) compared to people without HIV (PWOH). However, quantitative assessments of cerebrovascular morphometry and their associations with cognitive outcomes in the context of HIV are still limited. In this study, we explore this missing link. Methods: Magnetic Resonance Angiography (MRA) data, blood markers, and neurocognitive assessments were collected from 73 PWOH subjects (male: 57, female: 16; age: 53 {+/-} 16) and 99 PLWH subjects (male: 66, female: 30, age: 53 {+/-} 11). Vessel morphometric features were quantified using intraCranial Artery Feature Extraction (iCafe) to investigate associations between vessel morphometry, markers of monocytes, endothelial cell activation, and cognitive performance. Results: HIV status predicted a lower total number of branches ({beta} = -0.224, p = 0.001, d = -0.517) and shorter total distal length ({beta} = -0.173, p = 0.021, d = -0.370) with a moderate effect size. Total branch number was found to be negatively associated with plasma levels of monocyte markers (sCD14: r = -0.167, p = 0.033; sCD163: r = -0.157, p = 0.045) and positively correlated with white matter cerebral blood flow (r = 0.550; p [&le;] 0.05). HIV status was the strongest predictor of overall cognitive performance in ANCOVA model ({beta} = -0.219, p = 0.006, d = -0.453). Conclusions: Our results suggest that cognitive impairment in PLWH is associated with vessel morphology metrics. Monocyte immune activation may contribute to changes in vessel morphology.

19
Can Large Language Models Diagnose Primary Immunodeficiency from Patient-Described Symptoms?

Reteig, L. C.; Woloshin, S.; Maglione, P. J.; Farmer, J. R.; Ong, M.-S.

2026-05-27 allergy and immunology 10.64898/2026.05.26.26353818 medRxiv
Top 5%
0.0%
Show abstract

Patients with primary immunodeficiency (PID) often face prolonged diagnostic delays and may increasingly turn to large language models (LLMs) to interpret their symptoms during this period. We evaluated whether an LLM could recognize PID from symptom descriptions derived from interviews with 21 PID patients. In a prior study, we showed that GPT-4o identified PID in 96% of cases when prompted with physician-written patient histories (Rider et al., JACI, 2024). Here, when prompted with symptom descriptions in patients' own words, GPT-5 identified PID in only 7 cases (33%), although it more broadly suggested immune system issues in 18 cases (81%). The gap between these findings indicates that LLMs are sensitive to the language and framing of symptom descriptions, performing substantially worse when patients describe their own symptoms in everyday language than when clinicians summarize patient histories in structured medical terms. This study underscores the need to carefully evaluate how LLMs are used in patient-facing applications.

20
ERBB4 deficiency promotes atrial myopathy underlying the atrial fibrillation substrate

Yamaguchi, N.; Santucci, J.; Hong, S. J.; Ferrena, A.; Schlamp, F.; Willett, D.; Casdin, C. J.; Park, P. S.; Lin, X.; Xiao, J.; Hall, S.; Barnard, J.; Achter, J.; Kanhert, K.; Lundby, A.; Chung, M. K.; Van Wagoner, D. R.; Park, D. S.

2026-05-27 cardiovascular medicine 10.64898/2026.05.26.26354173 medRxiv
Top 5%
0.0%
Show abstract

Background Atrial fibrillation (AF) is a leading cause of stroke, cardiovascular morbidity, and mortality. Atrial myopathy, characterized by progressive metabolic, electrical, and structural changes, creates the arrhythmogenic substrate that drives AF. Defining the key drivers of atrial myopathic processes is essential for targeted therapies that can mitigate AF progression. Here we explore how reduced ERBB4 expression contributes to the development of left atrial myopathy. Methods We analyzed the Cleveland Clinic Biobank to compare left atrial ERBB4 levels in patients grouped by AF diagnosis. To investigate the impact of reduced ERBB4 levels on atrial tissue substrate, we created mouse models of cardiac-specific Erbb4 deficiency using Mlc2a (myosin light chain 2a)-Cre. Comprehensive physiological assessments were performed. Transcriptomic analyses of the left atrium were performed in an Erbb4 haploinsufficient mouse model and compared with human atrial datasets. Molecular validation of key dysregulated pathways was performed. Results We found that left atrial ERBB4 levels are reduced in patients with AF. Adult cardiomyocyte-specific Erbb4 heterozygous (Erbb4fl/+;Mlc2a-Cre) mice exhibited prolonged P-wave duration in the absence of ventricular dysfunction. Left atrial transcriptomic analysis in Erbb4 haploinsufficient mice showed upregulation of pathways related to fibrosis, apoptosis, and coagulation, and downregulation of pathways related to fatty acid metabolism and mitochondrial function, mirroring changes observed in pressure overload mouse models. A cross-species transcriptomic comparison revealed significant overlap between ERBB4-correlated gene expression and functional pathways in adult human atria and mice with Erbb4 haploinsufficiency. Validating the transcriptomic data, protein and functional assays demonstrated increased fibrosis, apoptosis, and oxidative stress in the mutant left atrial tissue. Conclusion Left atrial ERBB4 levels are reduced in AF patients. A mouse model of Erbb4 deficiency and human atrial transcriptomic analyses highlight a role for ERBB4 in supporting normal atrial metabolism while protecting against inflammation, apoptosis, and fibrosis.