Med
○ Elsevier BV
Preprints posted in the last 7 days, ranked by how well they match Med's content profile, based on 38 papers previously published here. The average preprint has a 0.03% match score for this journal, so anything above that is already an above-average fit.
Werner, C. J.; Sanchez-Garcia, E.; Mall, B.; Meyer, T.; Pinho, J.; Schulz, J. B.; Schumann-Werner, B.
Show abstract
Multi-consistency testing during flexible endoscopic evaluation of swallowing (FEES) is clinically necessary but introduces selection bias: worst scores inflate severity because the number of consistencies tested covaries with disease severity. In this retrospective observational study of hospitalized neurological patients, we derived and validated the FEES Dysphagia Index (FDI) in two temporally independent cohorts (Cohort 1: 2013-2018, N=1,257; Cohort 2: 2021-2025, N=1,686) from a single center. FDI-S averages Penetration-Aspiration Scale (PAS) scores across tested consistencies (0-100 scale); FDI-E uses Yale Pharyngeal Residue scores; FDI-C combines both. Selection bias was quantified using sequential branching-tree inverse probability weighting (IPW). Worst PAS overestimated severity by 24%; FDI deviated by <2%. FDI-C was significantly superior to Worst PAS for hospital-acquired pneumonia (HAP; AUC 0.70 vs. 0.60, p<0.001), mortality (0.71 vs. 0.62, p=0.040), and restricted oral intake (0.90 vs. 0.74, p<0.001), and statistically equivalent to clinician-rated severity. FDI-C mapped linearly onto ordinal Functional Oral Intake Scale values (FOIS; proportional odds RCS p=0.99). With functional status and diagnosis, FDI-C reconstructed the clinicians oral intake recommendation with AUC up to 0.93. The FDI-C-mortality relationship was sigmoidal with a clinically relevant transition zone between [~]50 and [~]85. FDI-C is a bias-resilient, bedside-calculable score with interval-scale properties that captures expert clinical judgment, suitable as both a clinical decision support tool and a continuous research endpoint.
Pinto, A.; Dong, X.; Wu, W.; Johnson, S. J.; Wen, Q.; Zhang, C.; Havey, J.; Wang, B.; Tang, G.; Farhat, A.; Zhang, D. Y.; Issa, G. C.; Zhang, X.
Show abstract
Massively multiplexed qPCR is primarily constrained by increasing primer dimer formation as the number of distinct primers in a single reaction increases. Previous multiplex primer design algorithms either fail to sufficiently suppress primer dimers at 100+ plex, or take exceedingly high amounts of computational resources to complete. Here, we present DIMPLE, a linear-runtime primer design algorithm that effectively generates 10,000+ primers to amplify thousands of potential amplicons in a single qPCR reaction. As one clinical demonstration of this algorithm, we designed an assay to detect 2,302 distinct KMT2A gene fusion subtypes using 204 primers in a single tube. In contrast to FISH and convention NGS approaches with 2% variant allele frequency (VAF) limit of detection, our DIMPLE qPCR assay was able to analytically detect gene fusions down to 0.05% VAF. We also constructed proof-of-concept multiplex qPCR panels for additional oncology gene fusions, multiplex pathogen detection, and DNA methylation markers. The scalability and low computational cost DIMPLE are complementary to new instrument platforms for massively multiplex qPCR readout for enabling rapid, point-of-care nucleic acid testing.
Schreiner, P. A.; Markianos, K.; Francis, M.; Despard, B.; Gorman, B. R.; Said, I.; Dong, F.; Gautam, S.; Dochtermann, D.; Shi, Y.; Devineni, P.; Kirkpatrick, C.; Khazanov, N.; Moser, J.; Million Veteran Program, ; Huang, G. D.; Muralidhar, S.; Tsao, P. S.; Pyarajan, S.
Show abstract
The Million Veteran Program (MVP) represents the largest and one of the most diverse single cohorts associated with longitudinal Electronic Health Record data (EHR) data. We profiled a subset of samples from MVP using the Illumina Infinium MethylationEPIC Beadchip (EPIC array) to generate one of the largest single cohort methylation dataset to-date. Methylation profiles were analyzed for 45,460 total individuals, with the most populous ancestries composed of 27,455 Europeans, 11,798 African Americans, and 4,859 Admixed Americans. We detail the strict quality control standards implemented to ensure the most robust method of methylation profiling of the MVP cohort. This dataset was then applied to evaluate the effects of smoking exposure on DNA methylation in MVP participants. Ancestry-stratified epigenome-wide association studies (EWAS) of smoking status (ever/never) were performed using over 750,000 probes with certifiable signal. Our multi-ancestry meta-analysis demonstrates replicability with existing EWAS and identifies 3,207 novel probe-smoking associations unlocked via the depth and breadth of data in this cohort.
Auger, S. D.; Varley, J.; Hargovan, M.; Scott, G.
Show abstract
Background: Current medical large language model (LLM) evaluations largely rely on small collections of cases, whereas rigorous safety testing requires large-scale, diverse, and complex cases with verifiable ground truth. Multiple Sclerosis (MS) provides an ideal evaluation model, with validated diagnostic criteria and numerous paraclinical tests informing differential diagnosis, investigation, and management. Methods: We generated synthetic MS cases with ground-truth labels for diagnosis, localisation, and management. Four frontier LLMs (Gemini 3 Pro/Flash, GPT 5.2/5 mini) were instructed to analyse cases to provide anatomical localisation, differential diagnoses, investigations, and management plans. An automated evaluator compared these outputs to the ground-truth labels. Blinded subspecialty experts validated 70 cases for realism and automated evaluator accuracy. We then evaluated LLM decision-making across 1,000 cases and scaled to 10,000 to characterise rare, catastrophic failures. Results: Subspecialist expert review confirmed 100% synthetic case realism and 99.8% (95% CI 95.5 to 100) automated evaluation accuracy. Across 1,000 generated MS cases, all LLMs successfully included MS in the differential diagnoses for more than 91% cases. However, diagnostic competence did not associate with treatment safety. Gemini 3 models had low rates of clinically appropriate steroid recommendations (Flash: 7.2% 95% CI 5.6 to 8.8; Pro: 15.8% 95% CI 13.6 to 18.1) compared to GPT 5 mini (23.5% 95% CI 20.8 to 26.1), frequently overlooking contraindications like active infection. OpenAI models inappropriately recommended acute intravenous thrombolysis for MS cases (9.6% GPT 5.2; 6.4% GPT 5 mini) compared to below 1% for Gemini models. Expanded evaluation (to 10,000 cases) probed these errors in detail. Thrombolysis was recommended in 10.1% of cases lacking symptom timing information and paradoxically persisted (2.9%) even when symptoms were explicitly documented as more than 14 days old. Conclusion: Automated expert-level evaluation across 10,000 cases characterised artificial intelligence clinical blind spots hitherto invisible to small-scale testing. Massive-scale simulation and automated interrogation should become standard for uncovering serious failures and implementing safety guardrails before clinical deployment exposes patients to risk.
Jia, E.; Omar, M.; Barash, Y.; Brook, O. R.; Ahmed, M.; Kruskal, J. B.; Gorenshtein, A.; Klang, E.
Show abstract
Ramaswamy et al. recently reported in Nature Medicine that ChatGPT Health, a consumer-facing health AI tool, undertriaged 51.6% of true emergencies. It was also susceptible to social anchoring in a structured stress test of triage recommendations. We applied the same vignette-based benchmark to OpenEvidence, a widely used physician-facing AI platform for clinical decision support. The benchmark included 960 prompts across 21 clinical domains (Supplementary Table S3). OpenEvidence undertriaged 12.5% of emergencies, a four-fold reduction relative to ChatGPT Health. It also showed no anchoring effect. Its errors skewed in a safer direction, including 68.0% overtriage of Home presentations. In 65 of 960 responses (6.8%), it declined to assign a triage level. These refusals occurred only in symptom-only prompts and never in urgent or emergency cases. Performance improved when objective clinical data were provided. Under the same benchmark, a widely used physician-facing system showed a different safety profile from a consumer-facing one. This suggests that who a health AI is built for can shape how it fails.
Yang, I. Y.; Patil, A.; Jin, O.; Loud, S.; Buxhoeveden, S.; Zhang, D. Y.
Show abstract
Multiple sclerosis (MS) is a debilitating disease affecting more than 1 million Americans, and today is assessed primarily through magnetic resonance imaging (MRI) and observational clinical symptoms. Given the autoimmune nature of MS, we hypothesized that high-dimensional gene expression data from peripheral blood mononuclear cells (PBMCs), when analyzed with the assistance of AI, may collectively serve as valuable biomarkers for the real-time risk and progression of MS. Here, we present PBMC RNA sequencing (RNAseq) results from N=997 samples, including 540 MS, 221 neuromyelitis optica (NMO), and 149 healthy controls. We constructed and optimized ensemble models for three clinical outcomes: (1) discrimination of early MS (EDSS [≤] 2.0) from healthy individuals with 74% AUC at 100% coverage, (2) differential diagnosis of MS from NMO with 91% AUC at 80% coverage, and (3) subtyping RRMS from progressive MS with 79% AUC at 80% coverage. To our knowledge, no prior molecular test has been reported for any of these three MS clinical tasks, and these results may have immediate impact on clinical management of MS patients. Two innovations that improved the stratification accuracy of our models: selection of gene sets based on expression variance in disease states, and use of non-linear rank sort and conviction weighting in the ensemble score calculation.
Hosking, A.; Iveson, M. H.; Sherlock, L.; Mukherjee, M.; Grover, C.; Alex, B.; Parepalli, S.; Mair, G.; Doubal, F.; Whalley, H. C.; Tobin, R.; Wardlaw, J. M.; Al-Shahi Salman, R.; Whiteley, W. N.
Show abstract
Background Outcome after stroke varies according to stroke subtype by location, but healthcare systems data studies do not include subtyping information. We linked natural language processing (NLP) of brain imaging reports to routinely collected data to estimate risk of death and other outcomes after stroke subtypes in a nationwide dataset. Methods We applied a previously validated NLP algorithm to all CT and MRI head scan reports in Scotland between 2010 and 2018. We linked the reports to hospital readmissions, prescriptions and death data to identify and characterize people with stroke, and to categorize into deep and cortical ischemic stroke, deep and lobar intracerebral hemorrhage (ICH), subarachnoid hemorrhage, and subdural hemorrhage. We used a matched cohort design, and age- and sex-matched four controls per case who never had a stroke. By subtype, we estimated rehospitalization with stroke, myocardial infarction (MI), cancer, dementia, epilepsy and death, accounting for confounders and competing risk of death. Results From 785,331 people with a head scan, we identified 64,219 with clinical stroke phenotypes (mean age 73.4yrs, 49.5% male), and subtyped 12,616 with deep ischaemic stroke; 14,103 with cortical ischaemic stroke; 1,814 with deep ICH; and 1,456 with lobar ICH. There was higher absolute rate of 1-year hospital readmission for lobar compared with deep ICH (4.9% [95%CI 3.9% - 6.1%] vs 3.4% [2.6% - 4.3%]), higher risk of dementia beyond 6 months after lobar ICH compared to controls than for other stroke subtypes (aHR 3.5 [2.3-5.3]); and higher risk of MI within 6 months of cortical ischemic stroke than for other stroke subtypes (aHR 4.6 [3.4-6.3]). Conclusions NLP of free-text reports linked to coded data successfully subtyped stroke at scale, and we estimated risk of clinically relevant outcomes. Future work should use free text to enable large-scale audit and epidemiology of people with stroke.
Shim, K. B.
Show abstract
Pancreatic ductal adenocarcinoma (PDAC) remains one of the deadliest solid tumors and continues to face low treatment-trial participation, fragmented evidence workflows, and labor-intensive ab- straction of unstructured clinical text. Existing oncology-focused language models show promise, but many depend on private institutional corpora, limiting reproducibility and practical reuse across centers. We present Onca, an open 9B dense model designed for four PDAC-relevant tasks: trial eligibility screening, case-specific clinical reasoning, structured pathology report extraction, and molecular variant evidence reasoning. Onca is fine-tuned from Qwopus3.5-9B-v3 with a single Un- sloth BF16 LoRA adapter on 37,364 training rows drawn from openly available sources. The evalu- ation spans 11 panels and compares Onca against Woollie-7B, CancerLLM-7B, OpenBioLLM-8B, and the unmodified Qwopus base. Onca achieves the strongest overall results on Trial Screening (81.6 F1), Clinical Reasoning (14.1 composite), Pathology Extraction (30.5 field exact-match), Pub- MedQA Cancer (68.3 macro-F1), and PubMedQA (66.5 macro-F1). The strongest gains appear in tasks closest to routine oncology workflow, especially trial review and pathology structuring. These findings suggest that clinically targeted pancreatic-cancer language models can be built from open data with competitive performance while remaining practical to train on a single workstation-scale GPU setup.
Sankaranarayanan, M.; Donahue, M. A.; Brooks, J. D.; Sun, S.; Newhouse, J. P.; Blacker, D.; Haneuse, S.; Hernandez-Diaz, S.; Moura, L. M. V. R.
Show abstract
ObjectiveLevetiracetam is commonly prescribed for seizure prophylaxis after acute ischemic stroke (AIS) and often continued beyond discharge. While its short-term effectiveness for preventing post-stroke seizures is established, it is unclear whether prolonged use improves survival, particularly in older adults. We estimated the effect of continued levetiracetam use on 90-day mortality among Medicare beneficiaries after AIS. MethodsUsing Traditional Medicare claims data (2008-2021), we identified beneficiaries aged [≥]66 years hospitalized for AIS who initiated outpatient levetiracetam within 90 days of discharge. After one month of continued post-stroke use of levetiracetam (start of follow-up), we compared 90-day mortality between patients with a new levetiracetam dispensation within a 14-day grace period post-follow up and those without one. We performed cloning, censoring and weighting to address immortal time bias and estimated standardized mortality risks, risk differences, and 95% confidence intervals (CI). ResultsAmong 3,212 eligible beneficiaries, 1,779 (55.4%) received a new levetiracetam dispensation within the 14-day grace period. Median age was 76 years (IQR 70-83); 57.8% were female. After adjustment for demographics, hospitalization characteristics, timing of initiation, and comorbidities, continued use was associated with lower 90-day mortality than discontinuation (53 vs 62 deaths per 1,000; risk difference -9 per 1,000; 95% CI: (-12,-5)). The reduction was observed primarily among patients aged [≥]75 years. SignificanceAmong older Medicare beneficiaries who initiated levetiracetam after AIS, continued outpatient use was associated with modestly lower 90-day mortality, particularly in those aged [≥]75 years. These findings suggest potential benefits of levetiracetam continuation beyond the immediate post-stroke period.
Yi, B.; Kim, H. Y.; Sotka, W.; Estey, R.; Green, S. J.; Shiau, H.
Show abstract
Gingival inflammation is associated with dysbiotic oral biofilms characterized by reduced nitrate-reducing capacity and diminished nitric oxide (NO) bioavailability. While dietary nitrate has been shown to influence oral microbial activity, the effects of sustained, localized nitrate delivery on oral biofilm ecology and gingival inflammation remain incompletely defined. In this randomized, double-blind, placebo-controlled trial, 30 adults with gingival bleeding were assigned to receive localized prebiotic nitrate (~0.989 mmol per dose) or placebo for 21 days. The primary outcome was mean bleeding on probing (mBOP). Secondary outcomes included modified Gingival Index (mGI), Quigley-Hein plaque index (QHPI), salivary nitrite (as a proxy for NO bioavailability), oral pH, and microbiome composition assessed by 16S rRNA gene sequencing. Prebiotic nitrate supplementation formulation delivered in a slow-release chewing gum significantly reduced mBOP (25.7% to 15.3%; p = 0.0002) compared to placebo chewing gum. Salivary nitrite levels and oral pH increased, indicating enhanced nitrate metabolism. Microbiome analysis demonstrated enrichment of nitrate-reducing taxa, including Rothia mucilaginosa and Neisseria spp., and a relative reduction in inflammation-associated genera such as Prevotella and Porphyromonas. Localized prebiotic nitrate formula delivered in a functional chewing gum was associated with reduced gingival inflammation and shifts in oral microbiome composition consistent with enhanced nitrate-reducing capacity critical in nitric oxide formation. These findings support a role for biofilm-directed nutritional modulation as a non-antimicrobial approach for managing gingival inflammation and improving nitric oxide bioavailability.
Chen, Y.; Law, Z. K.; Zhou, X.; Dai, Q.; Xiang, S.; Xiao, X.; Ma, J.; Feng, M.; Peng, W.; Zhou, S.; Chen, L.; Zhou, Y.; Lai, Y.; Yeo, L.; An, S.; He, Y.; Pan, S.-Y.
Show abstract
Abstract Objective: To compare the safety and efficacy of bridging intravenous thrombolysis (IVT) plus endovascular thrombectomy (EVT) versus direct EVT in patients with acute ischemic stroke (AIS) due to anterior circulation large vessel occlusion (LVO) treated within the 6- to 24-hour time window. Methods: This is a retrospective analysis of prospective EVT registry from 10 comprehensive stroke centers in China and Singapore between 2019 and 2024. Eligible patients had anterior circulation LVO, underwent EVT within 6-24 hours of onset, had ASPECTS 6, NIHSS 6, and pre-stroke mRS 2. Patients were stratified into bridging IVT + EVT (IVT group) versus direct EVT alone (non-IVT group). Propensity score matching (1:2 ratio) was performed to balance baseline covariates. The primary outcome was 3-month favorable functional outcome (mRS 0-2). Secondary outcomes included successful recanalization (mTICI 2b-3), symptomatic intracranial hemorrhage (sICH), hemorrhagic transformation (HT) and 3-month mortality. In the matched cohort, binary outcomes were compared using the Cochran-Mantel-Haenszel test. Results: Of 772 included patients, 110 (14.2%) received bridging IVT and 662 (85.8%) received direct EVT. After propensity score matching, 202 non-IVT patients were matched to 101 IVT patients, with all covariates well-balanced (absolute SMD <0.10). In the matched cohort, bridging IVT was not associated with a significant difference in 3-month favorable outcome (44.55% vs. 47.03%; common OR 0.91; 95% CI 0.56-1.46), successful recanalization (91.09% vs. 90.10%; OR 1.11; 0.51-2.44), sICH (5.94% vs. 9.41%; OR 0.61; 0.24-1.58), HT (23.76% vs. 23.27%; OR 1.03; 0.57-1.85), or 3-month mortality (15.84% vs. 13.37%; OR 1.22; 0.62-2.37). Conclusion: In this large multicenter propensity score-matched analysis, bridging intravenous thrombolysis before endovascular thrombectomy in the 6- to 24-hour time window was not significantly associated with improved efficacy or increased safety risks compared with direct endovascular therapy alone.
Mavura, Y.; Crosslin, D.; Ferar, K. D.; Lawlor, J. M.; Greally, J. M.; Hindorff, L.; Jarvik, G. P.; Kalla, S.; Koenig, B. A.; Kvale, M.; Kwok, P.-Y.; Norton, M.; Plon, S. E.; Powell, B. C.; Slavotinek, A.; Thompson, M. L.; Popejoy, A. B.; Kenny, E. E.; Risch, N.
Show abstract
PurposeDiagnostic yield from exome and genome sequencing varies widely across studies. It remains unclear how much of this variation reflects patient-level factors (e.g., sex, clinical features, race/ethnicity, genetic ancestry) versus site-level practices such as sequencing modality or variant interpretation workflows. We aimed to quantify the contributions of these factors to diagnostic outcomes across five U.S. clinical sequencing sites. MethodsWe performed a cross-sectional analysis of 3,008 prenatal, neonatal, and pediatric cases from the NHGRI Clinical Sequencing Evidence-Generating Research (CSER) consortium (2017-2023). Clinical indications spanned neurodevelopmental, neurological, immunological, metabolic, craniofacial, skeletal, cardiac, prenatal, and oncologic presentations. Genetic ancestry was inferred from sequencing data, and variants were interpreted using ACMG/AMP guidelines to classify DNA-based diagnoses. Generalized linear mixed models were used to estimate associations between diagnostic yield and fixed effects (sex, prenatal status, isolated cancer, number of clinical indications, sequencing modality, race/ethnicity, and genetic ancestry), while modeling study site as a random effect to quantify between-site variation. ResultsThe overall diagnostic yield was 19.0%. Multiple clinical indications (OR=1.47, 95% CI 1.20-1.80, p<0.001) were associated with higher diagnostic yield, and male sex (OR=0.80, 95% CI 0.66-0.96, p=0.017) and prenatal status (OR=0.63, 95% CI 0.44-0.90, p=0.012) were associated with lower yield. Sequencing modality, race/ethnicity, genetic ancestry, and isolated cancer were not statistically significantly associated with diagnostic outcomes.. A model without fixed effects attributed [~]10% of variance in diagnostic yield to between-site differences. After adjusting for covariates, site-level variance decreased to 5.7%, indicating consistent variation across sites not explained by measured patient factors. ConclusionAcross five sites, patient-level clinical features influenced diagnostic yield, but substantial site-level variation remained even after adjustment. Differences in variant interpretation, or case-classification practices may contribute to this residual variability. Further efforts to increase consistency in exome- and genome-sequencing diagnostic workflows may help reduce inter-site differences.
James-Pemberton, P.; Harper, D.; Wagerfield, P.; Watson, C.; Hervada, L.; Kohli, S.; Alder, S.; Shaw, A.
Show abstract
A multiplex diagnostic test is evaluated for self-reported long COVID associated persistent symptoms and a poor recovery from a SARS-CoV-2 infection. A mass-standardised concentration of total antibodies (AC), high-quality (HQ) antibodies and percentage of HQ antibodies (HQ%) is assessed against a spectrum of spike proteins to the SARS-CoV-2 variants: Wuhan, , {delta}, and the Omicron variants BA.1, BA.2, BA.2.12.1, BA.2.75, BA.5, CH.1.1, BQ.1.1 and XBB.1.5 in three cohorts. A cohort of control patients (n = 46) recovered (CC) and a cohort of self-declared long COVID patients (n = 113) (LCC). A nested Receiver Operating Characteristic (ROC) analysis, performed for the variant with lowest HQ concentration in the spectrum, produced an area under the curve and AUC = 0.61 (0.53-0.70) for the CC vs LCC cohorts. For the LCC cohort, the cut-off thresholds for AC = 0.8 mg/L, HQ = 1.5 mg/L and HQ% of 34% were determined, leading to a 71% sensitivity and 66% specificity derived by the Youden metric. The cohorts may be fully classified based on ROC and outlier analysis to give an incidence of persistent virus 62% (95% CI 52% - 71%), hyperimmune 12% (95% CI 7% - 20%) and unclassified, 26% (95% CI 18% - 35%). The overall diagnostic accuracy for both the hyper and hypo immune is 69%. All clinical interventions can now be tailored for the heterogenous long COVID patient cohort.
Tenbusch, M.; Koopman, G.; Mooij, P.; Roshani, B.; Irrgang, P.; Lapuente, D.; Kondova, I.; Bogers, W. M.; Remarque, E. J.; Vestweber, R.; Merida Ruiz, S. A.; Krüger, N.; Meyer, S.; Gefeller, O.; Stahl-Hennig, C.; Überla, K.
Show abstract
In a confirmatory study, we evaluated the immunogenicity and protective efficacy of a heterologous prime-boost vaccination strategy against respiratory syncytial virus (RSV) in non-human primates. Building on prior evidence of protective mucosal immunity induced by intramuscular DNA priming followed by an oropharyngeal adenoviral boost, we conducted a randomized, blinded, dual-centre study across two European primate research facilities. Rhesus macaques received a codon-optimized RSV-F DNA vaccine via electroporation, followed by two mucosal administrations of a recombinant adenovirus serotype 5 vector encoding the same antigen. Control groups included animals vaccinated with irrelevant influenza antigens and a comparator group mimicking natural immunity induced by primary RSV infection. Systemic and mucosal immune responses, including RSV-F-specific antibodies and tissue-resident memory T cells, were monitored longitudinally. Here, we detected robust immune responses, but with some variability between the two centres. However, following experimental RSV challenge performed 22 weeks after the final immunization, RSV-vaccinated animals demonstrated markedly reduced viral replication in both upper and lower respiratory tracts. However, unexpected RSV-specific immunity in the control group at one single study site prevented confirmation of the predefined primary endpoint. Overall, these results support the potential of mucosal adenoviral boosting following DNA priming to induce protective immunity against RSV, while highlighting challenges associated with multi-centre preclinical vaccine studies.
Haug, M.; Ilves, N.; Umov, N.; Loorents, H.; Suvalov, H.; Tamm, S.; Oja, M.; Reisberg, S.; Vilo, J.; Kolde, R.
Show abstract
Abstract Objective To address the unresolved bottleneck of selecting cohort-relevant clinical concepts for treatment trajectory analysis in observational health data, we introduce CohortContrast, an OMOP-compatible R package for enrichment-based concept identification, temporal and semantic noise reduction, and concept aggregation, enabling cohort-level characterization and downstream trajectory analysis. Materials and Methods We developed CohortContrast and applied it to OMOP-mapped observational data from the Estonian nationwide OPTIMA database, which includes all cases of lung, breast, and prostate cancer, focusing here on lung and prostate cancer cohorts. The workflow combines target-control statistical enrichment, temporal/global noise filtering, hierarchical concept aggregation and correlation-based merging, with optional patient clustering for downstream trajectory exploration. We validated the approach with a clinician-based plausibility assessment of extracted diagnosis-concept pairs and evaluated a large language model (LLM) as an auxiliary filtering step. Results We analyzed 7,579 lung cancer and 11,547 prostate cancer patients. The workflow reduced concept dimensionality from 5,793 to 296 concepts (94.9%) in lung cancer and from 5,759 to 170 concepts (97.0%) in prostate cancer, and identified three exploratory patient subgroups in both cohorts. In a plausibility assessment of 466 diagnosis-concept pairs, validators rated 31.3% as directly linked and 57.5% as indirectly linked. Discussion CohortContrast reduces manual concept curation by prioritizing and aggregating cohort-relevant concepts while preserving clinically interpretable treatment patterns in OMOP-based real-world data. Conclusion CohortContrast enables scalable reduction of broad OMOP concept spaces into clinically interpretable, cohort-specific representations for exploratory trajectory analysis and real-world evidence research.
Tiseo, K.; Dräger, S.; Santhosh Kumar, H.; Alkhazashvili, M.; Hammann, A.; Risch, P.; Willi, R.; Mkhatvari, T.; Fialova, C.; Adlhart, C.; Szabo, D.; Suknidze, M.; Patchkoria, I.; Broger, T.; Ivanova Reipold, E.; Varshanidze, K.; Osthoff, M.
Show abstract
1.Etiological diagnosis of lower respiratory tract infections (LRTIs) relies on sputum or bronchoalveolar lavage (BAL), which may be difficult to obtain or invasive. Exhaled breath aerosol (XBA) sampling offers a non-invasive alternative for pathogen detection. We evaluated the performance of the AveloMask, a face mask-based device designed to capture XBAs for molecular testing. In this prospective paired-sample study, hospitalized adults with pneumonia at three hospitals in Switzerland and Georgia provided an XBA sample using the AveloMask and a lower respiratory tract (LRT) specimen (sputum or BAL). XBA samples were analyzed by multiplex PCR using the Roche LightMix(R) panel and LRT samples were tested using the BioFire(R) FilmArray(R) Pneumonia Panel. Concordance between XBA and LRT samples was assessed using positive percent agreement (PPA), negative percent agreement (NPA), and overall percent agreement (OPA). Ninety-three participants were enrolled and 63 participants provided paired samples. AveloMask sampling identified the dominant pathogen (lowest Ct value in the LRT sample) in 40/47 LRT-positive cases (85.1%). Across all targets, PPA was 61% (95%CI, 50-72%), NPA was 100% (95%CI, 99-100%), and OPA was 95% (95% CI, 92-96%). PPA was higher for bacteria than for viruses and lower PPA was largely driven by reduced detection of low-abundance or co-infecting pathogens. In a subset analysis, AveloMask results showed substantial overlap with standard-of-care testing and could have supported antimicrobial de-escalation. Breath aerosol sampling using the AveloMask enabled non-invasive molecular detection of LRT pathogens in pneumonia cases and may complement conventional standard-of-care testing, particularly when sputum is unavailable.
Bar, O.; Murthy, M.; Cosgrove, K.; Saidi, Y.; El-Arar, W.; Goldenberg, M.; Sauvage, G.; Bergerat, A.; Cooley Demidkina, B.; Laliberte, K.; Xu, J.; Pierson, G.; Kwon, D. S.; Niles, J.; Yassour, M.; Mitchell, C.
Show abstract
ImportanceEmerging data show that B-cell depleting chemotherapies, which are increasingly used to treat autoimmune disorders and multiple sclerosis, can be associated with mucosal side effects such as inflammatory vaginitis. ObjectiveEvaluate the impact of rituximab treatment on vaginal mucosal immune markers, endocervical immune cell populations and vaginal microbiome. DesignCross-sectional observational study conducted between 2022 - 2024. SettingAcademic medical center, Boston Massachusetts. ParticipantsWe enrolled women aged >18 years who were either 1) receiving rituximab for autoimmune renal disease or were 2) healthy controls ExposureTreatment with rituximab, an anti CD20 monoclonal antibody. Main outcome and measureWe compared endocervical immune cell populations, vaginal fluid immune markers, vaginal fluid immunoglobulins and vaginal microbiome composition between individuals being treated with rituximab and healthy controls. ResultsWe enrolled 26 women treated with rituximab for autoimmune renal disease and 26 healthy controls. Median circulating and endocervical B-cell and plasma cell proportions were significantly lower in treated participants compared to controls. Median vaginal fluid IgA concentrations were significantly lower in participants treated with rituximab, while ILE, IgM, IgG1, IgG2, IgG3 and IgG4 were not different between groups. Total T cell frequencies were similar between groups, but the proportion of activated T cells (CD4+CD38+HLADR+) was significantly lower in people treated with rituximab. Concentrations of IL10, IL13, IL17, IL21, IL23, IL4, ITAC and TNFa were elevated in vaginal fluid from the rituximab group, while IL-8 was lower. A CST-IV-C, low-Lactobacillus pattern of vaginal microbiota was more common in the rituximab group. Conclusions and RelevanceSystemic B-cell depletion is associated with reduced vaginal fluid IgA, a more diverse microbiome composition, and increases in many vaginal fluid immune markers compared to healthy controls. The reduction in vaginal fluid IgA may provide opportunities for vaginal bacteria to induce inflammation. Key pointsO_ST_ABSQuestionC_ST_ABSHow does circulating B-cell depletion impact the vaginal microenvironment? FindingsIn this cross-sectional study of 52 women, B cell and plasma cell proportions were significantly lower in both blood and vaginal mucosa among rituximab-treated participants compared to healthy controls. Vaginal IgA concentrations, but not other immunoglobulins, were significantly lower in rituximab treated participants. In treated participants, vaginal cytokine concentrations were elevated, and microbiome composition shifted toward non-Lactobacillus-dominant communities. In six people with inflammatory vaginitis, both circulating and endocervical B cells were lowest in people with the most severe symptoms. MeaningSystemic B cell depletion is associated with alterations in vaginal mucosal immune markers and microbiome composition which increase local inflammation.
TRIPATHI, H.; Roy, K.; Rahimi, S.; Neupane, S.; Bozorgzad, S.
Show abstract
Sepsis is a leading cause of in-hospital mortality, yet systematically evaluating temporal adherence to the Surviving Sepsis Campaign (SSC) bundle across large patient populations remains difficult due to semantic variability in electronic health records and the loss of clinical nuance inherent in binary pass/fail compliance judgments. We present an expert-guided neuro-symbolic pipeline that pairs LLM-based semantic normalization with a Sugeno fuzzy inference system encoding eight SSC bundle rules, producing graded per-episode compliance scores whose clinical decision boundaries are set through domain expert consultation. Applied to 2,438 sepsis episodes from MIMIC-IV v3.1, the dual-classifier normalization layer achieves substantial inter-system agreement with high embedding-based confirmation, resolving hundreds of clinically relevant drug strings that purely symbolic systems miss. The graded framework reveals that Hour-1 bundle failures, particularly antibiotic timing, are the dominant driver of low overall compliance, and that higher bundle adherence is associated with notably shorter ICU stays, with antibiotic delays beyond six hours increasing median stays by 61%. These results demonstrate that neuro-symbolic graded assessment can surface actionable compliance patterns that binary evaluation frameworks cannot capture.
Shi, Z.; Zhang, Z.; Mandla, R.; Hou, K.; Pasaniuc, B.
Show abstract
Polygenic scores (PGS) have emerged as a useful biomarker for stratification of high-risk individuals in genomic medicine, with prediction intervals arising as a principled approach to incorporate statistical uncertainty in their individual-level predictions. In contrast to recent reports by Xu et al7, we show that CalPred6 provides well-calibrated prediction intervals that contain the trait phenotypes at targeted confidence levels. CalPred maintains calibration when PGS performance varies across contextual factors (e.g., ancestry, age, sex, or socio-economic factors) whereas PredInterval7 - a recently introduced method that focuses on marginal calibration across all individuals - exhibits miscalibration.
Schreck, K.; Lal, B.; Zhou, J.; Lopez Bertoni, H.; Holdhoff, M.; Ewesudo, R.; Bhatia, K.; Chamberlain, M.; Laterra, J.
Show abstract
Purpose: Limited CNS bioavailability and pharmacodynamics are obstacles to effective systemic therapies for glioblastoma. One strategy to overcome these challenges is drug combinations enhancing CNS penetration and/or tumor chemosensitivity. LP-184, a synthetic acylfulvene class alkylator, induces DNA damage and inhibits glioblastoma cell viability in pre-clinical models. LP-184 is a prodrug converted to active metabolites by intracellular prostaglandin reductase 1 (PTGR1) that is over-expressed in >70% of glioblastoma. DNA damage induced by LP-184 is MGMT agnostic and reversed by transcription-dependent NER. Patients: LP-184 was evaluated in a Phase 1a study (NCT05933265) in 63 adult patients with advanced malignancies including 16 patients with recurrent glioblastoma. All patients with glioblastoma received prior standard-of-care therapy and most had received 1 or more additional therapies before enrollment. Results: Patients with glioblastoma experienced more frequent transaminitis, Grade 1-2 nausea and a trend towards more frequent and severe thrombocytopenia compared to the non-glioblastoma cohort. Otherwise, overall toxicity profiles were similar. Clinical pharmacokinetic analysis combined with published pre-clinical intra-tumoral bioavailability data (~20% penetration) predicted that LP-184 at the recommended dose for expansion (RDE) would achieve cytotoxic levels if combined with spironolactone, a BBB permeable ERCC3 degrader and TC-NER inhibitor that sensitizes glioblastoma cells to LP-184 3-6-fold. We show that three daily doses of spironolactone deplete orthotopic glioblastoma PDX ERCC3 protein by ~ 80% and increases tumor LP-184 cytotoxicity 2-fold. Conclusions: LP-184 is well tolerated at the RDE, and we establish a clinically translatable scheme for dosing spironolactone in combination with LP-184 for a future Phase 1b clinical trial.