Med
Top medRxiv preprints most likely to be published in this journal, ranked by match strength.
Show abstract
MotivationFanconi anemia (FA) is a rare disease mainly caused by biallelic pathogenic variants, including structural variants such as large deletions and insertions in FA genes. Currently, variant detection is based on short-read sequencing and probe-based approaches. However, determining the exact genomic breakpoint or achieving allelic discrimination remains challenging. Nanopore-based long-read sequencing enables a comprehensive detection of FA variants, but a unified bioinformatic analysis p...
Show abstract
BackgroundThe 2024 blood culture bottle shortage brought diagnostic resource allocation to the forefront, reflecting persistent, foundational challenges with low-value testing and empiric treatment approaches under clinical uncertainty. ObjectiveTo determine whether a machine learning approach using electronic medical record data can predict bacteremia more effectively than existing systems and practices to guide diagnostic testing and empiric treatment strategies. MethodsIn a retrospective co...
Show abstract
Background Whole-genome sequencing (WGS) has improved the diagnosis of rare genetic disorders, yet interpretation of non-coding variants that affect splicing remains challenging. In silico predictions alone are insufficient, and short-read RNA sequencing may fail to capture complex or low-abundance splicing events. Targeted amplicon-based long-read RNA sequencing (Amp-LRS) offers a cost-effective approach for functional validation of candidate splice-altering variants. Methods We applied Amp-LRS...
Show abstract
BackgroundCandidemia is a rare but life-threatening bloodstream infection that remains difficult to predict using conventional risk stratification approaches, highlighting the need for improved predictive strategies. As a result, empiric antifungal therapy is often delayed even in high-risk patients. MethodsWe developed a deep learning model (PyTorch_EHR) to predict 7-day candidemia risk by using electronic health record data from two large cohorts (Houston Methodist Hospital System [HMHS] and ...
Show abstract
Syphilis remains a major public health concern. However, current serologic assays are limited in their ability to distinguish active from previously treated disease. We applied tandem mass tag-based quantitative proteomics to plasma from 10 adults with active syphilis and 10 age- and gender-matched non-diseased controls. We identified 54 differentially regulated proteins (36 upregulated, 18 downregulated). Those proteins map to immune and inflammatory responses, acute-phase signaling, coagulatio...
Show abstract
Introduction Vonoprazan, a new oral potassium-competitive acid blocker (PCAB), has shown promise in terms of superior acid suppression when compared to Proton pump inhibitors (PPIs). We evaluated the efficacy of PCABs versus PPIs in preventing rebleeding in high-risk peptic ulcer patients after endoscopic hemostasis. Methods Following the Preferred Reporting Items for Systematic Reviews and Meta Analyses (PRISMA) guidelines, we conducted a comprehensive search for relevant studies across Medline...
Show abstract
Background and aimsPopulation screening for liver disease in high-risk groups is recommended. Community diagnosis of liver disease is a challenge due to the asymptomatic nature of disease until very advanced stages. Moreover, regional variation in testing availability can result in people with clinically significant liver disease being missed. Machine learning (ML) has been proposed as a method to reduce diagnostic error and automate screening. We present a novel machine learning derived algorit...
Show abstract
Accurate polyp segmentation from colonoscopy images is critical for colorectal cancer prevention, yet the generalization of deep learning models under domain shift remains insufficiently explored. We propose Boundary-Explicit Guided Attention U-Net (BEGA-UNet), a boundary-aware segmentation architecture that introduces explicit edge modeling as a structural inductive bias to enhance both segmentation accuracy and cross-domain robustness. The framework integrates three components: an Edge-Guided ...
Show abstract
BackgroundIn immune checkpoint inhibitor (ICI) trials, overall survival (OS) benefits are well established, yet improvements in quality of life (QoL) are often inconsistent or absent in conventional analyses. This apparent discordance raises important questions: are QoL outcomes truly unrelated to survival, and how can QoL results be better utilized and interpreted? MethodsA model-based meta-analysis (MBMA) of longitudinal EORTC QLQ-C30 global health status/quality of life data from randomized ...
Show abstract
Protocadherin-12 (PCDH12), a cell-adhesion protein belonging to the non-clustered protocadherin family, plays a crucial role in the establishment and regulation of neuronal connections and communication. Bi-allelic loss-of-function (LoF) variants in the PCDH12 gene have been associated with several neurodevelopmental disorders (NDDs) such as diencephalic-mesencephalic junction dysplasia (DMJD) syndrome, cerebral palsy, and cerebellar ataxia, often accompanied by ocular abnormalities. However, ge...
Show abstract
Immune effector cell-associated neurotoxicity syndrome (ICANS) is a common and life-threatening complication of chimeric antigen receptor (CAR) T-cell therapy, with early detection being critical for timely intervention and improved outcomes. Cytokines such as interleukin-6 (IL-6) are key mediators of the inflammatory cascade underlying ICANS pathogenesis, but prospective clinical evidence for their predictive value is limited. Here we quantify IL-6 levels in a prospective cohort of 40 CAR-T pat...
Show abstract
Background Short-chain fatty acids (SCFAs) are widely used as functional readouts of gut microbial activity in vivo. The growing adoption of decentralised study designs and self-collection protocols has amplified the need for reliable room-temperature storage and shipment strategies. However, SCFAs volatility and the persistence of post-collection microbial metabolism raise concerns regarding pre-analytical stability and the interpretability of measured concentrations. Methods We assessed the te...
Show abstract
Introduction: Manual data extraction from unstructured clinical notes is labor-intensive and impractical for large-scale clinical and research operations. Existing automated approaches typically require large language models, dedicated computational infrastructure, and/or task-specific fine-tuning that depends on curated data. The objective of this study is to enable accurate extraction with smaller locally deployed models using a disease-site specific pipeline and prompt configuration that are ...
Show abstract
Serological surveillance is fundamental to infectious disease research and informed public-health decision making. Immunoassays used in the study of pathogen-specific immunity have historically relied on the collection of venous blood. While critical for many public-health applications, this sample collection method is invasive and resource intensive. The costs and logistical barriers associated with venous blood collection are exacerbated in resource-limited regions, and the shift to less invas...
Show abstract
Accurate classification of BRCA1 and BRCA2 variants is essential for cancer risk assessment and therapy selection, yet over one-third remain variants of uncertain significance (VUS). Here, using 120,660 real-world cancer genomic profiles with BRCA1 or BRCA2 variants from a >800,000-sample cohort, we develop machine learning models that predict pathogenicity using clinical and tumor-derived features, including a pan-cancer homologous recombination deficiency signature, co-mutated genes, zygosity,...
Show abstract
Strengthening in-country sequencing capacity generated 28 Lassa virus genomes from human clinical cases, expanding our knowledge of Lassa fever in Guinea. Phylogeographic analysis revealed cross-border exchange between Liberia and the NZerekore region, and a Sierra Leone introduction into the Gueckedou area. Enhanced genomic surveillance is crucial to guide future public health actions.
Show abstract
BackgroundHeartland virus (HRTV) is an emerging tick-borne virus capable of causing severe illness and death. The burden of disease is likely underestimated due to limited seroprevalence studies, lack of commercially available diagnostic tests, and an overlapping clinical syndrome with more commonly diagnosed bacterial diseases such as spotted fever group rickettsiosis or ehrlichiosis. MethodsActive surveillance for Heartland virus disease was conducted at a large academic center from March to ...
Show abstract
BackgroundElimination of Plasmodium vivax is challenging due to its dormant liver stages (hypnozoites), which can reactivate weeks or months after the primary infection, causing relapses and ongoing transmission of the parasite. Despite these challenges, P. vivax clinical case numbers have declined over the past decade in Cambodia. We used parasite genotyping to assess whether the decline in case numbers was reflected in parasite diversity and relatedness as a proxy to transmission. MethodsGeno...
Show abstract
Understanding host susceptibility to Mycobacterium tuberculosis (Mtb) is critical for the development of new vaccines. Certain individuals "resist" becoming infected with Mtb despite intensive exposure; however, it is unknown whether there is a genetic basis for "resistance" to Mtb infection across populations. Here we conducted a genome-wide association study (GWAS) of resistance to Mtb infection by carefully characterizing exposure to TB patients among 4,058 close contacts in India, Brazil, an...
Show abstract
Background and objectives STXBP1-related disorder (STXBP1-RD), caused by pathogenic variants in the STXBP1 gene, is a rare neurodevelopmental condition, characterized by early-onset seizures, developmental delay, intellectual disability (ID), and prominent motor dysfunction. Despite the high prevalence of motor symptoms, systematic gait characterization remains limited. We therefore aimed to quantitively assess gait in individuals with STXBP1-RD. Methods In this cross-sectional study, we include...