Back

Med

26 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
FA-NIVA: A Nextflow framework for automated analysis of Nanopore based long-read sequencing data for genetic analysis in Fanconi anemia
2026-03-04 genetic and genomic medicine 10.64898/2026.02.27.26346867
Top 1% (0.7%)
Show abstract

MotivationFanconi anemia (FA) is a rare disease mainly caused by biallelic pathogenic variants, including structural variants such as large deletions and insertions in FA genes. Currently, variant detection is based on short-read sequencing and probe-based approaches. However, determining the exact genomic breakpoint or achieving allelic discrimination remains challenging. Nanopore-based long-read sequencing enables a comprehensive detection of FA variants, but a unified bioinformatic analysis p...

2
Cultryx: Precision Diagnostic Stewardship for Blood Cultures Using Machine Learning
2026-03-04 infectious diseases 10.64898/2026.02.27.26347214
Top 2% (0.4%)
Show abstract

BackgroundThe 2024 blood culture bottle shortage brought diagnostic resource allocation to the forefront, reflecting persistent, foundational challenges with low-value testing and empiric treatment approaches under clinical uncertainty. ObjectiveTo determine whether a machine learning approach using electronic medical record data can predict bacteremia more effectively than existing systems and practices to guide diagnostic testing and empiric treatment strategies. MethodsIn a retrospective co...

3
Targeted Long-Read sequencing provides functional validation of variants predicted to alter splicing
2026-03-06 neurology 10.64898/2026.03.02.26346984
Top 2% (0.4%)
Show abstract

Background Whole-genome sequencing (WGS) has improved the diagnosis of rare genetic disorders, yet interpretation of non-coding variants that affect splicing remains challenging. In silico predictions alone are insufficient, and short-read RNA sequencing may fail to capture complex or low-abundance splicing events. Targeted amplicon-based long-read RNA sequencing (Amp-LRS) offers a cost-effective approach for functional validation of candidate splice-altering variants. Methods We applied Amp-LRS...

4
Two-step deep-learning candidemia prediction model using two large time-sequence electronic health datasets
2026-03-04 infectious diseases 10.64898/2026.03.03.26347531
Top 3% (0.4%)
Show abstract

BackgroundCandidemia is a rare but life-threatening bloodstream infection that remains difficult to predict using conventional risk stratification approaches, highlighting the need for improved predictive strategies. As a result, empiric antifungal therapy is often delayed even in high-risk patients. MethodsWe developed a deep learning model (PyTorch_EHR) to predict 7-day candidemia risk by using electronic health record data from two large cohorts (Houston Methodist Hospital System [HMHS] and ...

5
An Exploratory Study of Host Plasma Proteomic Signatures that Distinguish Active Syphilis in Adults
2026-03-05 infectious diseases 10.64898/2026.03.04.26347505
Top 3% (0.4%)
Show abstract

Syphilis remains a major public health concern. However, current serologic assays are limited in their ability to distinguish active from previously treated disease. We applied tandem mass tag-based quantitative proteomics to plasma from 10 adults with active syphilis and 10 age- and gender-matched non-diseased controls. We identified 54 differentially regulated proteins (36 upregulated, 18 downregulated). Those proteins map to immune and inflammatory responses, acute-phase signaling, coagulatio...

6
Potassium-competitive acid channel blockers versus Proton-Pump inhibitors in the prevention of post-endoscopic peptic ulcer rebleeding: A systematic review and meta-analysis
2026-03-06 gastroenterology 10.64898/2026.03.02.26346403
Top 4% (0.3%)
Show abstract

Introduction Vonoprazan, a new oral potassium-competitive acid blocker (PCAB), has shown promise in terms of superior acid suppression when compared to Proton pump inhibitors (PPIs). We evaluated the efficacy of PCABs versus PPIs in preventing rebleeding in high-risk peptic ulcer patients after endoscopic hemostasis. Methods Following the Preferred Reporting Items for Systematic Reviews and Meta Analyses (PRISMA) guidelines, we conducted a comprehensive search for relevant studies across Medline...

7
Improving the detection of clinically significant steatotic liver disease using a machine learning algorithm in a real-world primary care population
2026-03-05 gastroenterology 10.64898/2026.03.04.26347631
Top 4% (0.3%)
Show abstract

Background and aimsPopulation screening for liver disease in high-risk groups is recommended. Community diagnosis of liver disease is a challenge due to the asymptomatic nature of disease until very advanced stages. Moreover, regional variation in testing availability can result in people with clinically significant liver disease being missed. Machine learning (ML) has been proposed as a method to reduce diagnostic error and automate screening. We present a novel machine learning derived algorit...

8
BEGA-UNet: Boundary-Explicit Guided Attention U-Net with Multi-Scale Feature Aggregation for Colonoscopic Polyp Segmentation
2026-03-05 gastroenterology 10.64898/2026.03.04.26347608
Top 5% (0.3%)
Show abstract

Accurate polyp segmentation from colonoscopy images is critical for colorectal cancer prevention, yet the generalization of deep learning models under domain shift remains insufficiently explored. We propose Boundary-Explicit Guided Attention U-Net (BEGA-UNet), a boundary-aware segmentation architecture that introduces explicit edge modeling as a structural inductive bias to enhance both segmentation accuracy and cross-domain robustness. The framework integrates three components: an Edge-Guided ...

9
When Survival Improves But Quality of Life Does Not: A Model-Based Meta-Analysis of Immune Checkpoint Inhibitors
2026-03-05 oncology 10.64898/2026.03.04.26347610
Top 6% (0.3%)
Show abstract

BackgroundIn immune checkpoint inhibitor (ICI) trials, overall survival (OS) benefits are well established, yet improvements in quality of life (QoL) are often inconsistent or absent in conventional analyses. This apparent discordance raises important questions: are QoL outcomes truly unrelated to survival, and how can QoL results be better utilized and interpreted? MethodsA model-based meta-analysis (MBMA) of longitudinal EORTC QLQ-C30 global health status/quality of life data from randomized ...

10
Novel PCDH12 pathogenic missense variants cause neurodevelopmental disorders with ocular malformation
2026-03-06 neurology 10.64898/2026.03.05.26343794
Top 8% (0.3%)
Show abstract

Protocadherin-12 (PCDH12), a cell-adhesion protein belonging to the non-clustered protocadherin family, plays a crucial role in the establishment and regulation of neuronal connections and communication. Bi-allelic loss-of-function (LoF) variants in the PCDH12 gene have been associated with several neurodevelopmental disorders (NDDs) such as diencephalic-mesencephalic junction dysplasia (DMJD) syndrome, cerebral palsy, and cerebellar ataxia, often accompanied by ocular abnormalities. However, ge...

11
Early Detection of CAR-T-Associated Neurotoxicity via Cytokine Monitoring in Serum
2026-03-04 oncology 10.64898/2026.03.03.26347491
Top 8% (0.3%)
Show abstract

Immune effector cell-associated neurotoxicity syndrome (ICANS) is a common and life-threatening complication of chimeric antigen receptor (CAR) T-cell therapy, with early detection being critical for timely intervention and improved outcomes. Cytokines such as interleukin-6 (IL-6) are key mediators of the inflammatory cascade underlying ICANS pathogenesis, but prospective clinical evidence for their predictive value is limited. Here we quantify IL-6 levels in a prospective cohort of 40 CAR-T pat...

12
Stability of Microbiome-Derived Fatty Acids in Self-Collected Samples: A Comparative Evaluation of Stool and Blood Matrices
2026-03-06 gastroenterology 10.64898/2026.03.05.26347712
Top 8% (0.3%)
Show abstract

Background Short-chain fatty acids (SCFAs) are widely used as functional readouts of gut microbial activity in vivo. The growing adoption of decentralised study designs and self-collection protocols has amplified the need for reliable room-temperature storage and shipment strategies. However, SCFAs volatility and the persistence of post-collection microbial metabolism raise concerns regarding pre-analytical stability and the interpretability of measured concentrations. Methods We assessed the te...

13
OncoRAG: Graph-Based Retrieval Enabling Clinical Phenotyping from Oncology Notes Using Local Mid-Size Language Models
2026-03-06 oncology 10.64898/2026.03.05.26347717
Top 9% (0.3%)
Show abstract

Introduction: Manual data extraction from unstructured clinical notes is labor-intensive and impractical for large-scale clinical and research operations. Existing automated approaches typically require large language models, dedicated computational infrastructure, and/or task-specific fine-tuning that depends on curated data. The objective of this study is to enable accurate extraction with smaller locally deployed models using a disease-site specific pipeline and prompt configuration that are ...

14
Development and optimization of self-collected, field stable, saliva-based immunoassays for scalable epidemiological surveillance of pathogen-specific immunity
2026-03-06 infectious diseases 10.64898/2026.03.05.26347729
Top 9% (0.3%)
Show abstract

Serological surveillance is fundamental to infectious disease research and informed public-health decision making. Immunoassays used in the study of pathogen-specific immunity have historically relied on the collection of venous blood. While critical for many public-health applications, this sample collection method is invasive and resource intensive. The costs and logistical barriers associated with venous blood collection are exacerbated in resource-limited regions, and the shift to less invas...

15
Cancer genomic profiling predicts pathogenicity of BRCA1 and BRCA2 variants
2026-03-06 genetic and genomic medicine 10.64898/2026.03.05.26347746
Top 10% (0.3%)
Show abstract

Accurate classification of BRCA1 and BRCA2 variants is essential for cancer risk assessment and therapy selection, yet over one-third remain variants of uncertain significance (VUS). Here, using 120,660 real-world cancer genomic profiles with BRCA1 or BRCA2 variants from a >800,000-sample cohort, we develop machine learning models that predict pathogenicity using clinical and tumor-derived features, including a pan-cancer homologous recombination deficiency signature, co-mutated genes, zygosity,...

16
Genomic surveillance of Lassa virus in Guinea through in-country sequencing
2026-03-05 infectious diseases 10.64898/2026.03.04.26347418
Top 10% (0.3%)
Show abstract

Strengthening in-country sequencing capacity generated 28 Lassa virus genomes from human clinical cases, expanding our knowledge of Lassa fever in Guinea. Phylogeographic analysis revealed cross-border exchange between Liberia and the NZerekore region, and a Sierra Leone introduction into the Gueckedou area. Enhanced genomic surveillance is crucial to guide future public health actions.

17
Active Surveillance for Heartland virus in North Carolina: Clinical and Genomic Epidemiology
2026-03-04 infectious diseases 10.64898/2026.02.27.26347100
Top 11% (0.3%)
Show abstract

BackgroundHeartland virus (HRTV) is an emerging tick-borne virus capable of causing severe illness and death. The burden of disease is likely underestimated due to limited seroprevalence studies, lack of commercially available diagnostic tests, and an overlapping clinical syndrome with more commonly diagnosed bacterial diseases such as spotted fever group rickettsiosis or ehrlichiosis. MethodsActive surveillance for Heartland virus disease was conducted at a large academic center from March to ...

18
Temporal trends in Plasmodium vivax diversity in eastern Cambodia evidence declining transmission
2026-03-04 infectious diseases 10.64898/2026.03.03.26346840
Top 11% (0.3%)
Show abstract

BackgroundElimination of Plasmodium vivax is challenging due to its dormant liver stages (hypnozoites), which can reactivate weeks or months after the primary infection, causing relapses and ongoing transmission of the parasite. Despite these challenges, P. vivax clinical case numbers have declined over the past decade in Cambodia. We used parasite genotyping to assess whether the decline in case numbers was reflected in parasite diversity and relatedness as a proxy to transmission. MethodsGeno...

19
Novel Genetic Locus Associated with Resistance to M. tuberculosis Infection: A Multi-Ancestry Genome-Wide Association Study
2026-03-07 infectious diseases 10.64898/2026.03.06.26347614
Top 12% (0.3%)
Show abstract

Understanding host susceptibility to Mycobacterium tuberculosis (Mtb) is critical for the development of new vaccines. Certain individuals "resist" becoming infected with Mtb despite intensive exposure; however, it is unknown whether there is a genetic basis for "resistance" to Mtb infection across populations. Here we conducted a genome-wide association study (GWAS) of resistance to Mtb infection by carefully characterizing exposure to TB patients among 4,058 close contacts in India, Brazil, an...

20
Assessing and quantifying gait deviations in STXBP1-related disorder using three-dimensional gait analysis.
2026-03-07 neurology 10.64898/2026.03.02.26346982
Top 13% (0.3%)
Show abstract

Background and objectives STXBP1-related disorder (STXBP1-RD), caused by pathogenic variants in the STXBP1 gene, is a rare neurodevelopmental condition, characterized by early-onset seizures, developmental delay, intellectual disability (ID), and prominent motor dysfunction. Despite the high prevalence of motor symptoms, systematic gait characterization remains limited. We therefore aimed to quantitively assess gait in individuals with STXBP1-RD. Methods In this cross-sectional study, we include...