Nature

58 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
Too rare to be random: genetic finding suggests previously unrecognized path of mutagenesis
2026-03-04 genetic and genomic medicine 10.64898/2026.03.03.26346966
Top 0.3% (3.8%)

We report a previously undescribed genotypic configuration identified in twins with HNRNPU-related neurodevelopmental disorder. Both twins have two closely spaced mosaic variants on the same allele that never co-occur on any single DNA molecule, resulting in three distinct cell lineages within each individual. We define this genotypic configuration as clustered monoallelic mosaicism (cMoMa). Recognizing the extreme improbability of such a configuration, we systematically explore two potential me...

2
Novel transposon Tn8026 acts as a global driver of transmissible linezolid resistance in Enterococcus via a linear plasmid
2026-03-04 infectious diseases 10.64898/2026.03.04.26347163
Top 0.8% (2.5%)

Linezolid is a critical last-resort antimicrobial for multidrug-resistant Enterococcus faecium, particularly against vancomycin-resistant lineages where therapeutic options are severely limited. While resistance has historically arisen through de novo chromosomal mutations, the global emergence of transferable resistance mechanisms threatens to render more infections untreatable. Here, we characterise a recent (2023-2024) hospital-associated outbreak of linezolid-resistant E. faecium in Queensla...

3
Pan-cancer tumour classification and risk stratification from whole-genome somatic variants via dual-task representation learning
2026-03-04 genetic and genomic medicine 10.64898/2026.03.02.26347318
Top 1.0% (2.0%)

Tumour typing from whole-genome sequencing is increasingly accurate, yet molecular subtyping from somatic variants remains challenging because of tumour heterogeneity and inconsistent clinical annotations. Here, we present Mutation-Attention Dual-Task (MuAt2), a Transformer model that jointly classifies histological tumour types and subtypes directly from somatic single-nucleotide variants, indels and structural variants. MuAt2 leverages encoders pre-trained on 2,587 pan-cancer whole genomes, an...

4
Inferring Respiratory Disease Biology from Geolocation Data
2026-03-05 infectious diseases 10.64898/2026.03.05.26347578
Top 2% (1.6%)

Biological fitness quantifies the efficiency and selective advantage of pathogens and hosts in their bilateral interaction. Key questions--such as how much more infectious an emerging variant is compared with its predecessor, or how much protection vaccination offers relative to no vaccination--require fitness to be measured systematically, in real time, and ideally beyond controlled laboratory settings. We propose an approach that infers biological fitness from mostly non-biological data on inf...

5
Integrative screening identifies functional variants and VNTRs underlying GWAS signals at the 5p15.33 multi-cancer susceptibility locus
2026-03-04 genetic and genomic medicine 10.64898/2026.03.03.26347427
Top 3% (1.5%)

Chromosome 5p15.33 harbors several independent association signals which demonstrate antagonistic pleiotropy across cancer types, with causal mechanisms largely unresolved. To identify functional variants and enhancer elements at this locus, we performed statistical fine-mapping followed by massively parallel reporter assays (MPRA) and proliferation-based CRISPRi screens. This approach identified eight multi-cancer functional variants (MCFVs) across three GWAS signals. Targeting rs421629 (part o...

6
Targeted Long-Read sequencing provides functional validation of variants predicted to alter splicing
2026-03-06 neurology 10.64898/2026.03.02.26346984
Top 3% (1.5%)

Background: Whole-genome sequencing (WGS) has improved the diagnosis of rare genetic disorders, yet interpretation of non-coding variants that affect splicing remains challenging. In silico predictions alone are insufficient, and short-read RNA sequencing may fail to capture complex or low-abundance splicing events. Targeted amplicon-based long-read RNA sequencing (Amp-LRS) offers a cost-effective approach for functional validation of candidate splice-altering variants. Methods: We applied Amp-LRS...

7
Novel Genetic Locus Associated with Resistance to M. tuberculosis Infection: A Multi-Ancestry Genome-Wide Association Study
2026-03-07 infectious diseases 10.64898/2026.03.06.26347614
Top 3% (1.5%)

Understanding host susceptibility to Mycobacterium tuberculosis (Mtb) is critical for the development of new vaccines. Certain individuals "resist" becoming infected with Mtb despite intensive exposure; however, it is unknown whether there is a genetic basis for "resistance" to Mtb infection across populations. Here we conducted a genome-wide association study (GWAS) of resistance to Mtb infection by carefully characterizing exposure to TB patients among 4,058 close contacts in India, Brazil, an...

8
A Common Missense Variant, W335S, in β2-Glycoprotein I (APOH) is Associated with Increased Autoantibody Levels but Reduced Venous Thromboembolism Risk
2026-03-05 rheumatology 10.64898/2026.03.04.26347632
Top 3% (1.4%)

Anti-β2-glycoprotein I (anti-β2GPI) antibodies are central to the pathogenesis of antiphospholipid syndrome (APS), an autoimmune disease characterized by a strong predisposition to venous thromboembolism (VTE). In this study, we conducted a multi-ancestry genome-wide association study (GWAS) of quantitative total anti-β2GPI levels in 5,969 participants enrolled in the Multi-Ethnic Study of Atherosclerosis (MESA) and identified a genome-wide significant association at the APOH locu...

9
Outburst of serotype 4 IPD after COVID-19 is driven by ST15063/GPSC162 lineage associated with high-risk behaviors and greater virulence linked to influenza H3N2 virus coinfection and cigarette smoke
2026-03-04 infectious diseases 10.64898/2026.02.27.26346872
Top 4% (1.3%)

The emergence of vaccine-covered serotypes causing invasive pneumococcal disease (IPD) is a serious concern worldwide. We investigated the unexpected rise of serotype 4 causing IPD primarily in non-vaccinated young adults after the COVID-19 pandemic that further spread to adults ≥65 years in recent years. For this purpose, we conducted a retrospective study of serotype 4 IPD cases (n=827) reported in Spain between 2009 and 2024. Whole-genome sequencing was performed to assess clonal lineag...

10
DIA-PINN. A physics-informed machine learning method to estimate global intrinsic diastolic chamber properties of the left ventricle from pressure-volume data
2026-03-06 cardiovascular medicine 10.64898/2026.03.02.26347245
Top 4% (1.3%)

Background: Pressure-volume (PV) loop analysis remains the gold standard for assessing the intrinsic global diastolic properties of the left ventricle (LV). Traditional fitting techniques rely on local, phase-constrained fittings and are limited due to their sensitivity to noise, landmark selection, violation of assumptions, and non-convergence. Objective: To develop and validate DIA-PINN, a physics-informed neural network (PINN) framework capable of calculating intrinsic diastolic properties of ...

11
Cancer genomic profiling predicts pathogenicity of BRCA1 and BRCA2 variants
2026-03-06 genetic and genomic medicine 10.64898/2026.03.05.26347746
Top 4% (1.3%)

Accurate classification of BRCA1 and BRCA2 variants is essential for cancer risk assessment and therapy selection, yet over one-third remain variants of uncertain significance (VUS). Here, using 120,660 real-world cancer genomic profiles with BRCA1 or BRCA2 variants from a >800,000-sample cohort, we develop machine learning models that predict pathogenicity using clinical and tumor-derived features, including a pan-cancer homologous recombination deficiency signature, co-mutated genes, zygosity,...

12
Molecular characterisation of a Klebsiella pneumoniae neonatal sepsis outbreak in a rural Gambian hospital: a retrospective genomic epidemiology investigation
2026-03-04 genetic and genomic medicine 10.64898/2026.03.03.26347025
Top 5% (1.2%)

Background: Klebsiella pneumoniae is a common cause of neonatal sepsis in Africa, and is frequently hospital acquired. We recently reported an outbreak of multidrug-resistant K. pneumoniae sepsis amongst neonates at a rural hospital in The Gambia, West Africa, involving 57 cases and case fatality of 60%. Here we undertook a retrospective pathogen genomic epidemiology study of clinical and environmental K. pneumoniae isolated during the outbreak, to identify the outbreak strain, refine the epidemic...

13
Semaglutide alters the human embryo-endometrium interface
2026-03-07 obstetrics and gynecology 10.64898/2026.03.03.26347354
Top 5% (1.2%)

The use of semaglutide (SE), a glucagon-like peptide-1 receptor agonist (GLP-1RA) with glucose-lowering and weight-loss effects, has risen rapidly, particularly among women of reproductive age. While preclinical studies suggest benefits for ovarian function via the hypothalamic-pituitary-ovarian axis, its impact on the endometrial-embryo interface remains unclear. Here, we show that GLP-1R is dynamically expressed in fertile human endometrium, restricted to epithelial cells and markedly upregula...

14
Genetic liability to hip osteoarthritis confers neurovascular protection against Alzheimer's disease despite depression-mediated phenotypic comorbidity
2026-03-04 genetic and genomic medicine 10.64898/2026.03.04.26347509
Top 6% (1.2%)

Background: The relationship between hip osteoarthritis (hip OA) and Alzheimer's disease (AD) presents a critical paradox within the emerging "bone-brain axis": widespread phenotypic comorbidity sharply contradicts evolutionary theories of biological antagonism. This study integrates longitudinal and multi-omic analyses to determine whether this clinical overlap masks an underlying genetic neuroprotection. Methods: We analyzed longitudinal phenotypic data from 261,767 UK Biobank participants using C...

15
Air pollution exposure in Generation Scotland: molecular fingerprints and health outcomes
2026-03-04 epidemiology 10.64898/2026.03.04.26347573
Top 6% (1.1%)

Ambient air pollution has been associated with increased incidence of chronic disease and is estimated to contribute towards 4.2 million early deaths annually. Whilst the health impacts are well described, less is understood about the underlying biological mechanisms, particularly when considering the co-occurrence of multiple pollutants. Using an atmospheric chemistry transportation model (EMEP4UK), we generate pre-baseline sampling pollution exposure estimates for eight pollutants in Generatio...

16
FA-NIVA: A Nextflow framework for automated analysis of Nanopore based long-read sequencing data for genetic analysis in Fanconi anemia
2026-03-04 genetic and genomic medicine 10.64898/2026.02.27.26346867
Top 6% (1.1%)

Motivation: Fanconi anemia (FA) is a rare disease mainly caused by biallelic pathogenic variants, including structural variants such as large deletions and insertions in FA genes. Currently, variant detection is based on short-read sequencing and probe-based approaches. However, determining the exact genomic breakpoint or achieving allelic discrimination remains challenging. Nanopore-based long-read sequencing enables a comprehensive detection of FA variants, but a unified bioinformatic analysis p...

17
Intelligent Guidance and Diagnostic Assistance for Handheld Ultrasound: Actor-Critic Based Approach for Carotid Artery and Thyroid Examination
2026-03-04 radiology and imaging 10.64898/2026.03.02.26347395
Top 7% (1.1%)

Handheld ultrasound devices have revolutionized point-of-care diagnostics, but their effectiveness remains limited by operator dependency and the need for specialized training. This paper presents an intelligent guidance and diagnostic assistance system for the handheld wireless ultrasound device, enabling automated carotid artery and thyroid examinations through handheld operation. Drawing inspiration from the Actor-Critic framework, we implement a simulation-based reinforcement learning approa...

18
Gene Portals: A Framework for Integrating Clinical, Functional, and Structural Evidence into Rare Disease Variant Classification
2026-03-06 genetic and genomic medicine 10.64898/2026.03.05.26347086
Top 7% (1.0%)

Rare Mendelian disorders affect 300-400 million people globally. Although genetic testing has become widely adopted, gene-specific evidence for tailored variant interpretation remains scattered across resources. We present Gene Portals, a framework for gene-centered multimodal knowledge bases that co-localize expert-harmonized clinical data, functional assays, population variation, structural annotations and gene-specific ACMG/AMP specifications within a single resource. A modular interface inte...

19
Automated Phenotyping of Mitral Stenosis Using Deep Learning
2026-03-04 cardiovascular medicine 10.64898/2026.03.03.26347557
Top 7% (1.0%)

Background and Aims: Accurate classification of mitral stenosis (MS) remains a significant clinical challenge. This study aimed to develop an artificial intelligence (AI) framework to automatically detect clinically significant MS from echocardiography. Methods: We developed EchoNet-MS, an open-source end-to-end integrated approach combining video-based convolutional neural networks to assess MS severity and differentiate rheumatic etiology from echocardiography and validated its performance across...

20
HIPK4 is a novel gene associated with teratozoospermia and male infertility
2026-03-04 sexual and reproductive health 10.64898/2026.03.04.26346694
Top 7% (0.9%)

STUDY QUESTION: Are pathogenic variants in Homeodomain-interacting protein kinase (HIPK4) associated with sperm head abnormalities causing male infertility? SUMMARY ANSWER: HIPK4 is a novel candidate gene associated with sperm head defects and human male infertility. WHAT IS KNOWN ALREADY: Numerous genes causing male infertility due to Multiple Morphological Abnormalities of the sperm flagella (MMAF) have been described but the genetic basis of sperm head defects is less well understood. STUDY DESI...