Back

Genomics

Elsevier BV

Preprints posted in the last 90 days, ranked by how well they match Genomics's content profile, based on 60 papers previously published here. The average preprint has a 0.08% match score for this journal, so anything above that is already an above-average fit.

1
Computational predictions and evolutionary analysis of LrK10 kinase-related putative PSTOL1 gene homeologs in wheat and orthologs of its wild relatives.

Thiyagarajan, K.; Pierre, C. S.; Kumar, C.; Sanyal, D.; Thakur, G.; Singh, D.; Thakur, D.; Tomar, A.; Vikram, P.; Valluru, R.

2026-02-13 genomics 10.64898/2026.02.12.702618 medRxiv
Top 0.1%
8.3%
Show abstract

Phosphorus Starvation Tolerance 1 in rice (OsPSTOL1, known as Phosphorus uptake 1, Pup1) is a receptor-like cytoplasmic protein kinase that confers tolerance to phosphorus deficiency. The OsPSTOL1 gene possesses a Ser/Thr kinase and shows high amino-acid sequence similarity with the leaf rust receptor-like kinase (OsLrK10). We hypothesize that the putative wheat TaPSTOL1 and TaLrK10 have a common ancestral origin and that putative TaPSTOL1 diverged recently acquiring new structural modifications and biological functions in the process. In this study, we identified all putative TaPSTOL1 homeologs and examine the evolutionary relationship between TaPSTOL1 and TaLrK10 in Triticum species. Our results indicate that the putative TaPSTOL1 diverged recently without possessing the amino-terminal domain, which is a typical characteristic of TaLrK10. We observed numerous conversions tracts between these two genes and the substitution pattern of randomly selected amino acids indicates that dynamic selection pressures acted on both genes. The putative TaPSTOL1 shows high nucleotide diversity compared to TaLrK10 within Triticum species. Further, a multiple-sequence analysis reveals that the third exon of TaLrK10 appears to have been duplicated and diverged as a putative single-exon based TaPSTOL1 in bread wheat. Overall, our comparative analysis indicates that both TaPSTOL1 and TaLrK10 appears to have diverged from a common ancestor, acquiring distinct structural organizations and biological functions.

2
Genome-wide analysis of Heavy metal ATPase (P1B-type ATPase) gene family in Mung bean and their expression analysis under heavy metal (Zn, Cd and Cu) stress

Panigrahi, J.; Panigrahy, D.; Rath, B.; Gupta, K.

2026-03-27 molecular biology 10.64898/2026.03.25.713876 medRxiv
Top 0.1%
6.8%
Show abstract

Heavy metal ATPases (HMAs) are important group of transmembrane proteins involved in homeostasis of metal ions in plant systems. In this study, a comprehensive analysis of genome assembly (VC1973A v7.1) resulted in the identification of nine HMA genes (VrHMA) and their corresponding proteins in Mungbean, an agronomically important legume crop known for its nutritional values. VrHMA proteins were also characterized based on their biomolecular features, conserved domains and motifs arrangement, transmembrane helices, pore-line helices, subcellular location and occurrence of signal peptides. Based on sequence homology, nine VrHMAs were clustered into two major substrate-specific groups: VrHMA1, VrHMA5 and VrHMA7 were categorized under the Zn/Co/Cd/Pb ATPase group, whereas the remaining six VrHMAs belong to the Cu/Ag subgroup. Gene structure analysis and promoter scanning revealed the structural divergence and presence of various stress-responsive cis-acting elements, respectively. The expression analysis of VrHMA genes in root and leaf tissues, in response to heavy metal (Zn, Cd and Cu) stress, indicates their role in the uptake, transport and sequestration of metal ions. Interestingly, VrHMA5 showed incremental upregulation in roots in response to all three heavy metal stresses, whereas its expression was only upregulated in the leaf tissues under Zn stress, which indicates its role in vascular transport in V. radiata. In addition, this study provides valuable insights into the functional roles of VrHMA genes and will lay a foundation for future genetic improvement in mung bean aimed at enhanced heavy metal stress tolerance and micronutrient homeostasis.

3
Two domesticated species of rice shaped the population structure of Xanthomonas oryzae pv. oryzae in Africa

Quibod, I. L.; Sciallano, C.; Auguy, F.; Brottier, L.; Dereeper, A.; Diagne, D.; Diallo, A.; Doucoure, H.; Mayaki, S. I.; Keita, I.; Konate, L.; Tall, H.; Tekete, C.; Zougrana, S.; Hutin, M.; Koita, O.; Kone, D.; Sarra, S.; Verdier, V.; Wonni, I.; Szurek, B.; Cunnac, S.; Perez-Quintero, A. L.

2026-02-20 genomics 10.64898/2026.02.20.706980 medRxiv
Top 0.1%
6.3%
Show abstract

African rice (Oryza glaberrima) was independently domesticated in West Africa around 3000 years ago, and has long been intertwined in the history of the region. The gradual replacement of African rice by Asian rice (Oryza sativa), which was introduced when European settlers arrived, has since dominated rice cultivation in Africa. Domesticated rice species are affected by bacterial leaf blight (BLB), which is caused by the pathogen Xanthomonas oryzae pv. oryzae (Xoo). Here we provide evidence that the bacterial leaf blight pathogen in Africa (AfXoo) belongs to a distinct phylogroup from the one circulating in Asia (AsXoo), and has a different evolutionary history. Analysis of 88 AfXoo genomes identified five groups, one of which is a highly diverse population that might have probably given rise to three independent clonal populations based on multiple genetic tests. Tip-dating analysis revealed that the emergence and expansion of AfXoo coincided with the rise and fall of African rice nearly a thousand years ago, and O. sativa served as a bottleneck in the evolution of AfXoo over time. Although the type III effectors (T3E), proteins that are secreted by the pathogen to evade host resistance or seize control of host nutrients, are highly conserved in AfXoo, we observed some variation in effector families. Different evolutionary modifications in the transcription activator-like effectors (TALEs), especially in repeat variable di-residues (RVDs), likely enabled adaptation to both host species. Previous analyses carried out on samples collected in Burkina Faso have shown that there could be more than one TALE repertoire combination in the field, and genome sequencing data revealed potential TALE evolutionary mechanisms that could happen. Our research provides a comprehensive genetic history of bacterial blight in West Africa, and its past and present impact on rice cultivation in the region. Author summaryFor thousands of years, rice cultivation has been an integral part of African agriculture. However, the cultivation of the locally domesticated African rice cultivar (Oryza glaberrima) has been gradually shifted towards Asian rice varieties (Oryza sativa), which has affected the adaptation of the native pathogen population. One of these pathogens is the causal agent of bacterial leaf blight, Xanthomonas oryzae pv. oryzae (Xoo). Here we performed a population genomics approach to understand the evolutionary history and virulence spectrum of African Xoo (AfXoo), a unique phylogroup within the Xanthomonas oryzae species. Our results suggest that AfXoo were first adapted to African rice at least a thousand years ago. The introduction of O. sativa has shaped the recent population dynamics of AfXoo. TALEs are tightly conserved in AfXoo with multiple sequence variations unique to different populations, which could be explained by different evolutionary forces acting upon both domesticated rice. Our results highlight the interplay between crop domestication and cultivation and pathogen evolution.

4
RNA-binding proteins and regulatory networks involved in life-stage, stress temperature, and drug resistance in Leishmania parasites

Martinez-Hernandez, J. E.; Aliaga Tobar, V.; Hidalgo-Cabrera, A.; Requena, J. M.; Monte-Neto, R.; Maracaja-Coutinho, V.; Martin, A. J. M.

2026-02-17 genomics 10.64898/2026.02.16.706094 medRxiv
Top 0.1%
6.2%
Show abstract

Trypanosomatid parasites of the genus Leishmania rely on post-transcriptional regulation of gene expression because gene transcription is not canonically regulated at the single-gene level. RNA-binding proteins (RBPs) are central to this regulatory architecture, yet their genome-wide diversity, conservation, and condition-specific associations remain incompletely defined across the genus. Here, we combined comparative genomics and systems-level transcriptomic analyses to map the RBP repertoire in 33 strains spanning 19 Leishmania species. We also connect RBPs to developmental, stress, and drug-resistance contexts, using Leishmania braziliensis as an example. Using probabilistic domain detection and homology-based annotation, we identified 38,662 putative RBPs across all genomes, with 1,114 to 1,491 RBPs per genome. Comparative genomics analysis revealed a conserved core of 404 RBP clusters shared across all examined species, alongside lineage-restricted clusters in major Leishmania groups. We notably detected widespread conservation of enzymes linked to messenger RNA modification (for example, NAT10, TRMT6/61A, and NSUN2), but failed to identify canonical N6-methyladenosine writer orthologs, suggesting divergence of this machinery across Leishmania genomes. In L. braziliensis, expression profiling in different life-cycle stages and stress conditions highlighted stage-biased RBPs, including elevated ZC3H20 in amastigotes and increased RBP6 in metacyclic promastigotes. Finally, co-expression network analysis of trivalent antimony (SbIII) resistant genotypes identified RBPs co-expressed with genes previously associated with antimony resistance. In contrast, motif-based analysis supported a putative DRBD3-centered post-transcriptional module that includes 10 candidate stabilized transcripts in SbIII-resistant promastigotes. Together, these results provide a comparative framework to prioritize RBPs and associated regulatory programs implicated in parasite adaptation and antimony resistance. AUTHOR SUMMARYLeishmaniasis results from infection by Leishmania parasites and remains challenging to control due to limited treatments and the continual emergence of drug resistance, especially to antimonials. These parasites exhibit an unusual gene regulation method; instead of activating or deactivating transcription for individual genes, they depend heavily on proteins that bind RNA to determine which messages are retained, translated, or degraded. In this study, we created a comprehensive atlas of RNA-binding proteins across the Leishmania genus by analyzing 33 parasite strains from 19 species. We identified a large core set of conserved RNA-binding proteins shared by all species, along with lineage-specific proteins that may help different parasite groups adapt to various hosts and environments. Additionally, we mapped enzymes responsible for RNA chemical modifications and found that the typical machinery for m6A marks in many organisms appears to be divergent in Leishmania. Finally, by associating RNA-binding proteins with specific life-cycle stages, stress responses, and antimony resistance in L. braziliensis, we pinpointed candidate regulators and gene modules for further experimental validation. This resource helps prioritize regulatory factors that could drive parasite adaptation and resistance.

5
Genetic and heat-stress related environmental influences on pig whole-blood gene expression levels

Durante, A.; Feve, K.; Naylies, C.; Labrune, Y.; Gress, L.; Lippi, Y.; Legoueix, S.; Milan, D.; Gourdine, J.-L.; Gilbert, H.; Renaudeau, D.; Riquet, J.; Devailly, G.

2026-03-18 genomics 10.64898/2026.03.17.712411 medRxiv
Top 0.1%
5.0%
Show abstract

BackgroundGene expression levels are affected by genetics and environmental effects. However, quantification of the influence of genetics and environmental effects on gene expression remains limited, especially in farm animals. Here, the relative influence of genetic and heat-related environmental variations on gene expression levels was investigated in pigs, using a backcross herd of diverse heat adaptation levels. Backcross animals were raised in either a tropical or temperate environment. Animals raised in temperate environment were subjected to an experimental heat stress at the end of their growth. ResultsWe identified 1,967 differentially expressed genes (DEGs) between pigs raised in the tropical (n = 181) and temperate (n = 180) facilities, and 472 DEGs throughout a 3 weeks experimental heat stress. Transcriptome-wide association (TWAS) study identified 139 associations between gene expression levels and thermoregulation/production traits. We detected 6,014 expression quantitative trait loci (eQTLs) associated with the expression level of 3,297 genes. Genetic variance was estimated to explain 36.3% of gene expression variance on average, and was the main source of variance for 27.7% of transcripts. Most eQTLs found are located in proximal regions (cis-eQTLs) and few within distal regions (trans-eQTLs) to their assigned genes. A trans-eQTL hotspot highlighted a hematopoietic mechanism driven by GPATCH8. An integration of GWAS and TWAS pointed to TMCO1 and ZNF184 as candidate genes for backfat thickness. ConclusionsThis study provides a better understanding of the impact of climate, heat stress and genetic influences on the pig whole blood transcriptome.

6
Comparison of High-Resolution Computed Tomography Patterns in Adult with Cystic Fibrosis and Non- Cystic Fibrosis Bronchiectasis in a South Asian Country Bangladesh: A Retrospective Cross-sectional Study

Majumder, S.; Biswas, P.; Chakrabortty, R.; Ahmed, S.; Rahman, M. A.

2026-03-10 respiratory medicine 10.64898/2026.03.09.26347994 medRxiv
Top 0.1%
4.9%
Show abstract

BackgroundBronchiectasis in adults often goes undiagnosed following the routine assessment. Cystic Fibrosis (CF) is usually diagnosed during childhood, but some cases are identified in adulthood when disease is mild. High-resolution computed tomography (HRCT) of chest may offer structural information that can indicate CF as an underlying etiology. ObjectiveTo compare the HRCT features of adult patients with CF and non-CF bronchiectasis and to determine the radiologic features that may be suggestive of CF. MethodsThis retrospective, analytical, cross-sectional study was carried out in Bangladesh Medical University after IRB clearance. Total 130 adults (12 with CF and 118 with non-CF bronchiectasis) of both sexes, whose bronchiectasis was confirmed by chest HRCT were included. Imaging findings were assessed based on Reid morphological classification, anatomical distribution and extent of spread within the lungs, and their association was tested using chi-square test with statistical significance of p<0.05. ResultsCystic bronchiectasis was more common in CF than non-CF patients (83.3% vs 29.7; p<0.001). Mixed central-peripheral extension had been found a considerable associated with CF (66.7% vs. 42.4; p=0.034). There was no statistically significant difference in right lung lobar distribution (p=0.540) but combined upper and lower lobe involvement on the left side was more common in CF patients (54.5%) than non-CF patients (21.3) (p=0.054). ConclusionAdult CF had unique chest HRCT imaging characteristics when compared to non-CF bronchiectasis, especially cystic morphology and mixed extension. Identification of such features could help physician in the early diagnosis and selection of treatment strategy.

7
The genetic basis of resistance to pathogens in rainbow trout: a meta-QTL analysis

Rodriguez-Vazquez, R.; Karami, A. M.; Robledo, D.; Buchmann, K.

2026-02-14 genomics 10.64898/2026.02.13.705103 medRxiv
Top 0.1%
4.8%
Show abstract

Rainbow trout is affected by a broad range of pathogens causing large economic losses and animal welfare concerns. Marker-assisted selection can significantly enhance resistance to pathogens in a few generations, and to this end many studies have focused on identifying quantitative trait loci (QTLs) for resistance traits. The integration of accumulated genetic resources provides an opportunity to uncover important genetic variation and candidate genes crucially involved in rainbow trout immunity. Here, we present a comprehensive meta-QTL (MQTL) analysis based on the integration of 145 QTLs related to pathogen resistance. These QTLs were refined into 26 MQTLs, of which 15 were validated by genome-wide association studies (GWAS). The average confidence interval (CI) of these MQTLs was reduced by 2.03-fold compared to the initial QTL, improving mapping precision. Integration of GWAS results revealed regions along the rainbow trout genome pivotal for pathogen resistance, and a major region in chromosome 3, which could be used in marker-assisted selection. Further, among the validated MQTLs we identified a subset of high-confidence MQTLs, based on those supported by at least three initial QTL from more than two independent studies, with a percentage of variance explained greater than 8% and a LOD score higher than three. Gene annotation identified 11 unique candidate genes within these high-confidence MQTLs involved in immune pathways, encoding proteins involved in the regulation of immune responses, signalling pathways, receptor activity, and direct immune effector production. The MQTLs and candidate genes identified are valuable resources for advancing molecular breeding and unravelling the genetic basis of pathogen resistance in rainbow trout.

8
Host genetics and pre-vaccination blood transcriptome as determinants of vaccine-induced immunity to Influenza A virus in swine

Blanc, F.; Maroilley, T.; Lemonnier, G.; Ravon, L.; Billon, Y.; BOUCHEZ, O.; Pinard van der Laan, M.-H.; ESTELLE, J.; Rogel-Gaillard, C.

2026-02-07 genomics 10.64898/2026.02.06.700797 medRxiv
Top 0.2%
3.9%
Show abstract

Influenza A virus (IAV) is a major respiratory pathogen in pigs, causing diseases that have significant economic and potential public health consequences. Vaccine effectiveness varies among animals, impacting long-term herd protection due to individual variabilities in antibody levels and persistence over time. Our aim was to identify the genetic factors and pre-vaccination blood transcriptomic profiles that influence immune response levels to the IAV vaccine. A total of 187 piglets were vaccinated at weaning (28 days of age, 0 days post-vaccination, dpv) and boosted three weeks later, and humoral responses were assessed until slaughter (21, 28, 35, and 118 dpv) by measuring serum IAV-specific IgG and hemagglutination inhibition (HAI) titers. The results revealed varying antibody responses and persistence. Genome-wide association studies identified two loci on chromosomes SSC5 and SSC8 associated with a persistence of HAI titers until slaughter. Pre-vaccination blood transcriptomic analyses showed that early and post-boost antibody responses (21, 28 and 35 dpv) and long-term persistence (118 dpv) were associated with distinct baseline immune programs, with extracellular matrix and myeloid-related signatures predicting strong early and peak responses, whereas interferon-related signatures were linked to reduced long-term antibody persistence. Our results highlight the importance of considering the role of immune competence and genetics in vaccine responses in pigs and suggest candidate biomarkers to improve vaccination strategies within breeding programs.

9
Genome-wide annotation and analyses of bifunctional genes in the human genome

Insan, J.; Menon, M. B.; Dhamija, S.

2026-01-29 genomics 10.64898/2026.01.28.702170 medRxiv
Top 0.3%
3.9%
Show abstract

Conventional gene annotation pipelines classify eukaryotic genes into protein-coding and non-coding. Alternative splicing may generate non-coding transcript variants from protein-coding genes, that are expressed in tissue- or disease-specific manner. We and others have described the genes which transcribe both coding and non-coding transcripts as bifunctional genes. Here we present a genome-wide analyses of bifunctional genes and reannotate the genes in the human genome reference assembly into coding, non-coding and bifunctional. We identify over 4000 "bifunctional genes" in the human genome, constituting approximately 10% of the transcribed genes, and present evidence that these genes are conserved in evolution and their number correlate well with genome size and complexity. These genes are enriched in gene sets involved in vesicular transport, autophagy, RNA/DNA binding, glycosylation and splicing. By monitoring the expression of non-coding exons in long-read sequencing datasets and by quantitative RT-qPCR, we provide evidence for the expression of non-coding variants from bifunctional genes. The ncRNA transcripts from these genes might have similar or different roles from their cognate mRNA counterparts. They may act as miRNA sponges or harbour non-canonical open-reading frames that encode microproteins, while also competing for binding with RNA-binding proteins. We present evidence for establishing potential biological functions of bifunctional genes and summarise the findings in a searchable database. Further studies and functional characterization focused on this special group of genes may reveal interesting gene regulatory mechanisms relevant to physiology and pathology.

10
MicroRNA modulation of viral nervous necrosis resistance in European seabass

Rodriguez-Vazquez, R.; Mukiibi, R.; Ferraresso, S.; Franch, R.; Peruzza, L.; Rovere, G. D.; Radojicic, J.; Babbucci, M.; Bertotto, D.; Toffan, A.; Pascoli, F.; Penaloza, C.; Houston, R. D.; Tsigenopoulos, C. S.; Bargelloni, L.; Robledo, D.

2026-02-18 genomics 10.64898/2026.02.16.706017 medRxiv
Top 0.3%
3.6%
Show abstract

MicroRNAs (miRNAs) are key post-transcriptional regulators of antiviral immunity, controlling gene expression by targeting 3 UTRs of immune-related transcripts. Despite their importance, the role of miRNAs in viral nervous necrosis (VNN) resistance in European seabass (Dicentrarchus labrax) is unexplored. Here, we characterized for the first time the brain miRNome of seabass from three VNN-resistance genotypes (susceptible, intermediate, resistant) across two genetically distinct seabass clusters. Differential expression analyses revealed cluster-specific patterns, with susceptible fish consistently showing overexpression of the differently expressed miRNAs (DEmiRNAs) as compared to the resistant fish. Considering the two genetic clusters in the study, miR-199-5p was differentially expressed between the VNN susceptible and resistant fish. This miRNA was found to be less expressed in the resistant individuals. Functional characterization of the miRNA predicted that it binds to two distinct miRNA recognition elements (MREs) within the ifi27l2a 3 UTR. These MREs flank a SNP (Chr3:10,082,380) previously associated with VNN survival. A strong negative correlation (r= -0.840) between miR-199-5p expression and ifi27l2a mRNA abundance further supports a post-transcriptional repression mechanism. Together, these results propose a regulatory model in which miR-199-5p modulates ifi27l2a expression, contributing to phenotypic variation in VNN resistance and positioning it as a promising biomarker for seabass aquaculture breeding.

11
Heat Stress Induces Locus-Specific DNA Hypomethylation Linked to Immune Regulation in Lactating Holstein Cows

Costa Monteiro Moreira, G.; Ruiz Gonzalez, A.; Joigner, M.; Costes, V.; Chaulot-Talmon, A.; Ali, F.; Bourgeois-Brunel, L.; Jammes, H.; Rico, D. E.

2026-03-26 genomics 10.64898/2026.03.23.713208 medRxiv
Top 0.4%
3.6%
Show abstract

Epigenetics may play a crucial role in livestock adaptation to environmental challenges like heat stress. In recent years, a growing number of studies have investigated the epigenetic mechanisms underlying dairy cow adaptation to heat stress. However, there is still limited knowledge about the effects of heat stress on immune cells and immune-related phenotypes. Herein we aim to identify heat-stress induced DNA methylation variations on blood methylome potentially affecting regulatory regions and associated phenotypes. Blood samples were collected and peripheral blood mononuclear cell (PBMC) isolated from four cows before (D0) and after (D14) a 14-d heat stress challenge (cyclical THI 72-82) and, from four cows kept in thermoneutral conditions (THI 61-64). Heat-stressed cows had ad libitum access to diets supplemented with adequate levels of vitamin D and Ca (12,000 IU/kg of vitamin D and 0.73% Ca, respectively). To eliminate confounding effects due to differences in nutrient intake, cows maintained under thermoneutral conditions were pair-fed (PF) to their heat-stressed counterparts and received adequate concentrations of vitamin D and Ca as well. Reduced representation bisulphite sequencing (RRBS) was used to profile PBMCs methylome. Differential methylation analysis was performed using methylKit and DSS softwares ({Delta}meth [&ge;] 25%, adjusted p-value < 0.01), retaining only commonly detected differentially methylated cytosines (DMCs). A total of 2,908 DMCs were identified when comparing pre- and post-heat stress samples. After excluding 649 DMCs that were also detected under thermoneutral conditions, as these changes were likely associated with feed restriction inherent to the pair-feeding design rather than with heat stress per se, 2,259 heat stress-specific DMCs remained, predominantly hypomethylated. About half of the DMCs are annotated in intronic and intergenic regions; known to harbor regulatory elements. By intersecting the DMRs with publicly available functional annotation data, we observed hypomethylation on regulatory regions putatively affecting cows immune system. As an example, we identified a loss of methylation within an enhancer region of the MSN gene, which is involved in lymphocyte homeostasis, and a loss of methylation in the promoter region of MECP2, a well-established epigenetic regulator with a central role in chromatin organization and gene expression. These findings highlight the impact of heat stress on dairy cow immunity and provide new insights into its epigenetic regulation under environmental stress. Interpretative summaryThis study examined DNA methylation changes induced by heat stress in dairy cows to elucidate epigenetic mechanisms of thermal adaptation. Using RRBS on PBMCs, 2,259 heat stress-specific differentially methylated cytosines were identified, predominantly hypomethylated and enriched in regulatory regions. Functional annotation highlighted immune-related pathways, including hypomethylated regulatory regions near genes (e.g., MSN, ZBTB33, SLC25A5, GNAS, FAM3A, and MECP2) associated with immune function. These findings indicate that heat stress induces targeted epigenetic modifications potentially affecting immune regulation in dairy cows.

12
The histone code of love: epigenetics of maturation of gonads in the human blood fluke Schistosoma mansoni

Grunau, C.; Lu, Z.; Coghlan, A.; Moescheid, M.; Quack, T.; Chaparro, C.; Aunin, E.; Allienne, J.-F.; Reid, A.; Holroyd, N.; Berriman, M.; Padalino, G.; Hoffmann, K. F.; Grevelding, C.; AUGUSTO, R. D. C.

2026-01-26 developmental biology 10.64898/2026.01.23.701327 medRxiv
Top 0.4%
3.5%
Show abstract

Schistosoma mansoni is a parasitic flatworm that has two, genetically determined, sexes. We used aggregated data of 8 posttranslational histone modifications (ChIP-Seq), chromatin accessibility (ATAC-Seq), transcription (RNA-Seq) and genome feature annotations to decipher the histone code of genes involved in the differentiation of schistosome gonads (i.e. female ovaries and male testes). We show schistosome gonads express at least two classes of protein coding genes: H3K4me3-positive genes that display canonical features of eukaryotic protein-coding genes such as peaks of H3K4me3 at the transcription start sites (TSS) and increases in histone acetylation marks towards the transcription end site (TES), but also a non-canonical H3K9/27me3 plateau just upstream of the TSS. H3K4me3 enrichment at the TSS is highly predictive for transcription strength in these genes compared to a second class of protein coding genes (H3K4me3-negative genes) that do not display this pattern and is characterised by absence of the investigated histone marks at TSS and TES. This is indicative of the existence of hitherto unknown, potentially schistosome-specific histone marks in these genes. The absence of H3K4me3 at the TSS is not associated with inducible or stable gene expression in the gonads. Instead, gene ontology analysis indicates that H3K4me3-positive genes are related to functions which typically govern processes such as metabolism or signal transduction while H3K4me3-negative genes are dedicated to cell communication or immune responses. Second, individual histone modifications and their combinations are associated with functional features of the schistosome genome, known as "chromatin colours". In H3K4me3-positive genes, there is clear co-linearity of 3 colours, which strongly suggests a functional role for histone modifications in the control of transcription pre-initiation, promotor release, and transcription termination. Third, there are striking chromatin structure changes during maturation of the gonads in all genomic features including protein-coding and non-protein coding genes as well as repetitive sequences. The nature of these changes is different in both sexes. H3K36me3 and H3K9me3, as well as H3K23ac and H3K9ac show the strongest variations. Last, we show that pharmacological inhibition of histone demethylation activity by IOX1 leads to a concentration-dependent separation ("divorce") of schistosome couples confirming the importance of H3K36/H3K9 methylation for pairing maintenance and indicating histone demethylases as a potential drug target family. Collectively, our findings offer unprecedented insights into histone codes and chromatin dynamics governing the reproductive development of S. mansoni gonads.

13
Integrative Identification and Characterization of PCOS-Associated lncRNAs From the Interface of Genetic Association, Transcriptomics, and Gene Structure Evolution

He, Z.; Li, Y.; Shkurat, T. P.; Butenko, E. V.; Derevyanchuk, E. G.; Lomteva, S. V.; Chen, L.; Lipovich, L.

2026-04-02 genomics 10.64898/2026.03.31.715548 medRxiv
Top 0.4%
3.3%
Show abstract

BackgroundPolycystic ovary syndrome (PCOS) is a prevalent endocrine disorder and a leading cause of female infertility, with complex genetic, metabolic, and hormonal etiologies. Long non-coding RNAs (lncRNAs) have emerged as important regulators of diverse biological processes, yet their roles in PCOS remain underexplored. Here, we identified and characterized PCOS differentially expressed gene-associated lncRNAs (PDEGAL) with an integrative approach combining expression data, genetic association, and evolutionary analysis. MethodsThirty-three PCOS-associated protein-coding genes were obtained from our prior study, and all their nearby and overlapping lncRNAs were annotated. These candidates were analyzed using UCSC Genome Browser-mapped annotations and datasets, including NCBI RefSeq, GENCODE, GTEx, GWAS SNPs, and conservation, as well as the FANTOM5 cap analysis of gene expression (CAGE) promoter data, to assess their expression, regulatory potential, genetic variant overlaps, and evolutionary conservation. ResultsTwenty-three PDEGALs (18 antisense to, and 5 sharing bidirectional promoters with, known PCOS-associated protein-coding genes) were identified. 17 PDEGALs contained GWAS SNPs with statistically significant disease associations, 9 of which were associated with PCOS-related traits. 5 PDEGALs demonstrated expression in the KGN granulosa cell model of PCOS. Key gene structure element (KGSE) analysis revealed that most PDEGALs are primate-specific. Integrating four criteria--GTEx expression, GWAS SNPs, FANTOM promoterome, and KGSE conservation--highlighted HELLPAR as the only lncRNA fulfilling all four, while five others--PGR-AS1, MTOR-AS1, ENSG00000265179, ENSG00000256218, and LOC105377276--fulfilled three of the four criteria. ConclusionsWe have systematically identified candidate PCOS regulatory lncRNAs with convergent genetic, expression, and evolutionary evidence. These results provide a framework for functional validation and highlight lncRNAs as potential biomarkers and therapeutic targets in PCOS that function by regulating their nearby and overlapping protein-coding genes.

14
Computational Prediction of Plasmodium falciparum Antigen-T-cell Receptor Interactions via Molecular Docking: Implications for Malaria Vaccine Design

Kipkoech, G.; Kanda, W.; Irungu, B.; Nyangi, M.; Kimani, C.; Nyangacha, R.; Keter, L.; Atieno, D.; Gathirwa, J.; Kigondu, E.; Murungi, E.

2026-03-20 bioinformatics 10.64898/2026.03.18.712575 medRxiv
Top 0.4%
3.1%
Show abstract

Malaria is one of the deadliest diseases in sub-Saharan Africa and Southeast Asia. The majority of the fatalities occur mostly in children under 5 years and pregnant women and this is due to infection by Plasmodium spp, of which Plasmodium falciparum is the most virulent and is responsible for most of the morbidity and mortality. Despite various public health interventions such as use of insecticide-treated bed nets, spraying of homes with insecticides and use of WHO recommended artemisinin-based combination therapies (ACT), malaria prevention still faces major setback due to drug and insecticide resistance by P. falciparum and mosquitoes respectively. The study uses molecular docking and immunoinformatics to screen various Plasmodium spp antigens and evaluate their antigenicity and suitability as vaccine candidates. The P. falciparum antigens and T-cell receptor (TCR) structures were obtained from Protein Data Bank (PDB) based on a range of factors related to their role in the lifecycle of the parasite and their status as vaccine targets. Protein structures not available in the PDB were predicted using AlphaFold. The 3D structures of selected P. falciparum antigens and TCR structures were downloaded in PDB format then all water molecules, Hetatm, and bound ligands were deleted from the protein structures using BIOVIA Discovery Studio Visualizer. Subsequently, molecular docking was done using ClusPro v2.0 server and docked complexes were compared. The findings of this study gave valuable insights into the interaction of human immune response with P. falciparum antigens. The best three ranked antigen complexes are PfCyRPA, PfMSP10 and PfCSP and this confirm their use as potential candidates for vaccine development. This study highlights the usefulness of computational docking in identifying P. falciparum antigens of excellent immunogenic potential as vaccine candidates.

15
Prediction and analysis of new HisKA-like domains

Silly, L.; Perriere, G.; Ortet, P.

2026-03-02 bioinformatics 10.64898/2026.02.27.708494 medRxiv
Top 0.4%
3.0%
Show abstract

Histidine kinases (HKs) are part of many signaling pathways, by being implicated in two components systems (TCS). Using autophosphorylation and phosphotransfer to a response regulators (RR), they enable organisms to adapt to their environment. Most HKs are transmembrane proteins with a sensing domain outside of the cell and two catalytic domains called HisKA and HATPase. HATPase is required for interaction with the ATP and HisKA contains the phosphorylated histidine residue. HKs are involved in various environmental adaptation mechanisms, like light sensing or biochemical changes. Studying their diversity is therefore important to better understand how cells interacts with their environment. There exist incomplete HKs (iHKs) lacking either the HisKA or HATPase domain. Some iHKs with an HATPase domain possess a section of their sequence where an HisKA domain could be expected. These iHKs may contain "true" HKs, with unknown HisKA domain, that could fill gaps in various signaling pathways. In this study we analyzed 869 964 sequences of iHKs having an HATPase domain but lacking an HisKA domain. We identified 18 HisKA-like profiles and did multiple meta-studies to assessed their HisKA-like characteristics. We found that their 3D structures matched the structure of known HisKA domains. We saw that the genomic context of the genes associated to these profiles contained genes implicated in signal transduction pathways. We cross-validated some of our profiles with curated annotations, as well as with a "negative dataset" made of non-HK proteins. We believe that our work could help improve the annotation of regulation pathways in prokaryotes.

16
Exploring genetic, expression and regulatory patterns of parental alleles in Muscovy duck (Cairina moschata) using haplotype-resolved assemblies

Li, T.; Wang, y.; Zhang, Z.; Chen, c.; Zheng, n.; Wang, j.; Ning, m.; Wang, j.; Ai, H.; Huang, Y.

2026-03-07 genomics 10.64898/2026.03.04.709678 medRxiv
Top 0.5%
2.8%
Show abstract

BackgroundAlthough the biological mechanism for heterosis has been debated for a long time, heterosis is widely utilized to increase the global productivity of crops and livestock. Recently, the mechanism has been well characterized in crops and livestock with a male-heterogametic XY system due to genomic assembly advancements, especially the availability of haploid genomes. However, the biological mechanism for heterosis remains unclear in poultry possessing the female-heterogametic ZW system. ResultsHere, we assembled chromosome-level diploid and haploid genomes of the Muscovy duck. We developed an efficient and cost-effective method to assemble 12 variation graph-haploid Muscovy duck genomes from three full-sibling pairs with high quality using short-read Illumina sequences. We further characterized genetic, expression and regulatory patterns of parental alleles at multiple scales. We found that maternal haploid genomes generally had more open chromatin organization and higher accessibility, and higher levels of gene expression, while showing similar DNA methylation levels when compared to paternal haploid genomes. In contrast, the female paternal Z chromosome showed the most, and the male paternal Z chromosome presented more, relaxed chromatin organization and chromatin accessibility, and gene expression compared to the male maternal Z chromosome. Thus, the ZW system largely relies on compensation and balance to regulate gene expression on the sex Z chromosome. Moreover, we identified non-Mendelian regions covering 0.26% of the genome ([~]3.18 Mb). These regions contained lower gene density, GC content, and repeat sequence frequency, but were enriched for DNA motifs bound by transcription factors, likely leading to a compacted chromatin structure and lower chromatin accessibility. ConclusionsOur work here provides a comprehensive profile of parental alleles genetic, expression and regulatory patterns in the female-heterogametic ZW system, and might be useful for the utilization of heterosis in poultry.

17
Comparative transcriptomics of immune response to viral and bacterial stimuli in three acanthopterygian bony fish

Rodriguez Vazquez, R.; Gundappa, M. K.; Aramburu, O.; Radojicic, J.; Tsigenopoulos, C. S.; Ferraresso, S.; Franch, R.; Bargelloni, L.; Martinez, P.; Robledo, D.; Megens, H.-J.

2026-01-30 genomics 10.64898/2026.01.30.702270 medRxiv
Top 0.5%
2.7%
Show abstract

Diseases triggered by bacterial and viral infections have caused huge economic losses for three of the most important European aquaculture species: turbot (Scophthalmus maximus), gilthead seabream (Sparus aurata) and European seabass (Dicentrarchus labrax). Understanding how they respond to pathogens is relevant for advancing aquaculture disease management and comprehending evolution of immune response within teleosts. Since mechanisms conserved across species are assumed to perform important roles, comparative analysis provides a powerful approach to pinpoint key elements of the immune defence. Here, we report the first comparative immune-transcriptomic analysis of these three species using bacterial and viral mimics after 20-24 hours post-stimulation with inactivated Vibrio anguillarum and Poly I:C in the head kidney of live fish (in vivo), and in primary leukocyte cultures (in vitro). The transcriptomic response, based on RNA-seq data, revealed a total of 503 differentially expressed orthologous genes in response to in vitro-Poly I:C, 1,472 to in vitro-Vibrio, 920 to in vivo-Poly I:C, and 832 to in vivo-Vibrio. Interestingly, consistent expression patterns were identified in seven genes across all species in both cell culture and live organisms in response to both pathogen stimuli. Functional enrichment analysis revealed associations with immunity, DNA replication and repair, and cytokine pathways, with the Toll-Like Receptor (TLR) pathway common to both conditions and stimuli. Our study suggests conservation of orthologous gene expression during infection across the three species for genes involved in chemokine pathways, interferon signalling, antigen processing and presentation, cell signalling regulators, and MAPK cascades. This study provides insights into key immune defence mechanisms in acanthopterygian bony fish.

18
Comparative LUSZ Therapeutic Study (LUSZ_AVIST) of Antiviral, Antiretroviral, and Immunosuppressive Treatments in Hospitalized COVID-19 Patients with High-Risk Factors, Biomarkers, and Disease Progression.

Makdissy, N.; Makdessi, E. W.; Fenianos, F.; Nasreddine, N.; Daher, W.; El Hamoui, S.

2026-04-13 respiratory medicine 10.64898/2026.04.10.26350587 medRxiv
Top 0.5%
2.7%
Show abstract

COVID-19 has spread rapidly and caused a global pandemic making it one of the deadliest in history. Early identification of patients with coronavirus disease 2019 who may develop critical illness is of immense importance. Therefore, novel biomarkers were needed to identify patients who will suffer rapid disease progression to severe complications and death. Many treatments were adopted including the antiviral Remdesivir, the antiretroviral Lopinavir /Ritonavir and Tocilizumab. Our study aimed not only to specify high-risk factors and biomarkers of fatal outcome in hospitalized subjects with coronavirus but also to compare the efficacy of the three considered treatments to help clinicians better choose a therapeutic strategy and reduce mortality. We divided the population (n=711) into four main groups based according to the WHO ordinal severity scale. The percentage of mortality, in and out the hospital, the length of stay in the hospital, the pulmonary inflammatory lesion and its distribution, the SARS-CoV-2 IgM and IgG variations at admission, the inflammatory markers, the complete blood count, the coagulation factors and enzymes, proteins and electrolytes profile, glucose and lipid profile, and other relevant markers were measured. The significance of the observed variation was assessed by multivariate and ANOVA analyses. We succeeded to establish a novel predictive scoring model of disease progression based on a cohort of Lebanese hospitalized patients relying on the pulmonary inflammatory lesions, inflammation biomarkers such as LDH, D-Dimer, CRP, IL-6 and the lymphocyte count, the number of comorbidities and the age of the patient which all were significantly correlated with the illness severity showing best outcomes with immunomodulatory and anticoagulant treatments by the results. As top tier, Tocilizumab was more efficient than the two other treatments in non-severe cases but none of the used treatments was insanely effective alone to reduce mortality in severe cases.

19
European taurine vs Indian indicine cattle: a comparative genomic study reveals regions of differentiation during evolution and selection during crossbreeding

Muthusamy, P. V.; Gupta, P.; Upadhyay, N.; Mani, R. V.; Kaur, M.; Bhaskar, B.; Pillai, R. R.; Kumar, T. S.; Anilkumar, T. V.; Kulkurni, S.; Azam, S.; Singh, N. S.

2026-02-05 genomics 10.64898/2026.02.05.704058 medRxiv
Top 0.5%
2.6%
Show abstract

Cattle are broadly classified into two subspecies: Bos taurus, adapted to temperate climates, and Bos indicus, adapted to tropical environments. Indicine cattle show better heat tolerance and stronger disease resistance, whereas taurine cattle are known for higher milk yield and better meat quality. Improving milk yield while maintaining resistance to heat stress and infectious diseases is an important objective in cattle breeding. Marker-assisted selection is an effective approach for improving economically important traits, highlighting the need to understand genetic differences between taurine and indicine cattle. However, genome-wide comparisons between European taurine and Indian indicine cattle remain limited. To address this gap, whole-genome sequencing of 48 Indian indicine cattle was performed and combined with publicly available data. This enabled a comparative genomic analysis of 74 Indian indicine and 83 European taurine individuals to identify genes involved in heat tolerance and immune response. Genome-wide analyses using Fst and XP-CLR identified 4,343 and 1,457 differentiated genes, respectively, with 826 genes common to both methods. These genes were mainly associated with immune response, protein stability, and cytoskeletal structure. Strong selection signals were observed in three heat shock protein genes (DNAJC11, DNAJC5, and DNAJB11) and 229 immune-related genes. To examine the inheritance of these genes through crossbreeding, a haplotype-resolved genome assembly was generated for the Indian crossbreed Sunandini, which showed predominantly taurine ancestry (69.13-96.04%), with a smaller indicine contribution (1.22-1.87%). Several genes related to heat tolerance and immune response were inherited exclusively from indicine cattle, highlighting their importance for environmental adaptation and future breeding programs.

20
The emergence of bacterial blight pathogen followed the dispersal pattern of rice in Asia

Quibod, I. L.; Nguyen, M. H.; Atienza-Grande, G.; Patarapuwadol, S.; Kositratana, W.; Nafisah, N.; Rosa, C.; Prasetiyono, J.; Fatimah, F.; Laha, G. S.; Sundaram, R. M.; Perez-Quintero, A. L.; Adorada, D.; Ash, G.; Vera Cruz, C.; SCHEPLER-LUU, V.; Oliva, R.

2026-02-19 genomics 10.64898/2026.02.19.706742 medRxiv
Top 0.6%
2.3%
Show abstract

Crop domestication has a significant effect on the evolutionary trajectory of plant pathogens by providing new ecological niches and abundant resources. The domestication of Asian rice (Oryza sativa) in Asia, approximately 9,000 years ago, might have shaped the genetic makeup of associated microbes into modern threats. In this study, we provide insight into the evolutionary history and dispersal pattern of the rice bacterial blight (BB) pathogen, Xanthomonas oryzae pv. oryzae (Xoo), one of the most destructive rice pathogens in the last century. The analysis of 433 Asian Xoo (AXoo) genomes identified twelve modern populations derived from three ancestral lineages (AXooL). Each population emerged with a unique genetic composition including the combination of pathogenicity factors. Bayesian reconstruction suggests that Xoo lineages emerged alongside O. sativa domestication hotspots and followed the dispersal pattern of rice across the continent. An ancient Xoo lineage (AXooL1) emerged in China and was likely dispersed with japonica rice. A second lineage (AXooL2) which could have turned up from China and spread across India, then evolved due to the domestication and spread of indica rice, and later on expanded eastward of Asia.. We also showed that recombination played a significant role in the emergence of AXooL3, which appeared more recently and might have spread with the rice trading routes. Our study aligns the evolution and dissemination of the phylogroup AXoo with the history of O. sativa, offering valuable insights for the formulation of precise disease management strategies. AUTHOR SUMMARYRice domestication was a crucial step in the development of Asian civilization. However, this process also affected the evolution of an associated pathogen, leading to its emergence as a global threat. Rice bacterial blight (BB), caused by the pathogen Xanthomonas oryzae pv. oryzae (Xoo), has been a scourge in many Asian countries. Using population genomics, we explored the diversity and evolutionary history of Xoo in Asia (AXoo). Here we show that two ancestral pathogen lineages emerged in rice domestication centers (China and India) and dispersed with rice across the continent. More recently, recombination played a crucial role in the appearance of a third lineage that spread through trading activity. This study provides the implications of the adaptation of AXoo in Oryza sativa, and might be valuable in forecasting BB outbreaks.