Genomics
○ Elsevier BV
Preprints posted in the last 90 days, ranked by how well they match Genomics's content profile, based on 60 papers previously published here. The average preprint has a 0.08% match score for this journal, so anything above that is already an above-average fit.
Panigrahi, J.; Panigrahy, D.; Rath, B.; Gupta, K.
Show abstract
Heavy metal ATPases (HMAs) are important group of transmembrane proteins involved in homeostasis of metal ions in plant systems. In this study, a comprehensive analysis of genome assembly (VC1973A v7.1) resulted in the identification of nine HMA genes (VrHMA) and their corresponding proteins in Mungbean, an agronomically important legume crop known for its nutritional values. VrHMA proteins were also characterized based on their biomolecular features, conserved domains and motifs arrangement, transmembrane helices, pore-line helices, subcellular location and occurrence of signal peptides. Based on sequence homology, nine VrHMAs were clustered into two major substrate-specific groups: VrHMA1, VrHMA5 and VrHMA7 were categorized under the Zn/Co/Cd/Pb ATPase group, whereas the remaining six VrHMAs belong to the Cu/Ag subgroup. Gene structure analysis and promoter scanning revealed the structural divergence and presence of various stress-responsive cis-acting elements, respectively. The expression analysis of VrHMA genes in root and leaf tissues, in response to heavy metal (Zn, Cd and Cu) stress, indicates their role in the uptake, transport and sequestration of metal ions. Interestingly, VrHMA5 showed incremental upregulation in roots in response to all three heavy metal stresses, whereas its expression was only upregulated in the leaf tissues under Zn stress, which indicates its role in vascular transport in V. radiata. In addition, this study provides valuable insights into the functional roles of VrHMA genes and will lay a foundation for future genetic improvement in mung bean aimed at enhanced heavy metal stress tolerance and micronutrient homeostasis.
Ren, Y.; Wang, F.; Li, X.; Liu, G.; Sun, R.; Zheng, X.; Zhang, Y.; Lin, R.; Lu, X.; Chen, L.; Xin, W.; Fei, Y.; Chao, Z.
Show abstract
BackgroudWuzhishan (WZS) pigs are native to Hainan Province of China, and serve as both important agricultural resources and biomedical models. Although the published WZS pig genome (T2T-pig1.0) even achieving telomere-to telomere (T2T) completeness, substantial genetic diversity still exists within the same pig breed, another WZS pig genome named WZS-T2T was assembled in this study. ResultsMultiple sequencing data were used to assemble genome, and finally yielded a [~]2.68 Gb telomere-to-telomere genome, with N50 length [~]142.87 Mb, and annotated protein coding genes of 23,100. Compared to T2T-pig1.0, QV and BUSCO value was higher, and the Y chromosome (ChrY) length was longer in WZS-T2T than that of T2T-pig1.0. ChrY of two WZS pigs shared 11 genes, including sex differentiation-related genes of SHOX, PRKX, and DDX3X, and SRY; however, energy metabolism gene SLC25A4 and the macrophage-related receptor gene CSF2RA of ChrY were specific to WZS-T2T. An inversion SV on chromosome 10 with length [~]33.86 Mb was identified between two WZS pigs, and three proofs were proposed for proving the accuracy sequence orientation of WZS-T2T.The genetic diversity was consistent with LD decay speed in population different analysis. WZS pigs exhibited higher genetic diversity than other four pig populations (Tunchang pigs, Yuxi black pigs, Large White pig, and Duroc pigs) examined in this study, and presented slower LD decay compared to other four breeds. ConclusionsTherefore, WZS-T2T provided a higher-quality assembly, and potential advantages of both agricultural production and biomedical targets for WZS pigs.
Gruenebast, J.; Singhal, R.; Olson, S.; Bromley, R.; Kanatani, S.; Ko, K.; Dunning Hotopp, J. C.; Sinnis, P.; Llinas, M.; Serre, D.
Show abstract
Long non-coding RNAs (lncRNAs) are critical regulators of gene expression in eukaryotes. Short reads from Illumina sequencing, reverse transcriptase artefacts, and incomplete second-strand degradation in strand-specific cDNA libraries hamper genome-wide identification of lncRNAs, especially in gene-dense genomes such as Plasmodium. Here, we integrated long-read Oxford Nanopore Technology direct RNA sequencing, ribosome profiling, and single-cell transcriptomics to generate a robust and stage-specific characterization of P. falciparum lncRNAs. We generated comprehensive annotations of lncRNAs expressed in both asexual and sexual blood stages and confirmed their non-coding nature using ribosome profiling. Most lncRNAs showed pronounced stage-specific expression and appeared to be particularly abundant in mature gametocytes. Single-cell RNA sequencing revealed differential expression of many lncRNAs in female and male gametocytes, suggesting important roles in gametocytogenesis and transmission. Many lncRNAs are located antisense to protein-coding genes and are co-expressed with their sense mRNA, possibly from putative bidirectional promoters, while others overlap mRNA coding sequences or 3 untranslated regions and showed negatively correlated expression patterns. Overall, our study shows the prevalence of P. falciparum lncRNAs and highlights their possible roles in controlling the regulation of gene expression, particularly during gametocytogenesis.
Durante, A.; Feve, K.; Naylies, C.; Labrune, Y.; Gress, L.; Lippi, Y.; Legoueix, S.; Milan, D.; Gourdine, J.-L.; Gilbert, H.; Renaudeau, D.; Riquet, J.; Devailly, G.
Show abstract
BackgroundGene expression levels are affected by genetics and environmental effects. However, quantification of the influence of genetics and environmental effects on gene expression remains limited, especially in farm animals. Here, the relative influence of genetic and heat-related environmental variations on gene expression levels was investigated in pigs, using a backcross herd of diverse heat adaptation levels. Backcross animals were raised in either a tropical or temperate environment. Animals raised in temperate environment were subjected to an experimental heat stress at the end of their growth. ResultsWe identified 1,967 differentially expressed genes (DEGs) between pigs raised in the tropical (n = 181) and temperate (n = 180) facilities, and 472 DEGs throughout a 3 weeks experimental heat stress. Transcriptome-wide association (TWAS) study identified 139 associations between gene expression levels and thermoregulation/production traits. We detected 6,014 expression quantitative trait loci (eQTLs) associated with the expression level of 3,297 genes. Genetic variance was estimated to explain 36.3% of gene expression variance on average, and was the main source of variance for 27.7% of transcripts. Most eQTLs found are located in proximal regions (cis-eQTLs) and few within distal regions (trans-eQTLs) to their assigned genes. A trans-eQTL hotspot highlighted a hematopoietic mechanism driven by GPATCH8. An integration of GWAS and TWAS pointed to TMCO1 and ZNF184 as candidate genes for backfat thickness. ConclusionsThis study provides a better understanding of the impact of climate, heat stress and genetic influences on the pig whole blood transcriptome.
Majumder, S.; Biswas, P.; Chakrabortty, R.; Ahmed, S.; Rahman, M. A.
Show abstract
BackgroundBronchiectasis in adults often goes undiagnosed following the routine assessment. Cystic Fibrosis (CF) is usually diagnosed during childhood, but some cases are identified in adulthood when disease is mild. High-resolution computed tomography (HRCT) of chest may offer structural information that can indicate CF as an underlying etiology. ObjectiveTo compare the HRCT features of adult patients with CF and non-CF bronchiectasis and to determine the radiologic features that may be suggestive of CF. MethodsThis retrospective, analytical, cross-sectional study was carried out in Bangladesh Medical University after IRB clearance. Total 130 adults (12 with CF and 118 with non-CF bronchiectasis) of both sexes, whose bronchiectasis was confirmed by chest HRCT were included. Imaging findings were assessed based on Reid morphological classification, anatomical distribution and extent of spread within the lungs, and their association was tested using chi-square test with statistical significance of p<0.05. ResultsCystic bronchiectasis was more common in CF than non-CF patients (83.3% vs 29.7; p<0.001). Mixed central-peripheral extension had been found a considerable associated with CF (66.7% vs. 42.4; p=0.034). There was no statistically significant difference in right lung lobar distribution (p=0.540) but combined upper and lower lobe involvement on the left side was more common in CF patients (54.5%) than non-CF patients (21.3) (p=0.054). ConclusionAdult CF had unique chest HRCT imaging characteristics when compared to non-CF bronchiectasis, especially cystic morphology and mixed extension. Identification of such features could help physician in the early diagnosis and selection of treatment strategy.
Kostareva, O. S.; Eliseeva, I. A.; Buyan, A. I.; Lyabin, D. N.; Tishchenko, S. V.; Mikhaylina, A. O.
Show abstract
Nucleobindin 1 (NUCB1) is a multifunctional conserved protein located in Golgi luminal, nucleus, extracellular and cytosolic pools. NUCB1 is multidomain protein comprised of a signal peptide, a DNA-binding domain, a leucine zipper and Ca2+ -binding domain. The multiple domains and localization of NUCB1 potentiates its interactions with various partners, such as DNA, Gi3 protein, cyclooxygenase 2, LRP10 and RNA suggests its importance in the regulation of many cellular events. We revealed that NUCB1 contains three RNA-binding regions and able to interact with two RNA fragments. It was suggested possible variants of the participation of NUCB1 in the interaction of the two partially complementary RNAs. The RNA-binding properties of the NUCB1 were also confirmed in vivo experiments.
He, Q.; Vasiljevic, S.; Kadri, N.; Watson, N.; Stratz, P.; Mapel, X. m.; Leonard, A. S.; seefried, F. R.; Pausch, H.
Show abstract
Milk urea concentration (MUC) is an indicator of dietary protein utilization and nitrogen use efficiency in dairy cows. We performed genome-wide association studies (GWAS) on MUC in early, mid, and late lactation in the Holstein (HOL) and Brown Swiss (BSW) dairy cattle breeds using imputed sequence variants. We identified 11 and 17 independent quantitative trait loci (QTL) for MUC across the three lactation stages in BSW and HOL, respectively. While many of these QTL have previously been reported for MUC and other dairy traits, our study provides evidence that some QTL exert lactation-stage specific effects. Our findings suggest that variants at the DGAT1 locus on BTA14 have pleiotropic effects on MUC and other dairy traits. This QTL showed an early lactation-specific association with MUC but impacted milk and fat yield across the entire lactation. We fine-mapped two QTL for MUC in early and mid-lactation in BSW on BTA9 (lead SNP: 9:21392941, Pcorrected = 1.1E-17) and BTA28 (lead SNP: 28:6518357; Pcorrected = 3E-11). We identified lncRNA ENSBTAG00000058688 and IBTK as positional and functional candidate genes for the BTA9 QTL, and KCNK1 as positional and functional candidate gene that harbors a highly significant missense variant for the BTA28 QTL. In conclusion, our results shed light on the genetic architecture of MUC and highlighted QTL harboring potential functional variants underpinning milk urea variation within and across breeds.
Tozzoli, R.; Schadron, T.; Knijn, A.; De Sabato, L.; Morabito, S.; Montalbano Di Filippo, M.; Fiskebeck, E.; Johannessen, G.; Antony-Samy, J. K.; Good, L.; Soderlund, R.; van Hoek, A.; Mughini Gras, L.; Franz, E.; Wieczorek, K.; Scavia, G.; Moro, O.; Chiani, P.; Michelacci, V.; Burgess, C. M.; Duffy, G.; Rodgers, J.; Kirchner, M.; Pista, A.; Silveira, L.; Amaro, A.; Clemente, L.; Chattaway, M. A.; Jenkins, C.; Dallman, T.; Schjorring, S.; Scheutz, F.; Byrne, B.; Gutierrez, M.; Lopez-Chavarrias, V.; Ugarte-Ruiz, M.; Brandal, L.; Naseer, U.; Kolackova, I.; Zomer, A. L.; Wagenaar, J. A.; Pires, S
Show abstract
Shiga toxin-producing Escherichia coli (STEC) are important foodborne pathogens, able to cause severe disease in humans. In the DiSCoVeR project (https://onehealthejp.eu/jrp-discover/) a STEC inventory from human and non-human sources from 11 European countries was set up and [≥] 3500 strains were sequenced to perform comparative genomics analysis. We used this dataset to assess STEC population structure and to investigate potential associations between genomic features, host reservoirs and symptoms. Most STEC isolates analysed by Whole Genome Sequencing (WGS) in this study were collected between years 2010-2020. An ad hoc pipeline was deployed for a harmonised characterization of the STEC in the database, allowing the determination of serotyping, stx gene subtyping, 7-loci MLST, virulotyping and cgMLST. The results were analysed with Principal Component Analysis (PCoA) in relation with isolation source to assess clustering of STEC subpopulations. When human STEC data were analysed, the PCoA revealed three distinct human STEC subpopulations (STEC_1, STEC_2 and STEC_3), which were further analysed for associations between genomic features, symptoms and variance. The non-human STEC showed a more dispersed distribution, except for one subpopulation with genes linked to specific host species, and some virulence profiles overlapping with the STEC_1 population. In conclusion, our analysis identified distinct STEC subpopulations from human cases, each characterized by specific genetic features and associated with varying proportions of severe disease outcomes. These findings provide novel insights supporting the risk assessment of STEC. Impact statement[This lay summary of your article should be no more than 200 words, and should a) provide a perspective of how this article adds to the literature in the field; b) identify breadth of interest/utility; and c) state the significance of output (incremental or step), in terms of relevance.] This study is based on the establishment of a One Health STEC genomes database, including sequences from isolates of different sources. Most of the isolates had been isolated in the ten-years time span 2010-2020, in 11 different countries, for surveillance and monitoring activities or specific surveys and research purposes. The final dataset included the whole genome sequencing of 3,418 STEC isolates, mainly from human cases of infections. The metadata included the host symptoms, where available, for human STEC strains and the animal source the strains had been isolated from. We set up a pipeline for the harmonized analysis of STEC WGS, called Discover, made available though ARIES webserver or GitHub. The analysis allowed a deep characterization of STEC strains circulating in Europe. We used this resource to assess STEC population structure and to investigate potential associations between genomic features, host reservoirs, and various symptoms associated with STEC infection by PCoA. This analysis highlighted the presence of subpopulation of human STEC associated with specific features. We provide new information useful for risk characterization, as well as a large dataset genome database and associated metadata compiled from STEC strains, representing a valuable resource for the scientific community, enabling further investigations into STEC diversity, evolution, source attribution and public health relevance. Data summaryThe authors confirm all supporting data, including sequence data accession numbers, code and protocols have been provided within the article or through supplementary data files. One supplementary method and five supplementary tables are available with the online version of this article
Choi, S.; Lee, N.; Jeon, H.; Park, J.; Kim, S.; Kim, J.-E.; Shin, J.; Moon, H.; Min, K.; Choi, Y.; Hwangbo, A.; Kim, H.; Choi, G. J.; Lee, Y.-W.; Song, D.-G.; Son, H.
Show abstract
O_LIWD40 is a highly conserved protein domain in eukaryotes, playing a critical role in various cellular process. C_LIO_LIWe conducted genome-wide functional analysis of WD40 genes in Fusarium graminearum--a phytopathogenic fungus that causes severe yield loss and mycotoxin contamination in major cereal crops. C_LIO_LIComprehensive phenome analysis of 119 WD40 gene deletion mutants across 22 distinct phenotypic traits revealed phenotypic divergence within the phenome, establishing a strong correlation between virulence and sexual reproduction. Notably, 21 "core WD40 genes" were identified, offering valuable insights into divergent biological processes. C_LIO_LIPilot interactome studies of Fgwd101 and Fgwd133 provided further insights into their potential pathobiological functions. Our investigation contributes to broadening our knowledge of the biological mechanisms underlying fungal pathogenesis and may assist in the identification of targets for antifungal agents. C_LI
Parida, A. S.; Kumar, A.; Tiwari, B.
Show abstract
The only autonomously active transposable elements in the human genome are Long interspersed nuclear element-1 (LINE-1) elements. These elements are known to play an important role in changing the transcriptome. LINE-1 sequences affect gene regulation during post-transcription processing, along with their established role in retrotransposition. Exonization is one mechanism where the LINE-1 integrated genome undergoes alternative splicing to produce new isoforms of transcripts. Our work mainly highlights the effect of LINE-1 associated exonization, focusing on the formation of isoforms of transcripts. Using Non-small cell lung cancer (NSCLC) as a model, we conducted a detailed transcriptome study that combines splice junction profiling with gene expression data. Our results show that LINE-1 sequences are often included as exons in host transcripts, leading to the formation of new exons and their various isoforms. The events are validated by solid splice junction evidence that proves the reliability and reproducibility. In particular, it was observed that repetitive analyses revealed certain LINE-1 exonization events that were consistent. The finding indicates that LINE-1 act as recurrent sources of splice ready sequences. Though exonizations do not necessarily affect the total expression levels of genes, our study reveals that they certainly contribute to transcript diversity. The diversity of isoforms generated potentially contributes to the effects of gene function. This study is limited to NSCLC, but it is likely that the exonizations events play a crucial role in the altering RNA diversity in cancers. Therefore the study elucidates new insights into how transposable elements modify gene structure and function during cancer development.
Grinstead, S.; Nemchinov, L. G.
Show abstract
We recently reported the identification of endogenous viral elements (EVEs) originating from the Caulimoviridae family within the alfalfa (Medicago sativa L.) genome. Our subsequent identification of ubiquitous rhabdoviral elements in infected and healthy alfalfa tissues by high throughput sequencing prompted us to suggest that the alfalfa genome might be populated with integrated rhabdoviruses as well. Bioinformatics analysis using 26 publicly available alfalfa genomes proved the suggestion accurate. We found multiple non-retroviral segments of the Rhabdoviridae family belonging to the genera Betanucleorhabdovirus and Betacytorhabdovirus that appeared to be stable constituents of the host genome. In that capacity they could potentially acquire functional roles in alfalfas development and response to environmental stresses. We believe this study reveals the first documented case of rhabdoviruses integrated into the alfalfa genome.
Monteiro de Barros, M. R.; Bosch, K.; Soualhi, S.; Issa Bhaloo, S.; Chu, T.; Hemrajani, T.; Cho, J.; Ozuner, K.; Fu, R.; Geiger, H.; Robine, N.; Carter, J. E. B.; Maniatis, S.; Ryeom, S.; Tavare, S.; Nowicki-Osuch, K.
Show abstract
Background & AimsGastric epithelial cells maintain homeostasis through dynamic self-renewal mechanisms involving stem and progenitor cells; however, identifying them has been challenging. This study aims to identify stem cells of healthy gastric epithelium and cell type-specific regulators defining gastric epithelial homeostasis via single-nucleus multiome analysis. MethodsTen unique gastric samples were collected from 8-12 week old wildtype mice. Isolated nuclei were subjected to simultaneous profiling of gene expression and chromatin accessibility. After quality control, 31,598 cells were analyzed with Seurat and Signac using weighted-nearest neighbors analysis for joint RNA and ATAC clustering. Furthermore, SCENIC+, MultiVelo, EpiCHAOS and Cell plasticity score were used to uncover gene regulatory networks, cell state dynamics and lineage trajectories. ResultsOur analyses were validated by the identification of known regulators of stem-cell differentiation into mature cell types. More importantly, it revealed previously uncharacterized regulatory networks comprising novel transcription factor combinations that define cell identities, including Ppara, Pparg, Arid5b and Sox5 as candidate regulators of parietal, foveolar, chief and neck cells, respectively. Further, our data support the identity of isthmus cells as stem-like cells of healthy gastric epithelium, as evidenced by epigenetic plasticity that simultaneously contains open chromatin states of all differentiated cell types in the absence of transcriptional reprogramming. ConclusionConsistent with Waddingtons epigenetic landscape hypothesis, gastric epithelial homeostasis is controlled by orchestrated epigenetic and transcriptional programs. Contrary to the prevailing hypothesis, stem cells can be defined not by a separate epigenetic state but by epigenetic superposition of differentiated cell states. Future work is needed to define the universality of these results.
Costa Monteiro Moreira, G.; Ruiz Gonzalez, A.; Joigner, M.; Costes, V.; Chaulot-Talmon, A.; Ali, F.; Bourgeois-Brunel, L.; Jammes, H.; Rico, D. E.
Show abstract
Epigenetics may play a crucial role in livestock adaptation to environmental challenges like heat stress. In recent years, a growing number of studies have investigated the epigenetic mechanisms underlying dairy cow adaptation to heat stress. However, there is still limited knowledge about the effects of heat stress on immune cells and immune-related phenotypes. Herein we aim to identify heat-stress induced DNA methylation variations on blood methylome potentially affecting regulatory regions and associated phenotypes. Blood samples were collected and peripheral blood mononuclear cell (PBMC) isolated from four cows before (D0) and after (D14) a 14-d heat stress challenge (cyclical THI 72-82) and, from four cows kept in thermoneutral conditions (THI 61-64). Heat-stressed cows had ad libitum access to diets supplemented with adequate levels of vitamin D and Ca (12,000 IU/kg of vitamin D and 0.73% Ca, respectively). To eliminate confounding effects due to differences in nutrient intake, cows maintained under thermoneutral conditions were pair-fed (PF) to their heat-stressed counterparts and received adequate concentrations of vitamin D and Ca as well. Reduced representation bisulphite sequencing (RRBS) was used to profile PBMCs methylome. Differential methylation analysis was performed using methylKit and DSS softwares ({Delta}meth [≥] 25%, adjusted p-value < 0.01), retaining only commonly detected differentially methylated cytosines (DMCs). A total of 2,908 DMCs were identified when comparing pre- and post-heat stress samples. After excluding 649 DMCs that were also detected under thermoneutral conditions, as these changes were likely associated with feed restriction inherent to the pair-feeding design rather than with heat stress per se, 2,259 heat stress-specific DMCs remained, predominantly hypomethylated. About half of the DMCs are annotated in intronic and intergenic regions; known to harbor regulatory elements. By intersecting the DMRs with publicly available functional annotation data, we observed hypomethylation on regulatory regions putatively affecting cows immune system. As an example, we identified a loss of methylation within an enhancer region of the MSN gene, which is involved in lymphocyte homeostasis, and a loss of methylation in the promoter region of MECP2, a well-established epigenetic regulator with a central role in chromatin organization and gene expression. These findings highlight the impact of heat stress on dairy cow immunity and provide new insights into its epigenetic regulation under environmental stress. Interpretative summaryThis study examined DNA methylation changes induced by heat stress in dairy cows to elucidate epigenetic mechanisms of thermal adaptation. Using RRBS on PBMCs, 2,259 heat stress-specific differentially methylated cytosines were identified, predominantly hypomethylated and enriched in regulatory regions. Functional annotation highlighted immune-related pathways, including hypomethylated regulatory regions near genes (e.g., MSN, ZBTB33, SLC25A5, GNAS, FAM3A, and MECP2) associated with immune function. These findings indicate that heat stress induces targeted epigenetic modifications potentially affecting immune regulation in dairy cows.
He, Z.; Li, Y.; Shkurat, T. P.; Butenko, E. V.; Derevyanchuk, E. G.; Lomteva, S. V.; Chen, L.; Lipovich, L.
Show abstract
BackgroundPolycystic ovary syndrome (PCOS) is a prevalent endocrine disorder and a leading cause of female infertility, with complex genetic, metabolic, and hormonal etiologies. Long non-coding RNAs (lncRNAs) have emerged as important regulators of diverse biological processes, yet their roles in PCOS remain underexplored. Here, we identified and characterized PCOS differentially expressed gene-associated lncRNAs (PDEGAL) with an integrative approach combining expression data, genetic association, and evolutionary analysis. MethodsThirty-three PCOS-associated protein-coding genes were obtained from our prior study, and all their nearby and overlapping lncRNAs were annotated. These candidates were analyzed using UCSC Genome Browser-mapped annotations and datasets, including NCBI RefSeq, GENCODE, GTEx, GWAS SNPs, and conservation, as well as the FANTOM5 cap analysis of gene expression (CAGE) promoter data, to assess their expression, regulatory potential, genetic variant overlaps, and evolutionary conservation. ResultsTwenty-three PDEGALs (18 antisense to, and 5 sharing bidirectional promoters with, known PCOS-associated protein-coding genes) were identified. 17 PDEGALs contained GWAS SNPs with statistically significant disease associations, 9 of which were associated with PCOS-related traits. 5 PDEGALs demonstrated expression in the KGN granulosa cell model of PCOS. Key gene structure element (KGSE) analysis revealed that most PDEGALs are primate-specific. Integrating four criteria--GTEx expression, GWAS SNPs, FANTOM promoterome, and KGSE conservation--highlighted HELLPAR as the only lncRNA fulfilling all four, while five others--PGR-AS1, MTOR-AS1, ENSG00000265179, ENSG00000256218, and LOC105377276--fulfilled three of the four criteria. ConclusionsWe have systematically identified candidate PCOS regulatory lncRNAs with convergent genetic, expression, and evolutionary evidence. These results provide a framework for functional validation and highlight lncRNAs as potential biomarkers and therapeutic targets in PCOS that function by regulating their nearby and overlapping protein-coding genes.
Kipkoech, G.; Kanda, W.; Irungu, B.; Nyangi, M.; Kimani, C.; Nyangacha, R.; Keter, L.; Atieno, D.; Gathirwa, J.; Kigondu, E.; Murungi, E.
Show abstract
Malaria is one of the deadliest diseases in sub-Saharan Africa and Southeast Asia. The majority of the fatalities occur mostly in children under 5 years and pregnant women and this is due to infection by Plasmodium spp, of which Plasmodium falciparum is the most virulent and is responsible for most of the morbidity and mortality. Despite various public health interventions such as use of insecticide-treated bed nets, spraying of homes with insecticides and use of WHO recommended artemisinin-based combination therapies (ACT), malaria prevention still faces major setback due to drug and insecticide resistance by P. falciparum and mosquitoes respectively. The study uses molecular docking and immunoinformatics to screen various Plasmodium spp antigens and evaluate their antigenicity and suitability as vaccine candidates. The P. falciparum antigens and T-cell receptor (TCR) structures were obtained from Protein Data Bank (PDB) based on a range of factors related to their role in the lifecycle of the parasite and their status as vaccine targets. Protein structures not available in the PDB were predicted using AlphaFold. The 3D structures of selected P. falciparum antigens and TCR structures were downloaded in PDB format then all water molecules, Hetatm, and bound ligands were deleted from the protein structures using BIOVIA Discovery Studio Visualizer. Subsequently, molecular docking was done using ClusPro v2.0 server and docked complexes were compared. The findings of this study gave valuable insights into the interaction of human immune response with P. falciparum antigens. The best three ranked antigen complexes are PfCyRPA, PfMSP10 and PfCSP and this confirm their use as potential candidates for vaccine development. This study highlights the usefulness of computational docking in identifying P. falciparum antigens of excellent immunogenic potential as vaccine candidates.
Froukh, T.
Show abstract
Currently, the genetic architecture of Middle Eastern populations is underrepresented in global genomic databases. This gap increases the rate of Variants of Uncertain Significance (VUSs) and clinical misinterpretations of genomic data especially in Middle Eastern populations. Whole exome sequencing was conducted on 90 healthy individuals from Jordan and the data were analysed using Principal Component Analysis (PCA) and multi-computational filtering. PCA revealed a double ancestry (EUR-AFR) admixture rather than a triple admixture (EUR-AFR-AMR). More than 3,500 populations-specific variants (PSVs) were identified, of which 72% were singletons. Additionally, 19 variants were significantly enriched compared to the maximum allele frequencies in public global databases (Fisher's exact test with Benjamini-Hochberg false discovery rate correction, p-value < 0.05). Consequently, the results suggest the reclassification of variants of Uncertain Significance (VUS) which reside in the ECE2 gene to likely benign and the variants of Conflicting Classification of Pathogenicity in the genes IL1RN and THPO to benign based on the significant allele frequency (AF=0.0389, p-value < 0.05). Furthermore, a pathogenic ClinVar variant was identified in a healthy individual, warranting careful interpretation. The findings underscore the importance of identifying PSVs in order to minimize or even prevent clinical misdiagnosis and highlight the unique genetic signature in Jordan. The study serves as a foundational resource for precision medicine in the region.
Rafei, R.; Njamkepo, E.; Gaillard, M.; Mohamed, S. I.; Smith, A. M.; Benamrouche, N.; Pazzani, C.; Ramamurthy, T.; Dabboussi, F.; Hamze, M.; Quilici, M.-L.; Rouard, C.; Weill, F.-X.
Show abstract
IncC plasmids played an important role in driving antimicrobial drug resistance development in the seventh pandemic Vibrio cholerae O1 El Tor (7PET) lineage during the 1970s and these have begun to re-emerge. In this study, we investigated a comprehensive dataset for 28 complete IncC plasmids -- including 17 newly sequenced plasmid genomes -- from 7PET isolates collected on various continents between 1979 and 2024 and distributed across the global phylogenetic tree for 7PET isolates. IncC type 2 predominated among the V. cholerae plasmids studied, and five new core genome sequence types (cgSTs) were identified. The antimicrobial resistance genes (ARGs) were arranged in islands inserted at specific hotspots within the common IncC backbone and were significantly associated with IncC types or islands. The IncC plasmid backbone has remained stable over the last 50 years, but the ARGs and their associated genomic islands displayed remarkable diversity, underscoring the complex evolution patterns of the 7PET lineage of V. cholerae and its considerable adaptability under selective pressure.
Li, T.; Wang, y.; Zhang, Z.; Chen, c.; Zheng, n.; Wang, j.; Ning, m.; Wang, j.; Ai, H.; Huang, Y.
Show abstract
BackgroundAlthough the biological mechanism for heterosis has been debated for a long time, heterosis is widely utilized to increase the global productivity of crops and livestock. Recently, the mechanism has been well characterized in crops and livestock with a male-heterogametic XY system due to genomic assembly advancements, especially the availability of haploid genomes. However, the biological mechanism for heterosis remains unclear in poultry possessing the female-heterogametic ZW system. ResultsHere, we assembled chromosome-level diploid and haploid genomes of the Muscovy duck. We developed an efficient and cost-effective method to assemble 12 variation graph-haploid Muscovy duck genomes from three full-sibling pairs with high quality using short-read Illumina sequences. We further characterized genetic, expression and regulatory patterns of parental alleles at multiple scales. We found that maternal haploid genomes generally had more open chromatin organization and higher accessibility, and higher levels of gene expression, while showing similar DNA methylation levels when compared to paternal haploid genomes. In contrast, the female paternal Z chromosome showed the most, and the male paternal Z chromosome presented more, relaxed chromatin organization and chromatin accessibility, and gene expression compared to the male maternal Z chromosome. Thus, the ZW system largely relies on compensation and balance to regulate gene expression on the sex Z chromosome. Moreover, we identified non-Mendelian regions covering 0.26% of the genome ([~]3.18 Mb). These regions contained lower gene density, GC content, and repeat sequence frequency, but were enriched for DNA motifs bound by transcription factors, likely leading to a compacted chromatin structure and lower chromatin accessibility. ConclusionsOur work here provides a comprehensive profile of parental alleles genetic, expression and regulatory patterns in the female-heterogametic ZW system, and might be useful for the utilization of heterosis in poultry.
Makdissy, N.; Makdessi, E. W.; Fenianos, F.; Nasreddine, N.; Daher, W.; El Hamoui, S.
Show abstract
COVID-19 has spread rapidly and caused a global pandemic making it one of the deadliest in history. Early identification of patients with coronavirus disease 2019 who may develop critical illness is of immense importance. Therefore, novel biomarkers were needed to identify patients who will suffer rapid disease progression to severe complications and death. Many treatments were adopted including the antiviral Remdesivir, the antiretroviral Lopinavir /Ritonavir and Tocilizumab. Our study aimed not only to specify high-risk factors and biomarkers of fatal outcome in hospitalized subjects with coronavirus but also to compare the efficacy of the three considered treatments to help clinicians better choose a therapeutic strategy and reduce mortality. We divided the population (n=711) into four main groups based according to the WHO ordinal severity scale. The percentage of mortality, in and out the hospital, the length of stay in the hospital, the pulmonary inflammatory lesion and its distribution, the SARS-CoV-2 IgM and IgG variations at admission, the inflammatory markers, the complete blood count, the coagulation factors and enzymes, proteins and electrolytes profile, glucose and lipid profile, and other relevant markers were measured. The significance of the observed variation was assessed by multivariate and ANOVA analyses. We succeeded to establish a novel predictive scoring model of disease progression based on a cohort of Lebanese hospitalized patients relying on the pulmonary inflammatory lesions, inflammation biomarkers such as LDH, D-Dimer, CRP, IL-6 and the lymphocyte count, the number of comorbidities and the age of the patient which all were significantly correlated with the illness severity showing best outcomes with immunomodulatory and anticoagulant treatments by the results. As top tier, Tocilizumab was more efficient than the two other treatments in non-severe cases but none of the used treatments was insanely effective alone to reduce mortality in severe cases.
Hu, Y.; Huang, Y.; Yong, Y.; Shang, E.; Zhang, B.; Sui, Z.
Show abstract
As an important cultivated red alga, Gracilariopsis lemaneiformis has great economic and ecological value. However, its existing genome assembly is highly fragmented and inadequately annotated. In this study, we constructed the first high-quality chromosome-level genome of Gp. lemaneiformis using PacBio long reads, Illumina short reads and Hi-C sequencing data. The assembled genome was approximately 86.66 Mb and the assembled sequences were anchored to 28 pseudo-chromosomes with lengths ranging from 1.70 to 7.81 Mb. 99.91% of the PacBio reads could be mapped to our assembly. In total, 8,664 genes were annotated, and the repeat elements identified in Gp. lemaneiformis constituted 65.04% of the whole genome, including 2.24% tandem repeat sequences and 62.81% interspersed repeats. We also established a high-evidence phylogenetic tree from 19 representative algae species, with the main aim to calculate their divergence times. This high-quality genome of Gp. lemaneiformis provides a crucial foundation for understanding genetic characteristics, investigating the genomic evolution, and facilitating molecular breeding.