ARACRA: Automated RNA-seq Analysis for Chemical Risk Assessment
sharma, S.; Kumar, S.; Brull, J. B.; Deepika, D.; Kumar, V.
Show abstract
Transcriptomic analysis is considered a powerful approach for biomarker discovery, however still exploring large scale omics dataset to extract meaningful biological insights remains a challenge for biologists. To address this gap, we present ARACRA a fully automated RNA-seq analysis pipeline including entire transcriptomics workflow from raw FASTQ files to the transcriptomics Point of Departure (tPoD) with human-in-the-loop review process. Overall, the analysis is performed in two phases: Phase 1 carries out the acquisition of raw reads, pre-alignment quality control, alignment to reference genome and quantification of gene expression. Whereas, Phase 2 performs statistical analysis including Differential Gene Expression analysis and Dose-Response modelling. Two phases are separated by an extensive quality control step which allows the user to visually inspect the quality of data processed and helps in filtering noise and outlier samples. ARACRA facilitates end-to-end analysis of RNA-Seq data through an interactive web-based application developed on nextflow and streamlit for minimizing computational complexities while ensuring correct downstream processing. Availability and implementationARACRA is freely available online at the GitHub with MIT License and stream lit-based web application: ARACRA. Researchers can use the demo data or even upload their own data to do the analysis. O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=78 SRC="FIGDIR/small/716912v1_fig1.gif" ALT="Figure 1"> View larger version (27K): org.highwire.dtl.DTLVardef@15170a9org.highwire.dtl.DTLVardef@1bb9822org.highwire.dtl.DTLVardef@1010f3aorg.highwire.dtl.DTLVardef@8ee6e6_HPS_FORMAT_FIGEXP M_FIG O_FLOATNOFig 1:C_FLOATNO Overall Architecture of ARACRA C_FIG
Matching journals
The top 9 journals account for 50% of the predicted probability mass.