Back

ARACRA: Automated RNA-seq Analysis for Chemical Risk Assessment

sharma, S.; Kumar, S.; Brull, J. B.; Deepika, D.; Kumar, V.

2026-04-09 bioinformatics
10.64898/2026.04.07.716912 bioRxiv
Show abstract

Transcriptomic analysis is considered a powerful approach for biomarker discovery, however still exploring large scale omics dataset to extract meaningful biological insights remains a challenge for biologists. To address this gap, we present ARACRA a fully automated RNA-seq analysis pipeline including entire transcriptomics workflow from raw FASTQ files to the transcriptomics Point of Departure (tPoD) with human-in-the-loop review process. Overall, the analysis is performed in two phases: Phase 1 carries out the acquisition of raw reads, pre-alignment quality control, alignment to reference genome and quantification of gene expression. Whereas, Phase 2 performs statistical analysis including Differential Gene Expression analysis and Dose-Response modelling. Two phases are separated by an extensive quality control step which allows the user to visually inspect the quality of data processed and helps in filtering noise and outlier samples. ARACRA facilitates end-to-end analysis of RNA-Seq data through an interactive web-based application developed on nextflow and streamlit for minimizing computational complexities while ensuring correct downstream processing. Availability and implementationARACRA is freely available online at the GitHub with MIT License and stream lit-based web application: ARACRA. Researchers can use the demo data or even upload their own data to do the analysis. O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=78 SRC="FIGDIR/small/716912v1_fig1.gif" ALT="Figure 1"> View larger version (27K): org.highwire.dtl.DTLVardef@15170a9org.highwire.dtl.DTLVardef@1bb9822org.highwire.dtl.DTLVardef@1010f3aorg.highwire.dtl.DTLVardef@8ee6e6_HPS_FORMAT_FIGEXP M_FIG O_FLOATNOFig 1:C_FLOATNO Overall Architecture of ARACRA C_FIG

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
14.6%
2
PLOS ONE
4510 papers in training set
Top 13%
14.6%
3
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.8%
4.9%
4
BMC Bioinformatics
383 papers in training set
Top 2%
4.2%
5
iScience
1063 papers in training set
Top 4%
3.6%
6
Briefings in Bioinformatics
326 papers in training set
Top 2%
2.9%
7
Bioinformatics Advances
184 papers in training set
Top 2%
2.6%
8
SoftwareX
15 papers in training set
Top 0.1%
2.4%
9
PLOS Computational Biology
1633 papers in training set
Top 13%
2.1%
50% of probability mass above
10
GigaScience
172 papers in training set
Top 1%
1.9%
11
Nature Communications
4913 papers in training set
Top 48%
1.9%
12
Journal of Proteome Research
215 papers in training set
Top 1%
1.9%
13
Analytical Chemistry
205 papers in training set
Top 1%
1.7%
14
Metabolites
50 papers in training set
Top 0.5%
1.7%
15
Scientific Reports
3102 papers in training set
Top 57%
1.7%
16
Cell Reports Methods
141 papers in training set
Top 2%
1.7%
17
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.4%
18
Analytica Chimica Acta
17 papers in training set
Top 0.4%
1.2%
19
Patterns
70 papers in training set
Top 2%
1.1%
20
npj Systems Biology and Applications
99 papers in training set
Top 2%
1.1%
21
Communications Chemistry
39 papers in training set
Top 0.7%
0.9%
22
Peer Community Journal
254 papers in training set
Top 4%
0.8%
23
Archives of Clinical and Biomedical Research
28 papers in training set
Top 2%
0.8%
24
Advanced Science
249 papers in training set
Top 18%
0.8%
25
Scientific Data
174 papers in training set
Top 2%
0.8%
26
Science of The Total Environment
179 papers in training set
Top 5%
0.8%
27
Journal of the American Society for Mass Spectrometry
33 papers in training set
Top 0.5%
0.8%
28
Chemosphere
15 papers in training set
Top 0.6%
0.7%
29
Toxicological Sciences
38 papers in training set
Top 0.6%
0.7%
30
Interface Focus
14 papers in training set
Top 0.3%
0.7%