Back

ToxiVerse: A Public Platform for Chemical Toxicity Data Sharing and Customizable Predictive Modeling

Durai, P.; Russo, D. P.; Shen, Y.; Wang, T.; Chung, E.; Li, L.; Zhu, H.

2026-03-02 bioinformatics
10.64898/2026.02.26.708255 bioRxiv
Show abstract

Chemical toxicity assessment is critical for drug development and environmental safety. Computational models have emerged as a promising alternative to animal testing and now play a significant role in efficiently evaluating new chemicals. To address the urgent need for providing user-friendly machine learning tools in computational toxicology, we developed ToxiVerse, a public web-based platform. It provides curated toxicity datasets, automatic chemical bioprofiling, and a predictive modeling interface designed for researchers who lack programming expertise. The platform comprises three integrated modules: (i) the Bioprofiler module, which provides chemical descriptors by combining chemical-bioactivity data from PubChem assay with a machine learning-based data gap-filling procedure; (ii) the Database module, which hosts around 50,000 curated unique chemicals covering diverse toxicity endpoints; and (iii) the Cheminformatics module, which allows users to upload their own datasets, use datasets from ToxiVerse, or retrieve existing data from PubChem; perform chemical curation; and automatically generate Quantitative Structure-Activity Relationship (QSAR) models to predict chemicals of interest. ToxiVerse enables researchers to carry out bioprofiling, access curated toxicity datasets, and evaluate chemical toxicity through machine learning-based modeling and prediction. The platform is supported by sample files and a detailed tutorial, and it is freely accessible at www.toxiverse.com. GRAPHICAL ABSTRACT O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=80 SRC="FIGDIR/small/708255v1_ufig1.gif" ALT="Figure 1"> View larger version (22K): org.highwire.dtl.DTLVardef@d92764org.highwire.dtl.DTLVardef@a92f4aorg.highwire.dtl.DTLVardef@15fa39corg.highwire.dtl.DTLVardef@1ee89bc_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 1%
18.9%
2
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.3%
18.9%
3
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.4%
6.9%
4
Journal of Cheminformatics
25 papers in training set
Top 0.1%
4.0%
5
BMC Bioinformatics
383 papers in training set
Top 3%
3.6%
50% of probability mass above
6
PLOS ONE
4510 papers in training set
Top 38%
3.6%
7
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.6%
8
Scientific Data
174 papers in training set
Top 0.8%
2.1%
9
Patterns
70 papers in training set
Top 0.5%
2.1%
10
Communications Chemistry
39 papers in training set
Top 0.2%
1.8%
11
Nucleic Acids Research
1128 papers in training set
Top 10%
1.7%
12
Nature Communications
4913 papers in training set
Top 51%
1.7%
13
Science of The Total Environment
179 papers in training set
Top 3%
1.5%
14
ACS Omega
90 papers in training set
Top 2%
1.5%
15
Advanced Science
249 papers in training set
Top 12%
1.5%
16
Bioinformatics Advances
184 papers in training set
Top 3%
1.3%
17
RSC Advances
18 papers in training set
Top 0.8%
1.3%
18
PLOS Computational Biology
1633 papers in training set
Top 19%
1.2%
19
Frontiers in Molecular Biosciences
100 papers in training set
Top 3%
1.2%
20
iScience
1063 papers in training set
Top 24%
1.0%
21
Clinical and Translational Science
21 papers in training set
Top 0.9%
0.8%
22
GigaScience
172 papers in training set
Top 3%
0.8%
23
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.2%
0.8%
24
Cell Reports Methods
141 papers in training set
Top 5%
0.8%
25
Journal of Hazardous Materials
19 papers in training set
Top 0.9%
0.7%
26
Science Advances
1098 papers in training set
Top 31%
0.7%
27
Scientific Reports
3102 papers in training set
Top 76%
0.7%
28
Nature Protocols
30 papers in training set
Top 0.3%
0.7%
29
Interface Focus
14 papers in training set
Top 0.4%
0.7%
30
eLife
5422 papers in training set
Top 61%
0.7%