Back

ExposoGraph: An Interactive Platform for Carcinogen Bioactivation and Detoxification Pathway Visualization

Pienta, K.; Kazi, J. U.

2026-03-24 bioinformatics
10.64898/2026.03.22.713456 bioRxiv
Show abstract

BackgroundDespite extensive cataloging of carcinogenic exposures by the International Agency for Research on Cancer (IARC) and pharmacogenomic variation by resources such as PharmVar and CPIC, few platforms unify exposure, metabolic activation and detoxification, DNA damage, and genetic annotation within a single interactive visualization framework. This gap limits systematic evaluation of gene-environment interactions in cancer risk assessment. MethodsWe developed the Carcino-Genomic Knowledge Graph, ExposoGraph, an interactive knowledge-graph platform for carcinogen metabolism and DNA damage pathways. The reference graph integrates curated data and annotations from IARC, KEGG, PharmVar, CPIC, CTD, and supporting literature/resources. The current reference graph contains 96 nodes across 5 entity types (Carcinogens, Enzymes, Metabolites, DNA Adducts, and Pathways) and 102 edges across 6 relationship types (activates, detoxifies, transports, forms adduct, repairs, and pathway). ResultsThe first-generation reference graph captures metabolic activation and detoxification pathways for 9 carcinogen classes spanning 15 index carcinogens. It represents 36 enzymes across Phase I activation (n=14), Phase II conjugation and detoxification (n=14), Phase III transport (n=3), and DNA repair (n=5). Interactive exploration supports carcinogen-class filtering, node- and edge-type filtering, metadata-based search, and detailed hover/detail views with provenance and pharmacogenomic annotations. The androgen branch highlights cross-pathway connectivity by linking androgen metabolism to estrogen quinone formation and DNA adduct generation through CYP19A1-mediated aromatization and downstream catechol estrogen chemistry. In the optional androgen-focused extension, additional receptor, tissue, and variant context further connects this branch to androgen receptor signaling and genotype-specific annotations. ConclusionsExposoGraph provides a first-generation integrated, interactive framework linking carcinogenic exposures to metabolic fates and genetic modulators. The platform supports hypothesis generation for gene-environment interaction studies and may inform future individualized risk modeling, while remaining a research-use framework rather than a clinically validated risk-assessment tool.

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
Toxicological Sciences
38 papers in training set
Top 0.1%
10.7%
2
Bioinformatics
1061 papers in training set
Top 3%
8.7%
3
BMC Bioinformatics
383 papers in training set
Top 1%
8.5%
4
Genome Medicine
154 papers in training set
Top 2%
3.8%
5
PLOS ONE
4510 papers in training set
Top 37%
3.8%
6
International Journal of Cancer
42 papers in training set
Top 0.2%
3.7%
7
Scientific Reports
3102 papers in training set
Top 34%
3.7%
8
Nature Communications
4913 papers in training set
Top 38%
3.7%
9
Clinical Epigenetics
53 papers in training set
Top 0.4%
2.4%
10
BioData Mining
15 papers in training set
Top 0.2%
2.1%
50% of probability mass above
11
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
1.9%
12
Environment International
42 papers in training set
Top 0.6%
1.8%
13
Cancer Research Communications
46 papers in training set
Top 0.4%
1.7%
14
Environmental Research
46 papers in training set
Top 0.8%
1.7%
15
Clinical Pharmacology & Therapeutics
25 papers in training set
Top 0.3%
1.7%
16
JNCI Cancer Spectrum
10 papers in training set
Top 0.2%
1.7%
17
Toxicology and Applied Pharmacology
13 papers in training set
Top 0.1%
1.5%
18
Metabolites
50 papers in training set
Top 0.5%
1.5%
19
Cancer Research
116 papers in training set
Top 2%
1.4%
20
The Lancet Digital Health
25 papers in training set
Top 0.6%
1.3%
21
International Journal of Environmental Research and Public Health
124 papers in training set
Top 5%
1.3%
22
Clinical and Translational Science
21 papers in training set
Top 0.6%
1.3%
23
Scientific Data
174 papers in training set
Top 2%
1.0%
24
JMIR mHealth and uHealth
10 papers in training set
Top 0.3%
0.9%
25
Epigenetics
43 papers in training set
Top 0.7%
0.9%
26
European Journal of Human Genetics
49 papers in training set
Top 1%
0.8%
27
PLOS Computational Biology
1633 papers in training set
Top 24%
0.8%
28
The Prostate
11 papers in training set
Top 0.1%
0.7%
29
Environmental Pollution
35 papers in training set
Top 2%
0.7%
30
Journal of Translational Medicine
46 papers in training set
Top 3%
0.7%