Back

A Hierarchy-aware Gene Exploration Platform for Multi-layered Toxicogenomic Analysis: A Case Study on Acetaminophen-induced Hepatotoxicity

Kim, M.; Cui, Y.; Kim, M. G.

2026-04-14 bioinformatics
10.64898/2026.04.10.717684 bioRxiv
Show abstract

BackgroundThe interpretation of high-dimensional transcriptomic data remains a major challenge in mechanistic toxicology and drug safety assessment. Conventional clustering approaches based solely on expression profiles often fail to capture intrinsic biological relationships among genes, limiting interpretability and downstream analysis. MethodsWe developed a hierarchy-aware gene exploration platform that integrates structured biological knowledge from the HUGO Gene Nomenclature Committee (HGNC). The core of the framework is a similarity kernel based on a single-step hyperdiffusion formulation (HKH{top}), which embeds gene family hierarchy into the similarity space. The platform is implemented as an interactive web application supporting Uniform Manifold Approximation and Projection (UMAP) visualization, Leiden clustering, functional enrichment analysis, and hierarchy-based gene recommendation. ResultsApplied to a transcriptomic dataset of acetaminophen-induced acute liver failure (APAP-ALF), the proposed approach achieved a 33.8-fold improvement in functional coherence compared to an expression-only baseline. The hierarchy-aware embedding produced compact and biologically consistent clusters, enabling identification of key toxicological modules, including disruption of RNA processing, extracellular matrix remodeling, and impairment of lipid transport. In addition, the framework detected small but highly significant regulatory modules associated with epigenetic reprogramming. ConclusionBy incorporating biological hierarchy into gene similarity, the proposed platform enhances the interpretability of transcriptomic analysis and enables structured exploration of functional relationships. This approach provides a practical framework for mechanistic insight generation and supports more transparent and reproducible analysis in toxicogenomics. AvailabilityThe web application is freely available at https://hgncgeneexplorer.streamlit.app/.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
BMC Bioinformatics
383 papers in training set
Top 0.1%
23.4%
2
Bioinformatics
1061 papers in training set
Top 2%
14.9%
3
PLOS ONE
4510 papers in training set
Top 32%
4.8%
4
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.9%
4.5%
5
Toxicological Sciences
38 papers in training set
Top 0.1%
4.5%
50% of probability mass above
6
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.1%
7
PLOS Computational Biology
1633 papers in training set
Top 8%
4.1%
8
Bioinformatics Advances
184 papers in training set
Top 1%
3.8%
9
Nature Communications
4913 papers in training set
Top 46%
2.2%
10
Scientific Reports
3102 papers in training set
Top 52%
2.0%
11
Environment International
42 papers in training set
Top 0.6%
1.9%
12
GigaScience
172 papers in training set
Top 2%
1.5%
13
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.4%
14
Advanced Science
249 papers in training set
Top 14%
1.3%
15
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
1.0%
16
Frontiers in Molecular Biosciences
100 papers in training set
Top 3%
1.0%
17
Environmental Research
46 papers in training set
Top 1%
1.0%
18
Toxicology and Applied Pharmacology
13 papers in training set
Top 0.2%
0.9%
19
Cell Reports Methods
141 papers in training set
Top 5%
0.8%
20
Frontiers in Pharmacology
100 papers in training set
Top 4%
0.8%
21
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 8%
0.8%
22
Molecules
37 papers in training set
Top 2%
0.7%
23
Journal of Translational Medicine
46 papers in training set
Top 3%
0.7%
24
BioData Mining
15 papers in training set
Top 1%
0.7%
25
iScience
1063 papers in training set
Top 36%
0.7%
26
MethodsX
14 papers in training set
Top 0.6%
0.7%
27
Scientific Data
174 papers in training set
Top 3%
0.7%
28
Metabolites
50 papers in training set
Top 1%
0.7%
29
Archives of Clinical and Biomedical Research
28 papers in training set
Top 3%
0.5%