Back

TopoFuseNet: Hierarchical Graph Representation Learning with Multi-Scale Topological Features for Accurate Drug Synergy Prediction

Wang, Q.; Shi, x.

2026-05-08 bioinformatics
10.64898/2026.05.05.722940 bioRxiv
Show abstract

Accurate prediction of drug synergy is paramount for developing effective combination therapies and advancing personalized medicine. Although methods based on graph neural networks (GNNs) have become a prevalent approach, they often treat molecules as flat graphs of connected atoms, thus overlooking their inherent hierarchical structure (i.e., atoms forming functional groups) and the critical topological information that governs molecular interactions. To address this limitation, we introduce TopoFuseNet, a novel hierarchical graph representation learning framework that integrates multi-scale topological features. The core innovations of TopoFuseNet include: 1) The first-ever application of "Group Centrality" from network science to cheminformatics, enabling the identification and quantification of functional groups crucial to drug activity; 2) A systematic, multi- path strategy to seamlessly integrate node-level (atom) and group-level (functional group) topological features into a Graph Attention Network (GAT) via feature augmentation, attention biasing, and hierarchical pooling; 3) A Differential Transformer module to deeply fuse multi-modal features learned from sequences, fingerprints, and our proposed hierarchical graph representations. Extensive experiments on two large-scale benchmark datasets, DrugComb and DrugCombDB, demonstrate that TopoFuseNet significantly outperforms state-of-the-art methods across multiple key metrics, including AUC, AUPRC, and F1-score, while exhibiting exceptional generalization robustness under various stringent cold-start scenarios. In-depth ablation studies further confirm the effectiveness and necessity of each proposed innovative module. Furthermore, multi-scale interpretability analysis and zero-shot cross-domain transfer experiments reveal that the model successfully captures molecular interaction rules with clear pharmacological significance, demonstrating immense practical potential for discovering novel combination therapies through large-scale virtual screening. Our work not only delivers a superior model for drug synergy prediction, but more importantly, it establishes a novel and scalable paradigm for effectively integrating hierarchical molecular structures and topological information into GNNs.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Advanced Science
249 papers in training set
Top 0.3%
18.7%
2
Briefings in Bioinformatics
326 papers in training set
Top 0.2%
12.7%
3
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 0.8%
7.2%
4
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.8%
6.4%
5
Bioinformatics
1061 papers in training set
Top 4%
4.9%
6
Nature Machine Intelligence
61 papers in training set
Top 0.9%
3.6%
50% of probability mass above
7
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
3.1%
8
Nature Communications
4913 papers in training set
Top 47%
2.1%
9
Nucleic Acids Research
1128 papers in training set
Top 9%
2.1%
10
National Science Review
22 papers in training set
Top 0.7%
1.9%
11
Patterns
70 papers in training set
Top 0.7%
1.9%
12
Science Bulletin
22 papers in training set
Top 0.3%
1.7%
13
Chemical Science
71 papers in training set
Top 1%
1.7%
14
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.2%
1.7%
15
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 33%
1.7%
16
Science China Life Sciences
26 papers in training set
Top 1%
1.3%
17
Quantitative Biology
11 papers in training set
Top 0.4%
1.2%
18
iScience
1063 papers in training set
Top 21%
1.2%
19
PLOS ONE
4510 papers in training set
Top 62%
1.0%
20
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
1.0%
21
Cell Systems
167 papers in training set
Top 10%
1.0%
22
Communications Chemistry
39 papers in training set
Top 0.8%
0.9%
23
The Journal of Physical Chemistry Letters
58 papers in training set
Top 1%
0.9%
24
Acta Pharmaceutica Sinica B
11 papers in training set
Top 0.7%
0.9%
25
PLOS Computational Biology
1633 papers in training set
Top 22%
0.9%
26
Communications Biology
886 papers in training set
Top 21%
0.8%
27
Bioinformatics Advances
184 papers in training set
Top 5%
0.7%
28
Molecular Plant
36 papers in training set
Top 1%
0.7%
29
Protein & Cell
25 papers in training set
Top 2%
0.7%
30
Nature Biomedical Engineering
42 papers in training set
Top 2%
0.7%