Back

Discovery of TDP-43 aggregation inhibitors via a hybrid machine learning framework

Kapsiani, S.; Vora, S.; Fernandez-Villegas, A.; Kaminski, C. F.; Läubli, N. F.; Kaminski Schierle, G. S.

2026-02-14 bioinformatics
10.64898/2026.02.12.705375 bioRxiv
Show abstract

TAR DNA-binding protein 43 (TDP-43) aggregation is a hallmark of several neurodegenerative diseases, including amyotrophic lateral sclerosis and frontotemporal dementia. Recent therapeutic efforts have highlighted the potential of small molecules capable of inhibiting TDP-43 aggregation; however, no effective treatments currently exist. Here, we developed a hybrid machine learning approach combining graph neural network (GNN) embeddings with traditional chemical descriptors and biological target annotations. Using XGBoost as the final classifier enabled model interpretability through SHAP analysis, allowing the identification of key chemical features and target annotations associated with TDP-43 anti-aggregation activity. Complementary Monte Carlo Tree Search analysis highlighted specific chemical substructures linked to predicted activity. By screening an external library of 3,853 small molecules, the model identified two compounds not previously evaluated against TDP-43 aggregation, namely berberrubine and PE859. Molecular docking analysis revealed that both compounds interact favourably with the TDP-43 RNA recognition motif (RRM) domain through distinct binding modes. Experimental validation showed that both compounds significantly reduced TDP-43 aggregation in HEK cells. Further testing in Caenorhabditis elegans expressing human TDP-43 demonstrated that PE859 significantly rescued locomotor defects, while berberrubine showed partial improvement. This work establishes a hybrid machine learning approach for accelerating small molecule drug discovery, yielding two promising therapeutic candidates for TDP-43 proteinopathies.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.1%
35.0%
2
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.2%
8.6%
3
Scientific Reports
3102 papers in training set
Top 17%
6.4%
50% of probability mass above
4
International Journal of Molecular Sciences
453 papers in training set
Top 1%
5.0%
5
Advanced Science
249 papers in training set
Top 5%
3.7%
6
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.7%
7
PLOS ONE
4510 papers in training set
Top 38%
3.7%
8
Molecular Therapy Nucleic Acids
32 papers in training set
Top 0.3%
1.7%
9
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.7%
10
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.1%
1.4%
11
PLOS Computational Biology
1633 papers in training set
Top 19%
1.3%
12
International Journal of Biological Macromolecules
65 papers in training set
Top 2%
1.3%
13
ACS Omega
90 papers in training set
Top 2%
1.3%
14
Molecules
37 papers in training set
Top 1%
1.3%
15
Journal of Medicinal Chemistry
68 papers in training set
Top 0.9%
1.0%
16
Communications Chemistry
39 papers in training set
Top 0.6%
1.0%
17
Bioinformatics Advances
184 papers in training set
Top 4%
0.9%
18
Frontiers in Genetics
197 papers in training set
Top 8%
0.9%
19
Communications Biology
886 papers in training set
Top 20%
0.8%
20
Bioinformatics
1061 papers in training set
Top 9%
0.8%
21
BMC Bioinformatics
383 papers in training set
Top 7%
0.8%
22
Computers in Biology and Medicine
120 papers in training set
Top 4%
0.8%
23
Pharmaceuticals
33 papers in training set
Top 2%
0.8%
24
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
25
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 10%
0.7%
26
Biomolecules
95 papers in training set
Top 3%
0.5%
27
Chemical Science
71 papers in training set
Top 3%
0.5%
28
Journal of Biomolecular Structure and Dynamics
43 papers in training set
Top 2%
0.5%
29
npj Systems Biology and Applications
99 papers in training set
Top 3%
0.5%
30
ACS Pharmacology & Translational Science
40 papers in training set
Top 1%
0.5%