Back

Toward Early Diagnosis and Therapeutic Discovery in CLN3 Disease: A Computational Biomarker Discovery Framework

Sun, S.; Dang Do, A. N.; Thurm, A.; Soldatos, A.; Zhu, Q.

2026-05-07 genetic and genomic medicine
10.64898/2026.05.01.26352147 medRxiv
Show abstract

BackgroundCLN3 disease, also known as juvenile neuronal ceroid lipofuscinosis, is a rare and neurodegenerative disorder characterized by the accumulation of lipopigments in the cells, progressive cognitive decline, seizures, and vision loss. Biomarker discovery in CLN3 disease is essential for enabling early and accurate diagnosis, which is critical given its neurodegenerative course. Biomarkers provide objective measures to track disease progression, stratify patients, and serve as surrogate endpoints in clinical trials, thereby accelerating therapeutic development. They also offer valuable insights into underlying disease mechanisms and treatment response, ultimately advancing individualized medicine and improving clinical outcomes. MethodsWe developed various machine learning models to predict potential protein biomarkers in CLN3 disease using proteomics data and laboratory tests collected from participants in a prospective, observational cohort. To prioritize and evaluate these candidates, we conducted protein-protein interaction (PPI) network analysis and pathway enrichment, ranking proteins based on their topological importance. The top 20 proteins were selected as candidate biomarkers and corroborated using a publicly available CLN3 transcriptomic dataset. Receiver operating characteristic (ROC) curve analysis was performed to assess the discriminative power of each candidate, with AUROC values calculated to quantify their classification performance. ResultsOur computational approach identified six promising biomarker candidates: OSM, IL6R, LMNB1, HIF1A, NPM1, and CSF1. Among them, OSM and HIF1A showed marked differential expression in CLN3 patients, particularly those with slow disease progression. LMNB1 expression was elevated in patients with faster disease progression, suggesting its utility as a prognostic biomarker. These findings highlight the robustness of our biomarker selection, indicating that these six genes may serve as effective diagnostic markers for CLN3 disease. ConclusionsOur findings demonstrate the utility of data-driven approaches for biomarker discovery in CLN3 and offer new insights into the molecular mechanisms of the disease, with broader implications for improving diagnosis and prognosis in other rare diseases.

Matching journals

The top 13 journals account for 50% of the predicted probability mass.

1
Annals of Clinical and Translational Neurology
29 papers in training set
Top 0.1%
9.4%
2
Alzheimer's & Dementia
143 papers in training set
Top 0.9%
7.4%
3
Human Mutation
29 papers in training set
Top 0.1%
5.0%
4
Scientific Reports
3102 papers in training set
Top 26%
4.5%
5
Annals of Neurology
57 papers in training set
Top 0.5%
3.7%
6
PLOS ONE
4510 papers in training set
Top 37%
3.7%
7
Neurobiology of Disease
134 papers in training set
Top 2%
3.2%
8
Genetics in Medicine
69 papers in training set
Top 0.5%
2.8%
9
Frontiers in Molecular Biosciences
100 papers in training set
Top 0.9%
2.2%
10
Brain Communications
147 papers in training set
Top 1%
2.1%
11
Human Molecular Genetics
130 papers in training set
Top 1%
2.1%
12
Molecular Neurodegeneration
49 papers in training set
Top 0.3%
2.1%
13
Orphanet Journal of Rare Diseases
18 papers in training set
Top 0.2%
1.9%
50% of probability mass above
14
Journal of Neurology
26 papers in training set
Top 0.5%
1.9%
15
Biomedicines
66 papers in training set
Top 0.5%
1.9%
16
Brain
154 papers in training set
Top 3%
1.7%
17
Frontiers in Human Neuroscience
67 papers in training set
Top 1%
1.7%
18
Human Genetics and Genomics Advances
70 papers in training set
Top 0.3%
1.7%
19
npj Parkinson's Disease
89 papers in training set
Top 0.7%
1.7%
20
Frontiers in Aging Neuroscience
67 papers in training set
Top 2%
1.4%
21
Genome Medicine
154 papers in training set
Top 5%
1.4%
22
eBioMedicine
130 papers in training set
Top 2%
1.4%
23
EBioMedicine
39 papers in training set
Top 0.7%
0.9%
24
Clinical Immunology
21 papers in training set
Top 0.5%
0.8%
25
Frontiers in Oncology
95 papers in training set
Top 3%
0.8%
26
Clinical Chemistry
22 papers in training set
Top 0.8%
0.8%
27
Neurology Genetics
14 papers in training set
Top 0.3%
0.8%
28
The American Journal of Human Genetics
206 papers in training set
Top 4%
0.8%
29
Molecular Pharmaceutics
16 papers in training set
Top 0.5%
0.8%
30
Epilepsia
49 papers in training set
Top 0.7%
0.8%