Back

Transcriptomic Profiling and Regulatory Network Analysis of Ten Metabolic Transporters Across Five Diabetic Complications: A Multi-Dataset, Twelve-Phase GEO Bioinformatics Study

Adegboyega, B. B.; Ekanem, P. C.; Awolaja, O. O.; Osarietin, E.; Okorie, B.

2026-05-27 bioinformatics
10.64898/2026.05.23.727195 bioRxiv
Show abstract

ObjectiveDiabetic complications collectively represent one of the most urgent unresolved problems in medicine, yet the field continues to study them in near-complete isolation from one another. No unified framework has systematically characterised the shared and divergent molecular signatures of ten clinically critical metabolic transporters across all five major complications, cardiomyopathy (DCM), nephropathy (DN), retinopathy (DR), peripheral neuropathy (DPN), and atherosclerosis and vasculopathy (DAD), through an integrated, multi-method computational pipeline. This study was designed to address that gap directly. MethodsEleven GEO microarray datasets comprising 118 diabetic and 76 control samples were analysed through twelve sequential phases: differential expression analysis, pan-complication overlap, weighted gene co-expression network analysis (WGCNA), GO/KEGG functional enrichment with gene set enrichment analysis (GSEA), STRING protein-protein interaction (PPI) network construction, competing endogenous RNA (ceRNA) network mapping, transcription factor activity inference using a VIPER-style algorithm, immune cell infiltration estimation by single-sample GSEA, diagnostic biomarker modelling using LASSO logistic regression and Random Forest classification, CMap-style drug repurposing by connectivity scoring, and two-sample Mendelian randomisation (MR) employing four independent estimators (inverse-variance weighted [IVW], MR-Egger, weighted median, and weighted mode). ResultsCD36 was the only transporter to achieve significant dysregulation across three independently sourced tissue types (DN, DR, DPN; logFC range 0.88 to 2.18), whilst TLR4 exhibited the highest fold-change in the study (logFC = 3.88, DPN) and the greatest WGCNA module membership (kME = 0.976, DPN). SERCA2 was significantly downregulated in three complications (DCM, DN, and DR) at formal significance thresholds and trended negatively in the remaining two (DPN and DAD), constituting the most consistently suppressed transporter in the study. Its universal downregulation was explicable through four convergent mechanisms spanning transcriptional, oxidative, ceRNA-mediated, and transcription factor-level regulation, and was confirmed as causally relevant to diabetic cardiomyopathy by eQTL Mendelian randomisation (beta = -0.085, p = 0.005). miR-21-5p was identified as the dominant ceRNA regulatory bridge (betweenness centrality = 0.428; 6.7-fold above the second-ranked miRNA), with MALAT1 as the sole lncRNA hub active in all five complications. PPARgamma and TP53 repression emerged as the leading transcription factor-level explanations for the simultaneous metabolic and inflammatory dysregulation characteristic of the diabetic transcriptome. Immune deconvolution revealed DCM as immunologically quiescent, DN as comprehensively infiltrated (ten enriched cell types), and DPN as mast-cell-dominated, identifying a cellular mechanism for TLR4-driven neuroinflammation that has not previously been systematically characterised. GLUT4 achieved perfect diagnostic discrimination for DPN (AUC = 1.000, p < 0.001; LASSO coefficient = -2.143), whilst SGLT2 was the leading DAD diagnostic marker (AUC = 1.000, p = 0.002). Epalrestat was the sole pan-complication drug repurposing candidate (significant connectivity reversal in four of five complications). Mendelian randomisation confirmed causal effects of T2DM genetic liability on all five complications (all p < 0.0001, all four estimators concordant), and eQTL-MR identified TLR4 (beta = +0.073, p = 0.006) and CD36 (beta = +0.070, p = 0.008) as causal risk factors for DN, SERCA2 reduced expression as a causal driver of DCM (beta = -0.085, p = 0.005), and SGLT2 expression as a causal protector against DN (beta = -0.070, p = 0.013). ConclusionsThis twelve-phase investigation identifies a pan-complication CD36/TLR4 inflammatory dyad and a SERCA2 calcium-mitochondrial effector axis, both confirmed at seven independent analytical levels, including causal genomic inference. GLUT4 downregulation defines DPN at the diagnostic level with perfect accuracy and is explicable through a five-layer mechanistic chain from MODY transcription factor inactivation to ceRNA competitive pressure. Epalrestat warrants prospective evaluation beyond its established DPN indication. These findings collectively constitute the most comprehensive computational characterisation of metabolic transporter biology in diabetic complications to date. RESEARCH IN CONTEXTO_ST_ABSWhat is already known about this subject?C_ST_ABSThe five major diabetic complications (cardiomyopathy, nephropathy, retinopathy, peripheral neuropathy, and atherosclerosisare) individually well-characterised, and several key metabolic transporters, including SGLT2, CD36, TLR4, SERCA2, and GLUT4, have established roles in one or more of these conditions. Mendelian randomisation has confirmed that T2DM genetic liability causally increases the risk of each complication independently. However, no study has examined all ten major metabolic transporters across all five complications simultaneously, and the shared versus complication-specific regulatory architectures of these transporters remain entirely uncharacterised. What is the key question?Which metabolic transporters are consistently dysregulated across all five diabetic complications, which are complication-specific, and can their shared regulatory mechanisms, from RNA regulation through to causal genetic evidence be used to identify diagnostic biomarkers and actionable therapeutic targets that transcend individual complication boundaries? What are the key findings and their implications for the field?CD36 and TLR4 constitute a pan-complication inflammatory dyad confirmed at seven independent analytical levels, including Mendelian randomisation causal evidence (both p < 0.01 for diabetic nephropathy). SERCA2 is universally suppressed across all five complications and is a causal driver of diabetic cardiomyopathy by eQTL-MR (p = 0.005). GLUT4 is a perfect single-gene diagnostic for diabetic peripheral neuropathy (AUC = 1.000) and a causal renal protector. Mast cells are identified as the innate cellular effectors of TLR4-driven diabetic neuropathy. Epalrestat demonstrates pan-complication therapeutic potential beyond its licensed DPN indication. These findings provide a unified mechanistic framework and a translational roadmap grounded in causal genomic evidence, with implications for both complication-targeted and pan-complication therapeutic strategies.

Matching journals

The top 12 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 22%
8.7%
2
Cell Reports Medicine
140 papers in training set
Top 0.3%
7.0%
3
Genome Medicine
154 papers in training set
Top 0.9%
6.6%
4
eBioMedicine
130 papers in training set
Top 0.1%
5.0%
5
Diabetologia
36 papers in training set
Top 0.2%
5.0%
6
JCI Insight
241 papers in training set
Top 1%
3.8%
7
The Journal of Clinical Endocrinology & Metabolism
35 papers in training set
Top 0.3%
3.7%
8
Journal of Translational Medicine
46 papers in training set
Top 0.4%
2.4%
9
Clinical Pharmacology & Therapeutics
25 papers in training set
Top 0.2%
2.4%
10
Scientific Reports
3102 papers in training set
Top 48%
2.2%
11
The American Journal of Human Genetics
206 papers in training set
Top 2%
2.1%
12
PLOS ONE
4510 papers in training set
Top 46%
2.1%
50% of probability mass above
13
Diabetes
53 papers in training set
Top 0.3%
2.1%
14
Advanced Science
249 papers in training set
Top 9%
1.9%
15
PLOS Genetics
756 papers in training set
Top 8%
1.8%
16
Frontiers in Genetics
197 papers in training set
Top 4%
1.7%
17
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.7%
18
Frontiers in Immunology
586 papers in training set
Top 5%
1.3%
19
eLife
5422 papers in training set
Top 48%
1.3%
20
PLOS Medicine
98 papers in training set
Top 3%
1.3%
21
Bioinformatics Advances
184 papers in training set
Top 4%
1.1%
22
Nature Metabolism
56 papers in training set
Top 2%
1.1%
23
Human Molecular Genetics
130 papers in training set
Top 2%
1.0%
24
Molecular Therapy - Nucleic Acids
24 papers in training set
Top 0.3%
0.8%
25
Clinical and Translational Science
21 papers in training set
Top 0.9%
0.8%
26
Disease Models & Mechanisms
119 papers in training set
Top 2%
0.8%
27
Journal of Molecular and Cellular Cardiology
39 papers in training set
Top 0.7%
0.8%
28
Nature Medicine
117 papers in training set
Top 5%
0.8%
29
Nature Genetics
240 papers in training set
Top 7%
0.8%
30
Communications Biology
886 papers in training set
Top 24%
0.7%