Back

Directional Gene-Level Concordance and Methodological Constraints in Blood Transcriptomic and DNA Methylation Studies of Parkinson's Disease

Kaur, R.; Dewan, C.; Chauhan, I.; Sharma, K.; Sharma, S.

2026-05-20 neuroscience
10.64898/2026.05.17.725808 bioRxiv
Show abstract

Assessing reproducibility across different molecular profiling studies is a persistent methodological challenge (Zhang et al., 2009; Sweeney et al., 2017; Ioannidis, 2005). Differences in platform technology, cohort composition, analytical pipelines, and feature definitions often make it difficult to interpret cross-study comparisons based solely on gene-identity overlap. In this study, we conducted a retrospective computational analysis of seven publicly available analytical datasets (including alternative analytical pipelines applied to the same cohort) derived from five biologically independent peripheral blood transcriptomic and DNA methylation cohorts, comprising 3,487 samples (1,824 Parkinsons disease cases and 1,663 controls). Reproducibility was evaluated using gene-identity overlap, enrichment-based comparisons, and a permutation-based framework to assess directional consistency of effect estimates across datasets. We also tested the robustness of results by varying false discovery rate thresholds and applying alternative probe-to-gene collapsing strategies. All analyses were performed using reproducible workflows implemented in R and Python with fixed random seeds. Across independent cohorts, gene-identity overlap was generally limited, with enrichment ratios close to one, especially when datasets were generated using different platforms. In several datasets, limited numbers of statistically significant features further constrained overlap-based comparisons. In contrast, directional consistency showed greater stability. High levels of directional consistency were observed across independent cohort comparisons when restricted to overlapping statistically significant features and remained stable across statistical thresholds (90.0% at FDR < 0.05 and 82.8% at FDR < 0.10). When evaluated across the full shared gene universe without conditioning on statistical significance, directional consistency was substantially lower ([~]30 to 32%) but remained significantly above permutation-based null expectations. Permutation testing confirmed that the observed directional consistency exceeded what would be expected by chance. A combined analysis including methodological replicates (n [&ge;] 3 datasets) showed 98.3% directional consistency; however, this estimate includes non-independent analytical pipelines applied to the same cohort and reflects analytical stability rather than independent biological replication. Rather than introducing a new statistical method, this study examines how commonly used reproducibility metrics behave under crossstudy heterogeneity and identifies their practical limitations and appropriate use boundaries.

Matching journals

The top 14 journals account for 50% of the predicted probability mass.

1
BMC Genomics
328 papers in training set
Top 0.1%
8.6%
2
Scientific Reports
3102 papers in training set
Top 16%
6.5%
3
PLOS ONE
4510 papers in training set
Top 30%
5.0%
4
eLife
5422 papers in training set
Top 16%
5.0%
5
Frontiers in Neurology
91 papers in training set
Top 1%
3.7%
6
npj Parkinson's Disease
89 papers in training set
Top 0.5%
3.7%
7
Brain Communications
147 papers in training set
Top 0.8%
3.1%
8
eneuro
389 papers in training set
Top 3%
3.1%
9
Neurobiology of Disease
134 papers in training set
Top 2%
2.8%
10
BioTechniques
24 papers in training set
Top 0.1%
2.4%
11
International Journal of Molecular Sciences
453 papers in training set
Top 6%
1.9%
12
Acta Neuropathologica Communications
81 papers in training set
Top 0.4%
1.9%
13
Movement Disorders
62 papers in training set
Top 0.6%
1.8%
14
PeerJ
261 papers in training set
Top 7%
1.7%
50% of probability mass above
15
Human Brain Mapping
295 papers in training set
Top 3%
1.7%
16
Epigenetics
43 papers in training set
Top 0.4%
1.7%
17
Neurobiology of Aging
95 papers in training set
Top 2%
1.4%
18
Human Genetics and Genomics Advances
70 papers in training set
Top 0.4%
1.4%
19
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.4%
20
GeroScience
97 papers in training set
Top 1%
1.3%
21
eBioMedicine
130 papers in training set
Top 2%
1.3%
22
Frontiers in Neuroscience
223 papers in training set
Top 5%
1.1%
23
European Journal of Neuroscience
168 papers in training set
Top 0.9%
1.0%
24
Frontiers in Aging Neuroscience
67 papers in training set
Top 3%
1.0%
25
Frontiers in Genetics
197 papers in training set
Top 8%
0.9%
26
Neurology Genetics
14 papers in training set
Top 0.2%
0.9%
27
The American Journal of Human Genetics
206 papers in training set
Top 3%
0.8%
28
Life Science Alliance
263 papers in training set
Top 1%
0.8%
29
NeuroImage: Clinical
132 papers in training set
Top 4%
0.8%
30
Imaging Neuroscience
242 papers in training set
Top 3%
0.8%