Back

CD276 in Meningioma Transcriptomic Classification: Internal Development, External Validation, and Stability-Informed Interpretation

Lee, H.; Kim, H.

2026-04-05 health informatics
10.64898/2026.04.03.26350116 medRxiv
Show abstract

Background: CD276 has been proposed as a candidate gene associated with the biological characteristics of meningioma, but its predictive position and interpretive significance within a transcriptomic classifier have not yet been clearly established. Accordingly, this study aimed to evaluate CD276 stepwise across internal model development, external validation, calibration, decision-analytic assessment, feature stability, and robustness analyses using public transcriptomic cohorts. Methods: The analyses in this study were organized into two interconnected notebooks. In Notebook A, we reconstructed the internal training cohort (GSE183653), evaluated the CD276 single-gene signal, and then developed a transcriptome-wide multigene classifier. We also performed permutation importance, bootstrap confidence interval, label permutation test, repeated cross-validation, CD276 ablation, and internal calibration analyses. In Notebook B, we reproduced the external validation cohort (GSE136661) in a fixed common-gene space, applied train-only recalibration and train-only threshold transfer, and extended the interpretation through decision curve analysis, stability analysis, enrichment analysis, and one-factor-at-a-time robustness analysis. Results: The internal training cohort consisted of 185 samples and 58,830 genes, of which 25 were WHO grade III cases. CD276 expression showed a significant association with WHO grade, but the internal discrimination of the CD276-only baseline was limited (ROC-AUC 0.628, average precision 0.323, balanced accuracy 0.540). In contrast, the initial transcriptome-wide model showed ROC-AUC 0.834 and PR-AUC 0.509, and under 5-fold cross-validation, the canonical fulltranscriptome model and the CD276-forced 5,001-feature branch showed mean ROC-AUC/PR-AUC of 0.854/0.564 and 0.855/0.606, respectively, outperforming the CD276-only baseline at 0.644/0.391. CD276 was not included in the initial 5,000-feature filtered set and ranked 900th among 5,001 features even in the forcibly included 5,001-feature branch. In paired ablation analysis, the performance difference attributable to inclusion of CD276 was effectively close to zero (delta ROCAUC 0.000062, delta PR-AUC 0.000056). Internal calibration analysis showed an overconfident probability pattern (Brier score 0.10501, intercept -1.421392, slope 0.413241). In external validation, the fixed multigene pipeline achieved ROC-AUC 0.928 and PR-AUC 0.335. Train-only recalibration improved calibration metrics while preserving discrimination, and decision curve analysis showed threshold-dependent but limited external utility. Stability analysis showed overlap between core-stable genes and high-impact genes, but CD276 was not supported as a dominant stable core feature and remained in the target-of-interest tier. In robustness analysis, some perturbations preserved the primary interpretation, whereas others revealed transform sensitivity or an alternative high-performing feature-space solution. Conclusions: CD276 is a gene of interest associated with meningioma grade, but it was difficult to interpret it as a strong standalone predictor or a dominant stable classifier feature. In this study, the main basis of predictive performance lay not in CD276 alone but in a broader multigene transcriptomic structure, and probability output needed to be interpreted conservatively with calibration taken into account. These findings position CD276 not as a direct single-gene classifier but as a biologymotivated target-of-interest that should be interpreted within a broader transcriptomic program.

Matching journals

The top 12 journals account for 50% of the predicted probability mass.

1
Scientific Reports
3102 papers in training set
Top 0.7%
19.1%
2
Journal of Infection
71 papers in training set
Top 0.1%
6.5%
3
Biology Methods and Protocols
53 papers in training set
Top 0.3%
3.7%
4
Communications Medicine
85 papers in training set
Top 0.1%
3.7%
5
eBioMedicine
130 papers in training set
Top 0.3%
3.7%
6
PLOS ONE
4510 papers in training set
Top 46%
2.5%
7
Computers in Biology and Medicine
120 papers in training set
Top 1%
2.4%
8
PeerJ
261 papers in training set
Top 4%
2.4%
9
Artificial Intelligence in Medicine
15 papers in training set
Top 0.2%
1.9%
10
Cancers
200 papers in training set
Top 2%
1.9%
11
Annals of Neurology
57 papers in training set
Top 1%
1.8%
12
Biomedicines
66 papers in training set
Top 0.7%
1.8%
50% of probability mass above
13
Diagnostics
48 papers in training set
Top 0.9%
1.8%
14
BMC Medicine
163 papers in training set
Top 3%
1.7%
15
Frontiers in Bioinformatics
45 papers in training set
Top 0.3%
1.5%
16
Informatics in Medicine Unlocked
21 papers in training set
Top 0.6%
1.4%
17
BMC Bioinformatics
383 papers in training set
Top 5%
1.3%
18
NeuroImage: Clinical
132 papers in training set
Top 3%
1.3%
19
Computational and Structural Biotechnology Journal
216 papers in training set
Top 6%
1.3%
20
Neuro-Oncology Advances
24 papers in training set
Top 0.4%
1.1%
21
Neuropathology and Applied Neurobiology
14 papers in training set
Top 0.4%
1.0%
22
The Lancet Digital Health
25 papers in training set
Top 0.7%
1.0%
23
Cancer Medicine
24 papers in training set
Top 1%
1.0%
24
Frontiers in Immunology
586 papers in training set
Top 6%
0.9%
25
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.6%
0.9%
26
Nature Communications
4913 papers in training set
Top 59%
0.9%
27
Frontiers in Medicine
113 papers in training set
Top 5%
0.9%
28
NeuroImage
813 papers in training set
Top 5%
0.8%
29
npj Precision Oncology
48 papers in training set
Top 1%
0.8%
30
Acta Neuropathologica
51 papers in training set
Top 1%
0.8%