Back

Improving the reliability of fMRI-based predictions of intelligence via semi-blind machine learning

Lohmann, G.; Heczko, S.; Mahler, L.; Wang, Q.; Steiglechner, J.; Kumar, V. J.; Roost, M.; Jost, J.; Scheffler, K.

2023-11-07 neuroscience
10.1101/2023.11.03.565485 bioRxiv
Show abstract

Predicting neuromarkers for cognitive abilities using fMRI has been a major focus of research in the past few years. However, it has recently been reported that many thousands of participants are required to obtain reproducible results (Marek et al (2022)). This appears to be a major impediment to obtaining neuromarkers from fMRI because large sample sizes are typically not available in neuroimaging studies. Here we show that the out-of-sample prediction accuracy can be dramatically improved by supplementing fMRI with readily available non-imaging information so that reliable predictive modeling becomes feasible even for small sample sizes. Specifically, we introduce a novel machine learning method that predicts intelligence from resting-state fMRI data, leveraging educational level as supplementary information. We refer to our approach as "semi-blind machine learning (SML)" because it operates under the assumption that supplementary information, such as educational level, is available for subjects in both the training and test sets. This setup closely mirrors real-world scenarios, especially in clinical contexts, where patient background information typically exists and can be utilized to boost prediction accuracy. However, guarding against bias is crucial. Subjects should not be categorized as more intelligent simply based on their higher education levels. Therefore, our approach contains a component explicitly designed for bias control. We have applied our method to three different data collections and observed marked improvements in prediction accuracies across a wide range of sample sizes. We anticipate that semi-blind machine learning provides a promising approach to fMRI-based predictive modelling with the potential for a wide range of future applications.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
NeuroImage
813 papers in training set
Top 0.2%
32.7%
2
Aperture Neuro
18 papers in training set
Top 0.1%
10.0%
3
Human Brain Mapping
295 papers in training set
Top 0.8%
8.3%
50% of probability mass above
4
Imaging Neuroscience
242 papers in training set
Top 0.6%
6.3%
5
Medical Image Analysis
33 papers in training set
Top 0.4%
3.6%
6
Frontiers in Neuroscience
223 papers in training set
Top 2%
3.0%
7
eLife
5422 papers in training set
Top 31%
2.7%
8
Frontiers in Psychiatry
83 papers in training set
Top 1%
2.7%
9
NeuroImage: Clinical
132 papers in training set
Top 2%
2.6%
10
Frontiers in Neuroimaging
11 papers in training set
Top 0.1%
1.9%
11
PLOS ONE
4510 papers in training set
Top 51%
1.9%
12
Scientific Reports
3102 papers in training set
Top 59%
1.7%
13
Developmental Cognitive Neuroscience
81 papers in training set
Top 0.4%
1.5%
14
PLOS Computational Biology
1633 papers in training set
Top 18%
1.5%
15
Journal of Neuroscience Methods
106 papers in training set
Top 1%
1.5%
16
Frontiers in Neuroinformatics
38 papers in training set
Top 0.5%
1.3%
17
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 40%
0.9%
18
Communications Biology
886 papers in training set
Top 24%
0.7%
19
Network Neuroscience
116 papers in training set
Top 1%
0.7%
20
Neuroimage: Reports
22 papers in training set
Top 0.1%
0.7%
21
Cerebral Cortex
357 papers in training set
Top 2%
0.7%
22
eneuro
389 papers in training set
Top 10%
0.6%