Back

The effect of data preprocessing on spike correlation analysis results

Oberste-Frielinghaus, J.; Ito, J.; Grün, S.

2025-12-02 neuroscience
10.1101/2025.11.28.691090 bioRxiv
Show abstract

In recent years, an increasing number of large electrophysiological data sets have become publicly available, thereby providing researchers with the opportunity to analyze spike train data without conducting their own experiments. While this is undoubtedly a positive development, it increases the need for proper documentation on how the data were collected and what preprocessing was performed on the data, since interpreting analysis results in ignorance of these pieces of information can lead to wrong conclusions. An important preprocessing step is the removal of artifacts from the recordings. Electrophysiological recordings are particularly susceptible to electrical cross-talks between recording channels, resulting in artifact spikes that are coincident in multiple channels on the time scale of the data sampling rate, i.e., 1/30 ms in popular setups. The removal by signal whitening is only possible if also the raw sampled data are available, thus to eliminate this type of artifact is to remove all coincident spikes on the recording time scale to definitely avoid artifact spikes. However, given the lack of the "ground truth", this step has the potential to eliminate, in conjunction with the artifacts, components of the data that are pertinent to the research objective. In this study, we use a modified version of the Unitary Event Analysis and demonstrate that such preprocessing results in significantly lower correlations than expected by chance even on longer time scales. We also propose a method to correct for the bias introduced by this preprocessing. Thus, slight changes in the preprocessing have potentially strong impact on analysis results and methods need to consider these effects.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Journal of Neuroscience Methods
106 papers in training set
Top 0.1%
28.2%
2
PLOS ONE
4510 papers in training set
Top 18%
10.3%
3
Scientific Reports
3102 papers in training set
Top 12%
7.3%
4
Neuroinformatics
40 papers in training set
Top 0.1%
6.4%
50% of probability mass above
5
eneuro
389 papers in training set
Top 1%
4.9%
6
Frontiers in Neuroscience
223 papers in training set
Top 1%
3.7%
7
Journal of Neural Engineering
197 papers in training set
Top 0.8%
3.1%
8
Frontiers in Neuroinformatics
38 papers in training set
Top 0.2%
3.1%
9
Sensors
39 papers in training set
Top 0.8%
1.9%
10
PLOS Computational Biology
1633 papers in training set
Top 14%
1.9%
11
Frontiers in Human Neuroscience
67 papers in training set
Top 1%
1.7%
12
European Journal of Neuroscience
168 papers in training set
Top 0.6%
1.5%
13
Biomedical Signal Processing and Control
18 papers in training set
Top 0.3%
1.4%
14
Behavior Research Methods
25 papers in training set
Top 0.1%
1.4%
15
Neuroscience
88 papers in training set
Top 2%
1.2%
16
BMC Bioinformatics
383 papers in training set
Top 5%
1.2%
17
Network Neuroscience
116 papers in training set
Top 1%
0.8%
18
Human Brain Mapping
295 papers in training set
Top 4%
0.8%
19
Bioengineering
24 papers in training set
Top 1%
0.7%
20
Scientific Data
174 papers in training set
Top 3%
0.7%
21
Frontiers in Psychiatry
83 papers in training set
Top 3%
0.7%
22
Journal of Visualized Experiments
30 papers in training set
Top 0.8%
0.7%
23
MethodsX
14 papers in training set
Top 0.6%
0.7%
24
Physiological Reports
35 papers in training set
Top 2%
0.5%
25
Imaging Neuroscience
242 papers in training set
Top 4%
0.5%
26
IEEE Transactions on Neural Systems and Rehabilitation Engineering
40 papers in training set
Top 0.7%
0.5%
27
Neuroscience Letters
28 papers in training set
Top 2%
0.5%
28
Communications Biology
886 papers in training set
Top 31%
0.5%