Back

Batch Effect Correction in a Functional Colorectal Cancer Organoid Clinical Correlation Study

Oliver, G. R.; de Jesus Domingues, A.; Barnett, C. C.

2026-02-09 bioinformatics
10.64898/2026.02.05.704065 bioRxiv
Show abstract

Batch effects are recognized as major sources of technical confounding in high-throughput assays. However, their impact on organoid studies receives little attention in the literature. As organoids gain prominence as a class of emerging new approach methodologies (NAMs), consideration of batch variation will become increasingly important to ensure data reproducibility and accurate interpretation in pre-clinical and clinical studies. In this manuscript, we provide a practical description of our work in detecting, characterizing, and correcting batch effects in a prior published retrospective clinical colorectal cancer organoid drug-response study. We outline the workflow we employed, including exploratory diagnostics, experimental drift detection, and statistical adjustment. We detail the methods employed to evaluate batch effects, monitor longitudinal drift, and select approaches to remove technical artifacts, preserve biological signal and test for robustness. Our experience demonstrates that in even modestly sized studies, results can be adversely affected by insufficient consideration and attempts at ameliorating batch effects. By documenting the challenges we encountered and the solutions implemented within our study, we hope that we can provide a seminal practical reference for organoid researchers and enable increased discussion and adoption of robust batch-compensation practices in the organoid field, ensuring that the topic is more routinely addressed, improved, and eventually standardized.

Matching journals

The top 13 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 20%
9.2%
2
Analytical Chemistry
205 papers in training set
Top 0.5%
6.4%
3
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.5%
6.4%
4
BMC Bioinformatics
383 papers in training set
Top 2%
4.9%
5
Scientific Reports
3102 papers in training set
Top 27%
4.3%
6
SLAS Discovery
25 papers in training set
Top 0.1%
3.7%
7
Communications Biology
886 papers in training set
Top 2%
3.6%
8
Nature Communications
4913 papers in training set
Top 42%
3.3%
9
PLOS Computational Biology
1633 papers in training set
Top 13%
2.1%
10
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
1.9%
11
GigaScience
172 papers in training set
Top 1%
1.9%
12
SLAS Technology
11 papers in training set
Top 0.1%
1.9%
13
PNAS Nexus
147 papers in training set
Top 0.2%
1.7%
50% of probability mass above
14
Nucleic Acids Research
1128 papers in training set
Top 11%
1.7%
15
Molecular & Cellular Proteomics
158 papers in training set
Top 1%
1.7%
16
International Journal of Molecular Sciences
453 papers in training set
Top 9%
1.5%
17
Frontiers in Pharmacology
100 papers in training set
Top 3%
1.3%
18
Bioinformatics
1061 papers in training set
Top 8%
1.3%
19
Cancer Research Communications
46 papers in training set
Top 0.6%
1.3%
20
npj Systems Biology and Applications
99 papers in training set
Top 2%
1.2%
21
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.2%
22
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 1%
1.2%
23
iScience
1063 papers in training set
Top 21%
1.2%
24
eLife
5422 papers in training set
Top 53%
0.9%
25
Communications Chemistry
39 papers in training set
Top 0.8%
0.9%
26
Frontiers in Bioinformatics
45 papers in training set
Top 0.6%
0.9%
27
Genome Medicine
154 papers in training set
Top 7%
0.8%
28
Metabolites
50 papers in training set
Top 1%
0.8%
29
PeerJ
261 papers in training set
Top 15%
0.7%
30
Genome Biology
555 papers in training set
Top 7%
0.7%