Back

simCRISPR: Modeling experimental complexity in pooled CRISPR screens

Zhu, Z.; Dong, X.; KIM, C. H.; Maugee, C.; Barbazuk, W. B.; Vulpe, C.; Bacher, R.

2026-05-15 bioinformatics
10.64898/2026.05.14.725042 bioRxiv
Show abstract

Pooled CRISPR screens are widely used to investigate gene function and uncover genetic interactions. However, benchmarking computational methods for detecting gene-by-environment (GxE) interactions remains difficult because ground truth is rarely available and existing simulation tools are not designed for GxE screening contexts. To address this, we developed simCRISPR, a flexible simulation framework for generating pooled CRISPR screen data under complex experimental designs. Using simulated datasets informed by empirical CRISPR screen designs, we evaluated commonly used analysis methods, comparing normalization strategies based on safe-harbor versus non-targeting sgRNAs and assessing empirical log2FC thresholds as an additional effect-size criterion. We found that safe-harbor-based normalization improved interaction detection when DNA damage-related effects were present, particularly when combined with empirical log2FC thresholding for DESeq2. Application of this workflow to a doxorubicin GxE screen further showed that safe-harbor-based normalization reduced bias in log2FC distributions and identified additional biologically relevant candidates. simCRISPR is available at https://github.com/bachergroup/simCRISPR.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 1%
19.6%
2
Genome Biology
555 papers in training set
Top 0.3%
10.2%
3
PLOS Computational Biology
1633 papers in training set
Top 4%
9.2%
4
Cell Systems
167 papers in training set
Top 2%
7.2%
5
Briefings in Bioinformatics
326 papers in training set
Top 0.7%
6.9%
50% of probability mass above
6
Nucleic Acids Research
1128 papers in training set
Top 3%
6.4%
7
Nature Communications
4913 papers in training set
Top 33%
4.9%
8
BMC Bioinformatics
383 papers in training set
Top 2%
4.2%
9
NAR Genomics and Bioinformatics
214 papers in training set
Top 1.0%
2.9%
10
Bioinformatics Advances
184 papers in training set
Top 2%
2.6%
11
Genome Medicine
154 papers in training set
Top 4%
1.7%
12
Computational and Structural Biotechnology Journal
216 papers in training set
Top 6%
1.3%
13
Nature Biotechnology
147 papers in training set
Top 5%
1.3%
14
PLOS ONE
4510 papers in training set
Top 58%
1.3%
15
Nature Methods
336 papers in training set
Top 5%
1.2%
16
Genome Research
409 papers in training set
Top 3%
1.2%
17
GigaScience
172 papers in training set
Top 2%
1.2%
18
Cell Reports Methods
141 papers in training set
Top 5%
0.8%
19
The American Journal of Human Genetics
206 papers in training set
Top 3%
0.8%
20
Patterns
70 papers in training set
Top 2%
0.8%
21
The CRISPR Journal
33 papers in training set
Top 0.3%
0.8%
22
Scientific Reports
3102 papers in training set
Top 74%
0.8%
23
Advanced Science
249 papers in training set
Top 19%
0.8%
24
Nature Machine Intelligence
61 papers in training set
Top 4%
0.7%
25
Frontiers in Genetics
197 papers in training set
Top 11%
0.6%
26
Cell Genomics
162 papers in training set
Top 8%
0.6%
27
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 7%
0.6%
28
BMC Genomics
328 papers in training set
Top 7%
0.6%