
Weighted Off-target and Efficiency Scoring Reveal Genome Composition-Dependent Optimal CRISPR/Cas9 Guide Design

Krishna Y K, Y.

2025-08-13 bioinformatics
10.1101/2025.08.13.670047 bioRxiv

Although CRISPR/Cas9 has transformed genome editing, the efficiency and specificity of guide RNAs remain major obstacles to successful experimental design. In this work, we introduce a computational method for optimizing CRISPR/Cas9 guide RNAs that combines PAM diversity, local efficiency penalties, and weighted off-target scoring to find high-performing guides across a range of genome compositions. To capture a range of natural genomic complexity, we simulated five sample genomes: AT-rich, GC-rich, balanced GC content, and high-repeat variants. For each genome, all 20-nucleotide target sequences were scanned, and off-target potential was assessed by permitting up to two mismatches, with weighted penalties for seed-region positions. To account for possible secondary-structure effects, the efficiency assessment included both global GC content and local sliding-window penalties. We also examined several PAM sequences relevant to different Cas9 variants to assess how they affect guide selection. The results show that efficiency scores vary with genome composition and that the highest-scoring guides consistently have zero predicted off-target events. GC-rich genomes tended to yield slightly higher-efficiency guides than AT-rich genomes, while balanced genomes showed intermediate trends. Analysis across several genomes indicates that PAM type affects guide efficiency and that the combination of efficiency and off-target score consistently identifies guides with good expected performance. These findings are illustrated with three-dimensional scatter plots of efficiency and off-target counts versus genomic position, violin plots of off-target distributions, and genome-wide heatmaps highlighting the best guide positions.
In addition to offering a generalizable computational method for choosing CRISPR/Cas9 guides that optimize specificity and efficiency, our study gives new insight into the interactions among genome composition, PAM selection, and guide design criteria. By accounting for weighted off-target penalties, genome complexity, and local efficiency effects, this in silico framework overcomes some of the main drawbacks of earlier simulations. It is also readily applicable to guide selection for experimental research on a variety of organisms. The results lay the groundwork for future advances in genome editing by establishing a predictive computational framework that can expedite CRISPR/Cas9 research and reduce trial and error in guide selection.
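
The scoring steps named in the abstract (PAM-anchored scanning of 20-nucleotide targets, an efficiency score built from global GC content plus sliding-window penalties, and off-target counting with up to two mismatches weighted by seed position) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the seed length, the mismatch weights, the window size, the 50% GC optimum, the `1 / (1 + penalty)` risk formula, and every function name are assumptions, since the abstract does not specify them. For brevity it scans the forward strand only.

```python
import re

# All constants below are assumed for illustration; the abstract gives no values.
SEED_LEN = 12         # PAM-proximal bases treated as the seed region
SEED_WEIGHT = 1.0     # penalty per mismatch inside the seed
DISTAL_WEIGHT = 0.5   # penalty per mismatch outside the seed
MAX_MISMATCHES = 2    # from the abstract: up to two mismatches permitted


def gc_fraction(seq):
    """Fraction of G and C bases in seq."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def find_guides(genome, pam_regex="[ACGT]GG", guide_len=20):
    """Return (position, guide) for each guide_len-mer directly followed by a PAM."""
    # The lookahead lets overlapping candidate sites all be reported.
    pattern = re.compile(rf"(?=([ACGT]{{{guide_len}}})(?:{pam_regex}))")
    return [(m.start(), m.group(1)) for m in pattern.finditer(genome)]


def efficiency(guide, window=5, window_penalty=0.05):
    """Global GC term (1.0 at 50% GC) minus a penalty per extreme local window."""
    score = 1.0 - 2.0 * abs(gc_fraction(guide) - 0.5)
    for i in range(len(guide) - window + 1):
        local = gc_fraction(guide[i:i + window])
        if local == 0.0 or local == 1.0:  # window is all-AT or all-GC
            score -= window_penalty
    return max(score, 0.0)


def weighted_mismatch(guide, site):
    """Return (mismatch count, weighted penalty); the seed is the 3' end of the guide."""
    seed_start = len(guide) - SEED_LEN
    count, penalty = 0, 0.0
    for i, (a, b) in enumerate(zip(guide, site)):
        if a != b:
            count += 1
            penalty += SEED_WEIGHT if i >= seed_start else DISTAL_WEIGHT
    return count, penalty


def off_target_risk(guide, position, genome):
    """Sum a risk term over every other site within MAX_MISMATCHES of the guide."""
    risk = 0.0
    for j in range(len(genome) - len(guide) + 1):
        if j == position:
            continue  # skip the intended target itself
        count, penalty = weighted_mismatch(guide, genome[j:j + len(guide)])
        if count <= MAX_MISMATCHES:
            # Assumed form: seed mismatches lower the risk more than distal ones.
            risk += 1.0 / (1.0 + penalty)
    return risk


def rank_guides(genome, pam_regex="[ACGT]GG"):
    """Rank candidates by lowest off-target risk, then by highest efficiency."""
    rows = [(pos, g, efficiency(g), off_target_risk(g, pos, genome))
            for pos, g in find_guides(genome, pam_regex)]
    return sorted(rows, key=lambda r: (r[3], -r[2]))
```

Passing a different `pam_regex` (for example `"[ACGT]G"` for a relaxed NG PAM) re-runs the same ranking under another PAM, which is one simple way to compare PAM choices on a given genome.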

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1. Computational and Structural Biotechnology Journal (216 papers in training set): Top 0.1%, 12.4%
2. PLOS Computational Biology (1633 papers in training set): Top 3%, 10.5%
3. Bioinformatics (1061 papers in training set): Top 3%, 10.1%
4. The CRISPR Journal (33 papers in training set): Top 0.1%, 4.9%
5. BMC Bioinformatics (383 papers in training set): Top 2%, 4.9%
6. Briefings in Bioinformatics (326 papers in training set): Top 1%, 4.2%
7. Journal of Bioinformatics and Systems Biology (14 papers in training set): Top 0.1%, 4.0%

50% of probability mass above

8. PLOS ONE (4510 papers in training set): Top 39%, 3.6%
9. Scientific Reports (3102 papers in training set): Top 36%, 3.6%
10. Genomics, Proteomics & Bioinformatics (171 papers in training set): Top 2%, 3.3%
11. Frontiers in Genetics (197 papers in training set): Top 3%, 3.1%
12. NAR Genomics and Bioinformatics (214 papers in training set): Top 2%, 1.7%
13. BMC Genomics (328 papers in training set): Top 2%, 1.7%
14. ACS Synthetic Biology (256 papers in training set): Top 2%, 1.7%
15. Nucleic Acids Research (1128 papers in training set): Top 11%, 1.7%
16. Gigabyte (60 papers in training set): Top 0.9%, 1.3%
17. PeerJ (261 papers in training set): Top 9%, 1.3%
18. Bioinformatics Advances (184 papers in training set): Top 4%, 1.0%
19. International Journal of Molecular Sciences (453 papers in training set): Top 12%, 1.0%
20. Frontiers in Molecular Biosciences (100 papers in training set): Top 3%, 1.0%
21. Physical Biology (43 papers in training set): Top 2%, 0.9%
22. Journal of Chemical Information and Modeling (207 papers in training set): Top 3%, 0.9%
23. Biology Methods and Protocols (53 papers in training set): Top 2%, 0.8%
24. Journal of Molecular Biology (217 papers in training set): Top 4%, 0.8%
25. BioData Mining (15 papers in training set): Top 0.9%, 0.8%
26. Biophysical Journal (545 papers in training set): Top 5%, 0.7%
27. Methods (29 papers in training set): Top 0.7%, 0.7%
28. GigaScience (172 papers in training set): Top 3%, 0.7%
29. Journal of Genetics and Genomics (36 papers in training set): Top 2%, 0.7%
30. Computers in Biology and Medicine (120 papers in training set): Top 5%, 0.7%
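
The statement that the top 7 journals account for 50% of the predicted probability mass can be checked with a running sum over the listed probabilities. The snippet below is a small sketch of that arithmetic; the function name `ranks_to_reach` is hypothetical, and the percentages are the first ten values from the list above.

```python
from itertools import accumulate

# Predicted probabilities (in percent) for ranks 1-10, as listed above.
probabilities = [12.4, 10.5, 10.1, 4.9, 4.9, 4.2, 4.0, 3.6, 3.6, 3.3]


def ranks_to_reach(probs, threshold=50.0):
    """Return the smallest rank whose cumulative probability meets the threshold."""
    for rank, total in enumerate(accumulate(probs), start=1):
        if total >= threshold:
            return rank
    return None  # threshold not reached within the supplied ranks


print(ranks_to_reach(probabilities))  # ranks 1-6 sum to about 47%, ranks 1-7 to about 51%
```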