Context-dependent determinants of CRISPR-Cas9 editing efficiency revealed through cross-species endogenous editing analysis
Cohen, S.; Bergman, S.; Burghardt, M.; Menuhin-Gruman, I.; Eyal, E.; Arbel, N.; Emmanuel, E.; Kapel, M.; Rabinovich, L.; Avital, G.; Maoz, A.; Avitzour, M.; Bogen, M.; Orenstein, Y.; Rahimi, M.; Yaish, O.; Veksler-Lublinsky, I.; Cohen, L.; Malul, T.; Mayrose, I.; Rice, A.; Landau, E.; Burstein, D.; Arias, O.; Gertz, D.; Kutchinsky, O.; Aharoni, A.; Li, D.; Parnas, O.; Mol Jaya Prakashan, M.; Shovman, Y.; Izhiman, T.; Kunis, G.; Wiener, A.; Barhum, Y.; Steinberg Shemer, O.; Izraeli, S.; Birger, Y.; Markovich, O.; Furest, D.; Moshkovitz, S.; Yahalom, A.; Dominissini, D.; Brezinger-Dayan, K.; J.
Show abstract
Accurate prediction of CRISPR-Cas9 guide RNA (gRNA) editing efficiency remains limited, particularly outside human systems, where models trained on exogenous human datasets show poor generalization. We analyzed Cas9 efficiency and repair outcomes using novel endogenous editing data from four human cell types, two tomato cell types, and cells from giant river prawn and black soldier fly. While integrating publicly available predictors via ensemble frameworks improved performance, our analysis revealed hundreds of novel features affecting activity. Crucially, dominant features related to sites competition for gRNA, and local geometric properties varied across systems, highlighting the strong context dependence of Cas9 efficiency and arguing against a universal model. Interestingly, codon usage bias-based features also emerged as informative predictors, as they are proxies for chromatin accessibility. In contrast, trends in repair outcomes remained conserved. This work provides essential resources for more generalizable CRISPR guide design.
Matching journals
The top 1 journal accounts for 50% of the predicted probability mass.