The complete genome of the KOLF2.1J reference iPSC line
Alvarez Jerez, P.; Rhie, A.; Kim, J.; Hebbar, P.; Nag, S.; Antipov, D.; Koren, S.; Lara, E.; Beilina, A.; Hansen, N. F.; Arber, C. F.; Zulueta, J.; Wild-Crea, P.; Patel, D.; Hickey, G.; Waltz, B.; Malik, L.; Skarnes, W. C.; Reed, X.; Genner, R.; Daida, K.; Pantazis, C. B.; Grenn, F.; Nalls, M. A.; Billingsley, K.; Fossati, V.; Wray, S.; Ward, M.; Ryten, M.; Cookson, M. R.; Jain, M.; Paten, B.; Phillippy, A. M.; Blauwendraat, C.
Show abstract
While induced pluripotent stem cells (iPSCs) have gained popularity in studying neurodegenerative diseases, the heterogeneity of stem cells used across studies impacts cross-study comparison. The iPSC Neurodegenerative Disease Initiative (iNDI) selected the KOLF2.1J cell line and prioritized its use as a reference standard for studying the effects of pathogenic variants on cell biology due to its stability and neutral neurodegenerative disease genetic risk. This cell line, and its derivatives expressing over 100 variants related to Alzheimers disease, Parkinsons disease, and other neurological diseases, are available for academic and industry access. Current genomic data analyses are limited by the use of a human reference genome that does not capture the complete genetic background of a given iPSC line. While in the future this issue may be partially mitigated by the creation of a comprehensive human pangenome, previous work has shown that generating custom genomes is of value both to characterize the variation present and to serve as a more appropriate genomic reference. Here, we generated and characterized a custom complete genome assembly from KOLF2.1J. Mapping of sequencing reads to a personalized diploid assembly results in more comprehensive mapping compared to traditional linear references (i.e GRCh38). In addition, we provide a comprehensive custom gene annotation along with isoform expression and differential methylation analyses across multiple cell types. The assembly and all additional data is browsable and publicly available. This resource will enable more accurate investigation of the KOLF2.1J cell line and any genomics data generated compared to using traditional generalized references, while also serving as a foundational approach for establishing custom reference assemblies for other high-value iPSC lines.
Matching journals
The top 7 journals account for 50% of the predicted probability mass.