Easy-to-use whole-genome sequencing workflows and standardized practices to uncover hidden genetic variation in Synechocystis PCC 6803 wild-type and knock-out strains
Theune, M.; Fritsche, R.; Kueppers, N.; Boehm, M.; Kolkhof, P.; Paul, F.; Popa, O.; Oldenburg, E.; Wiegard, A.; Axmann, I. M.; Gutekunst, K.
Show abstract
Knock-out mutants are often used to study gene function by disrupting a specific gene and comparing the mutant to a wild-type strain. Reliable interpretation, however, requires that the two strains differ only by the intended mutation and that the observed phenotype is caused specifically by the deleted gene. In the highly polyploid cyanobacterium Synechocystis sp. PCC 6803, this is particularly challenging because incomplete segregation can mask genetic heterogeneity or secondary suppressor mutations. The genetic variation among laboratory wild-type lines can further confound phenotypic analyses. We show that these challenges can be addressed by routine strain validation via whole-genome sequencing (WGS). To this end, we developed and tested user friendly workflows for short-read (Illumina), long-read (Oxford Nanopore Technologies; ONT), and hybrid data, providing standardized quality control, variant calling, and structural variant detection. We benchmarked their performance in detecting single-nucleotide polymorphisms (SNPs), small indels, and structural variants using simulated datasets across different coverages and mixed populations. Applying the workflows to three Synechocystis sp. PCC 6803 wild-type lines revealed multiple sequence and structural differences relative to the reference genome, including previously undescribed genetic variants, underscoring the importance of documenting the strain background and the value of long-read sequencing. Characterization of two independent 6-phosphogluconate dehydrogenase (gnd) knock-out mutants and their complemented strains highlighted how a failed rescue can reveal a phenotype unrelated to the intended knock-out. An automated literature analysis revealed that only a minority of the investigated Synechocystis studies that used knock-out mutants included complementation as a control (39%), whereas this practice is more common in studies involving Escherichia coli (63%) and Saccharomyces cerevisiae (55%). Based on these results, we propose a practical guide for standardizing knock-out phenotyping in Synechocystis PCC 6803. Combined with accessible workflows for routine whole-genome validation, this framework aims to support more robust and reproducible knock-out studies in the future.
Matching journals
The top 16 journals account for 50% of the predicted probability mass.