Manipulating base quality scores enables variant calling from bisulfite sequencing alignments using conventional Bayesian approaches
Nunn, A.; Otto, C.; Stadler, P. F.; Langenberger, D.
Show abstract
Calling germline SNP variants from bisulfite-converted sequencing data poses a challenge for conventional software, which have no inherent capability to dissociate true polymorphisms from artificial mutations induced by the chemical treatment. Nevertheless, SNP data is desirable both for genotyping and to understand the DNA methylome in the context of the genetic background. The confounding effect of bisulfite conversion can be resolved by observing differences in allele counts on a per-strand basis, whereby artificial mutations are reflected by non-complementary base pairs. Herein, we present a computational pre-processing approach for adapting sequence alignment data, thus indirectly enabling downstream analysis in this manner using conventional variant calling software such as GATK or Freebayes. In comparison to specialised tools, the method represents a marked improvement in precision-sensitivity based on high-quality, published benchmark datasets for both human and model plant variants.
Matching journals
The top 7 journals account for 50% of the predicted probability mass.