Combined inference of known and novel mutational signatures with ReDeNovo
Kesimoglu, Z. N.; Hodzic, E.; Hoinka, J.; Amgalan, B.; Hirsch, M. G.; Przytycka, T. M.
Show abstract
Mutational signatures represent characteristic mutational patterns imprinted on the genome by mutagenic processes. They can provide information about the impact of the environmental and endogenous cellular processes on tumor mutations and can suggest treatment. Analysis of presence and strength of mutational signatures in cancer genomes has become a cornerstone in analysis of new and legacy cancer data. However, a precise inference of novel (de novo) signatures requires a large set of genomes, and methods focusing on estimating the presence of previously defined signatures are unable to uncover potential novel signatures that might emerge in new data. Thus, reliable methods to address these challenges are needed. We formally define the Combined Mutational Signature Inference Problem (CMSI) for the identification of known signatures and the inference of novel signatures in cancer data. CMSI represents non-convex optimization, and we provide a heuristic algorithm, ReDeNovo, to solve it efficiently. We extensively validated ReDeNovo on simulated data, evaluating its ability to precisely estimate presence and exposure to known signatures and to discover of novel signatures. On both tasks ReDeNovo outperformed existing approaches. In real biological data, ReDeNovo identified signatures missed by previous analyses and defined a new signature related to UV light exposure. ReDeNovo method provides a new and powerful tool to uncover mutational signatures. ReDeNovo is available from https://github.com/ncbi/redenovo.
Matching journals
The top 5 journals account for 50% of the predicted probability mass.