Cross-dataset pan-cancer detection: Correlating cell-free DNA fragment coverage with open chromatin sites across cell types
Olsen, L. R.; Odinokov, D.; Holsting, J. Q.; Kondrup, K.; Iisager, L.; Rusan, M.; Buus, S.; Laursen, B. E.; Borre, M.; Jochumsen, M. R.; Bouchelouche, K.; Frydendahl, A.; Rasmussen, M. H.; Henriksen, T. V.; Nesic, M.; Demuth, C.; Lindskrog, S. V.; Nordentoft, I.; Lamy, P.; Therkildsen, C.; Dyrskjot, L.; Sorensen, K. D.; Andersen, C. L.; Skanderup, A. J.; Besenbacher, S.
Show abstract
The fragmentation patterns of whole genome sequenced cell-free DNA are promising features for tumor-agnostic cancer detection. However, systematic biases challenge their cross-cohort generalization. We introduce LIONHEART, a novel, open source cancer detection method specifically optimized to generalize across datasets. The method correlates bias-corrected cfDNA fragment coverage across the genome with the locations of accessible chromatin regions from 898 cell and tissue type features. We use these correlations to detect changes in the cell-free DNA cell type composition caused by cancer. We test LIONHEART on nine datasets and fourteen cancer types (1106 non-cancer controls, 1449 cancers) obtained from different studies and show that it can distinguish cancer samples from non-cancer controls across cohorts with ROC AUC scores ranging from 0.62-0.95 (mean = 0.83, std = 0.12). We further validate the method on an external dataset, achieving a ROC AUC of 0.917.
Matching journals
The top 5 journals account for 50% of the predicted probability mass.