Back

Determining Optimal Placement of Copy Number Aberration Impacted Single Nucleotide Variants in a Tumor Progression History

Wu, C. H.; Joshi, S.; Robinson, W.; Robbins, P. F.; Schwartz, R.; Sahinalp, C.; Malikic, S.

2024-03-13 cancer biology
10.1101/2024.03.10.584318 bioRxiv
Show abstract

Intratumoral heterogeneity arises as a result of genetically distinct subclones emerging during tumor progression. These subclones are characterized by various types of somatic genomic aberrations, with single nucleotide variants (SNVs) and copy number aberrations (CNAs) being the most prominent. While single-cell sequencing provides powerful data for studying tumor progression, most existing and newly generated sequencing datasets are obtained through conventional bulk sequencing. Most of the available methods for studying tumor progression from multi-sample bulk sequencing data are either based on the use of SNVs from genomic loci not impacted by CNAs or designed to handle a small number of SNVs via enumerating their possible copy number trees. In this paper, we introduce DETOPT, a combinatorial optimization method for accurate tumor progression tree inference that places SNVs impacted by CNAs on trees of tumor progression with minimal distortion on their variant allele frequencies observed across available samples of a tumor. We show that on simulated data DETOPT provides more accurate tree placement of SNVs impacted by CNAs than the available alternatives. When applied to a set of multi-sample bulk exome-sequenced tumor metastases from a treatment-refractory, triple-positive metastatic breast cancer, DETOPT reports biologically plausible trees of tumor progression, identifying the tree placement of copy number state gains and losses impacting SNVs, including those in clinically significant genes.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 2%
14.3%
2
Bioinformatics
1061 papers in training set
Top 3%
10.0%
3
Cell Systems
167 papers in training set
Top 1%
9.1%
4
Scientific Reports
3102 papers in training set
Top 28%
4.3%
5
Biostatistics
21 papers in training set
Top 0.1%
3.9%
6
Bioinformatics Advances
184 papers in training set
Top 1%
3.9%
7
Cancer Research
116 papers in training set
Top 0.9%
3.6%
8
Genome Research
409 papers in training set
Top 1%
2.9%
50% of probability mass above
9
iScience
1063 papers in training set
Top 7%
2.7%
10
PLOS ONE
4510 papers in training set
Top 44%
2.7%
11
The American Journal of Human Genetics
206 papers in training set
Top 2%
2.6%
12
Genome Medicine
154 papers in training set
Top 3%
2.6%
13
Communications Biology
886 papers in training set
Top 4%
2.3%
14
npj Genomic Medicine
33 papers in training set
Top 0.2%
2.3%
15
Journal of Computational Biology
37 papers in training set
Top 0.1%
2.1%
16
Nature Communications
4913 papers in training set
Top 48%
1.9%
17
PLOS Genetics
756 papers in training set
Top 9%
1.7%
18
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 35%
1.5%
19
Cancers
200 papers in training set
Top 3%
1.5%
20
Genetics
225 papers in training set
Top 3%
1.3%
21
npj Systems Biology and Applications
99 papers in training set
Top 2%
1.2%
22
Nature Genetics
240 papers in training set
Top 6%
0.9%
23
Cell Reports
1338 papers in training set
Top 31%
0.9%
24
Frontiers in Genetics
197 papers in training set
Top 9%
0.8%
25
BMC Bioinformatics
383 papers in training set
Top 7%
0.8%
26
Genome Biology
555 papers in training set
Top 7%
0.8%
27
Frontiers in Molecular Biosciences
100 papers in training set
Top 5%
0.7%
28
Bulletin of Mathematical Biology
84 papers in training set
Top 2%
0.7%
29
BMC Genomics
328 papers in training set
Top 6%
0.7%
30
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 7%
0.7%