Back

Identifying Robust Subclonal Structures through Tumor Progression Tree Alignment

Gilbert, J.; Wu, C. H.; Knittel, H.; Schäffer, A. A.; Malikic, S.; Sahinalp, C.

2026-02-27 cancer biology
10.64898/2026.02.25.708046 bioRxiv
Show abstract

Understanding and comparing tumor evolutionary histories is fundamental to cancer genomics. Clonal trees, used to model tumor progression, are rooted, unordered trees in which each node represents a subclone labeled by a set of distinct mutations. To compare two clonal trees, we introduce omlta, the optimal multi-label tree alignment, which removes the minimum number of mutation labels from the trees, so that the remaining trees are isomorphic. Computing omlta is NP-hard. Here, we present an algorithm to compute the omlta, with a running time of [Formula] where L [≥] 1 is the total number of mutation labels occurring in the input trees and k is the minimum possible number of mutation labels that need to be removed for the alignment. Our implementation (https://github.com/algo-cancer/omlta) is the first computational tool for determining the optimal alignment between clonal trees. We applied omlta to 126 cases from the TRACERx study on non-small cell lung cancers and some melanoma single-cell data.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
14.8%
2
Journal of Computational Biology
37 papers in training set
Top 0.1%
12.4%
3
PLOS Computational Biology
1633 papers in training set
Top 3%
10.1%
4
Genome Research
409 papers in training set
Top 0.2%
8.4%
5
PLOS ONE
4510 papers in training set
Top 31%
4.9%
50% of probability mass above
6
iScience
1063 papers in training set
Top 3%
4.3%
7
Cell Systems
167 papers in training set
Top 3%
4.0%
8
BMC Bioinformatics
383 papers in training set
Top 3%
3.7%
9
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 2%
2.9%
10
Nature Communications
4913 papers in training set
Top 45%
2.4%
11
Scientific Reports
3102 papers in training set
Top 48%
2.4%
12
Bioinformatics Advances
184 papers in training set
Top 2%
2.4%
13
PLOS Genetics
756 papers in training set
Top 7%
1.9%
14
Genome Biology
555 papers in training set
Top 4%
1.7%
15
Communications Biology
886 papers in training set
Top 11%
1.5%
16
eLife
5422 papers in training set
Top 49%
1.2%
17
Genome Medicine
154 papers in training set
Top 6%
1.2%
18
Cancer Research
116 papers in training set
Top 3%
1.1%
19
Bulletin of Mathematical Biology
84 papers in training set
Top 2%
1.0%
20
Nature Genetics
240 papers in training set
Top 6%
0.9%
21
Biostatistics
21 papers in training set
Top 0.1%
0.9%
22
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.8%
0.8%
23
European Journal of Human Genetics
49 papers in training set
Top 1%
0.7%
24
Cancers
200 papers in training set
Top 5%
0.6%
25
Frontiers in Molecular Biosciences
100 papers in training set
Top 6%
0.6%
26
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 47%
0.6%
27
Cell Reports
1338 papers in training set
Top 37%
0.5%
28
Briefings in Bioinformatics
326 papers in training set
Top 8%
0.5%
29
PNAS Nexus
147 papers in training set
Top 3%
0.5%