Back

Efficient Agony Based Transfer Learning Algorithms for Survival Forecasting

Tamaskar, A.; Bannon, J.; Mishra, B.

2021-02-25 cancer biology
10.1101/2021.02.24.432695 bioRxiv
Show abstract

Progression modeling is a mature subfield of cancer bioinformatics, but it has yet to make a proportional clinical impact. The majority of the research in this area has focused on the development of efficient algorithms for accurately reconstructing sequences of (epi)genomic events from noisy data. We see this as the first step in a broad pipeline that will translate progression modeling to clinical utility, with the subsequent steps involving inferring prognoses and optimal therapy programs for different cancers and using similarity in progression to enhance decision making. In this paper we take some initial steps in completing this pipeline. As a theoretical contribution, we introduce a polytime-computable pairwise distance between progression models based on the graph-theoretic notion of "agony". Focusing on a particular progression model we can then use this agony distance to cluster (dis)similarities via multi-dimensional scaling. We recover known biological similarities and dissimilarities. Finally, we use the agony distance to automate transfer learning experiments and show a large improvement in the ability to forecast time to death.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 2%
14.1%
2
Bioinformatics
1061 papers in training set
Top 3%
9.9%
3
Journal of Computational Biology
37 papers in training set
Top 0.1%
9.0%
4
Cell Systems
167 papers in training set
Top 1%
8.3%
5
Nature Communications
4913 papers in training set
Top 30%
6.3%
6
PLOS ONE
4510 papers in training set
Top 32%
4.8%
50% of probability mass above
7
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 19%
3.8%
8
Biostatistics
21 papers in training set
Top 0.1%
3.5%
9
Communications Biology
886 papers in training set
Top 4%
2.6%
10
Scientific Reports
3102 papers in training set
Top 51%
2.0%
11
iScience
1063 papers in training set
Top 12%
1.9%
12
Cancers
200 papers in training set
Top 3%
1.9%
13
Bulletin of Mathematical Biology
84 papers in training set
Top 1%
1.8%
14
Patterns
70 papers in training set
Top 1.0%
1.7%
15
Frontiers in Molecular Biosciences
100 papers in training set
Top 3%
1.3%
16
Bioinformatics Advances
184 papers in training set
Top 4%
1.2%
17
Genome Research
409 papers in training set
Top 3%
1.2%
18
Cancer Research
116 papers in training set
Top 3%
1.2%
19
Expert Systems with Applications
11 papers in training set
Top 0.2%
1.2%
20
PLOS Genetics
756 papers in training set
Top 12%
0.9%
21
eLife
5422 papers in training set
Top 54%
0.9%
22
Science Advances
1098 papers in training set
Top 27%
0.9%
23
Frontiers in Genetics
197 papers in training set
Top 9%
0.8%
24
Cell Reports
1338 papers in training set
Top 33%
0.7%
25
Nature Genetics
240 papers in training set
Top 7%
0.7%
26
Frontiers in Bioinformatics
45 papers in training set
Top 1%
0.7%
27
Statistics in Medicine
34 papers in training set
Top 0.4%
0.7%
28
Heliyon
146 papers in training set
Top 8%
0.6%
29
BMC Bioinformatics
383 papers in training set
Top 8%
0.6%