Back

InstaPrism: an R package for fast implementation of BayesPrism

Hu, M.; Chikina, M.

2023-03-10 bioinformatics
10.1101/2023.03.07.531579 bioRxiv
Show abstract

Computational cell-type deconvolution is an important analytic technique for modeling the compositional heterogeneity of bulk gene expression data. A conceptually new Bayesian approach to this problem, BayesPrism, has recently been proposed and has subsequently been shown to be superior in accuracy and robustness against model misspecifications by independent studies. However, given that BayesPrism relies on Gibbs sampling, it is orders of magnitude more computationally expensive than standard approaches. Here, we introduce the InstaPrism algorithm which re-implements BayesPrism in a derandomized framework by replacing the time-consuming Gibbs sampling steps in BayesPrism with a fixed-point algorithm. We demonstrate that the new algorithm is effectively equivalent to BayesPrism while providing a considerable speed advantage. InstaPrism is implemented as a standalone R package with C++ backend and can be accessed from GitHub at https://github.com/humengying0907/InstaPrism.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 0.9%
26.4%
2
PLOS Genetics
756 papers in training set
Top 1%
10.3%
3
PLOS Computational Biology
1633 papers in training set
Top 4%
8.6%
4
BMC Bioinformatics
383 papers in training set
Top 1%
6.9%
50% of probability mass above
5
PLOS ONE
4510 papers in training set
Top 27%
6.4%
6
Genome Biology
555 papers in training set
Top 2%
4.4%
7
The Annals of Applied Statistics
15 papers in training set
Top 0.1%
4.0%
8
Nucleic Acids Research
1128 papers in training set
Top 5%
3.7%
9
Biometrics
22 papers in training set
Top 0.1%
2.9%
10
Frontiers in Genetics
197 papers in training set
Top 3%
2.1%
11
Genome Research
409 papers in training set
Top 2%
1.8%
12
Biostatistics
21 papers in training set
Top 0.1%
1.8%
13
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.4%
14
Bioinformatics Advances
184 papers in training set
Top 3%
1.4%
15
The American Journal of Human Genetics
206 papers in training set
Top 3%
1.2%
16
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
17
Nature Communications
4913 papers in training set
Top 62%
0.8%
18
Nature Biotechnology
147 papers in training set
Top 8%
0.7%
19
Statistics in Medicine
34 papers in training set
Top 0.4%
0.7%
20
Communications Biology
886 papers in training set
Top 28%
0.7%
21
BMC Genomics
328 papers in training set
Top 7%
0.7%
22
Frontiers in Molecular Biosciences
100 papers in training set
Top 6%
0.7%
23
Biophysical Journal
545 papers in training set
Top 6%
0.5%
24
Cell Systems
167 papers in training set
Top 14%
0.5%
25
Genetic Epidemiology
46 papers in training set
Top 1%
0.5%