Back

Efficient Stochastic Trace Generation for Transcription

Ferdowsi, A.; Fuegger, M.; Nowak, T.

2026-05-08 bioinformatics
10.64898/2026.05.05.722871 bioRxiv
Show abstract

Bursty transcription in single cells typically produces over-dispersed, skewed, and sometimes heavy-tailed expression distributions that are explained by two-state Markov models of the promoters. While the gold standard for simulation is exact stochastic sampling with Gillespies algorithm, obtaining thousands of timed traces is computationally costly. Surrogate models based on stochastic differential equations (SDEs) are widely used to speed up this simulation process. An example is the Chemical Langevin Equation based on Gaussian noise, which, however, does not capture heavy-tailed noise. In this work, we present a unified SDE framework that combines deterministic drift, Gaussian fluctuations, and additive sporadic jumps of arbitrary distributions, and provide an open-source Python implementation, bcrnnoise. The framework subsumes standard surrogate models and allows for vectorized generation of batches of transcription traces. We assess computational speed and accuracy of common surrogate models along with new models, showing that high accuracy can be obtained while reducing computational cost up to two orders of magnitude.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 0.5%
34.6%
2
BMC Bioinformatics
383 papers in training set
Top 0.5%
14.5%
3
PLOS Computational Biology
1633 papers in training set
Top 2%
12.4%
50% of probability mass above
4
PLOS ONE
4510 papers in training set
Top 42%
3.1%
5
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
2.6%
6
Scientific Reports
3102 papers in training set
Top 55%
1.8%
7
ACS Synthetic Biology
256 papers in training set
Top 2%
1.7%
8
iScience
1063 papers in training set
Top 14%
1.7%
9
Nature Communications
4913 papers in training set
Top 51%
1.7%
10
Biophysical Journal
545 papers in training set
Top 3%
1.7%
11
Genetics
225 papers in training set
Top 2%
1.7%
12
npj Systems Biology and Applications
99 papers in training set
Top 2%
1.0%
13
in silico Plants
24 papers in training set
Top 0.2%
0.9%
14
Genome Biology
555 papers in training set
Top 7%
0.8%
15
Physical Biology
43 papers in training set
Top 2%
0.7%
16
Bioinformatics Advances
184 papers in training set
Top 5%
0.7%
17
Nucleic Acids Research
1128 papers in training set
Top 18%
0.7%
18
Journal of The Royal Society Interface
189 papers in training set
Top 5%
0.6%
19
Journal of Chemical Theory and Computation
126 papers in training set
Top 0.9%
0.6%
20
Journal of Bioinformatics and Systems Biology
14 papers in training set
Top 0.9%
0.6%
21
IFAC-PapersOnLine
12 papers in training set
Top 0.1%
0.6%
22
Physical Review E
95 papers in training set
Top 1%
0.6%
23
NAR Genomics and Bioinformatics
214 papers in training set
Top 5%
0.5%
24
Briefings in Bioinformatics
326 papers in training set
Top 8%
0.5%