Back

Analyzing the link between RNA secondary structures and R-loop formation with tree polynomials

Liu, P.; Lusk, J.; Jonoska, N.; Vazquez, M.

2024-02-01 molecular biology
10.1101/2023.09.24.559224 bioRxiv
Show abstract

R-loops are a class of non-canonical nucleic acid structures that typically form during transcription when the nascent RNA hybridizes the DNA template strand, leaving the DNA coding strand unpaired. Co-transcriptional R-loops are abundant in nature and biologically relevant. Recent research shows that DNA sequence and topology affect R-loops, yet it remains unclear how these and other factors drive R-loop formation. In this work, we investigate a link between the secondary structure of the nascent RNA and the probability of R-loop formation. We introduce tree-polynomial representations, a class of mathematical objects that enable accurate and efficient data analysis of RNA secondary structures. With tree-polynomials, we establish a strong correlation between the secondary structure of the RNA transcript and the probability of R-loop formation. We identify that branches with short stems separated by multiple bubbles in the RNA secondary structure are associated with the strong correlation and are predictive of R-loop formation.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 1%
18.6%
2
Nucleic Acids Research
1128 papers in training set
Top 2%
10.1%
3
Entropy
20 papers in training set
Top 0.1%
8.4%
4
Scientific Reports
3102 papers in training set
Top 18%
6.4%
5
Communications Biology
886 papers in training set
Top 0.9%
4.3%
6
PLOS ONE
4510 papers in training set
Top 36%
4.0%
50% of probability mass above
7
BMC Bioinformatics
383 papers in training set
Top 3%
3.6%
8
Bioinformatics
1061 papers in training set
Top 6%
3.3%
9
Nature Communications
4913 papers in training set
Top 43%
2.7%
10
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 24%
2.7%
11
Journal of Molecular Biology
217 papers in training set
Top 0.9%
2.6%
12
Physical Review Research
46 papers in training set
Top 0.2%
2.4%
13
iScience
1063 papers in training set
Top 10%
2.1%
14
Cell Systems
167 papers in training set
Top 7%
1.7%
15
Journal of The Royal Society Interface
189 papers in training set
Top 2%
1.7%
16
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.5%
17
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.5%
18
Frontiers in Genetics
197 papers in training set
Top 7%
1.2%
19
eLife
5422 papers in training set
Top 49%
1.2%
20
Frontiers in Molecular Biosciences
100 papers in training set
Top 3%
1.0%
21
Biophysical Journal
545 papers in training set
Top 4%
0.9%
22
Journal of Structural Biology
58 papers in training set
Top 2%
0.7%
23
Cell Reports
1338 papers in training set
Top 33%
0.7%
24
Genome Research
409 papers in training set
Top 4%
0.7%
25
Physical Review E
95 papers in training set
Top 1%
0.7%
26
Advanced Science
249 papers in training set
Top 21%
0.6%
27
Genome Biology
555 papers in training set
Top 8%
0.6%