Analyzing the link between RNA secondary structures and R-loop formation with tree polynomials
Liu, P.; Lusk, J.; Jonoska, N.; Vazquez, M.
Show abstract
R-loops are a class of non-canonical nucleic acid structures that typically form during transcription when the nascent RNA hybridizes the DNA template strand, leaving the DNA coding strand unpaired. Co-transcriptional R-loops are abundant in nature and biologically relevant. Recent research shows that DNA sequence and topology affect R-loops, yet it remains unclear how these and other factors drive R-loop formation. In this work, we investigate a link between the secondary structure of the nascent RNA and the probability of R-loop formation. We introduce tree-polynomial representations, a class of mathematical objects that enable accurate and efficient data analysis of RNA secondary structures. With tree-polynomials, we establish a strong correlation between the secondary structure of the RNA transcript and the probability of R-loop formation. We identify that branches with short stems separated by multiple bubbles in the RNA secondary structure are associated with the strong correlation and are predictive of R-loop formation.
Matching journals
The top 6 journals account for 50% of the predicted probability mass.