Back

In silico prediction of COVID-19 test efficiency with DinoKnot

Newman, T.; Chang, H. F. K.; Jabbari, H.

2020-09-11 bioinformatics
10.1101/2020.09.11.292730 bioRxiv
Show abstract

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a novel coronavirus spreading across the world causing the disease COVID-19. The diagnosis of COVID-19 is done by quantitative reverse-transcription polymer chain reaction (qRT-PCR) testing which utilizes different primer-probe sets depending on the assay used. Using in silico analysis we aimed to determine how the secondary structure of the SARS-CoV-2 RNA genome affects the interaction between the reverse primer during qRT-PCR and how it relates to the experimental primer-probe test efficiencies. We introduce the program DinoKnot (Duplex Interaction of Nucleic acids with pseudoKnots) that follows the hierarchical folding hypothesis to predict the secondary structure of two interacting nucleic acid strands (DNA/RNA) of similar or different type. DinoKnot is the first program that utilizes stable stems in both strands as a guide to find the structure of their interaction. Using DinoKnot we predicted the interaction of the reverse primers used in four common COVID-19 qRT-PCR tests with the SARS-CoV-2 RNA genome. In addition, we predicted how 12 mutations in the primer/probe binding region may affect the primer/probe ability and subsequent SARS-CoV-2 detection. While we found all reverse primers are capable of interacting with their target area, we identified partial mismatching between the SARS-CoV-2 genome and some reverse primers. We predicted three mutations that may prevent primer binding, reducing the ability for SARS-CoV-2 detection. We believe our contributions can aid in the design of a more sensitive SARS-CoV-2 test. Author summaryThe current testing for the disease COVID-19 that is caused by the novel cornonavirus SARS-CoV-2 uses oligonucleotides called primers that bind to specific target regions on the SARS-CoV-2 genome to detect the virus. Our goal was to use computational tools to predict how the structure of the SARS-CoV-2 RNA genome affects the ability of the primers to bind to their target region. We introduce the program DinoKnot (Duplex interaction of nucleic acids with pseudoknots) that is able to predict the interactions between two DNA or RNA molecules. We used DinoKnot to predict the efficiency of four common COVID-19 tests, and the effect of mutations in the SARS-CoV-2 virus on ability of the COVID-19 tests in detecting those strains. We predict partial mismatching between some primers and the SARS-CoV-2 genome but that all primers are capable of interacting with their target areas. We also predict three mutations that prevent primer binding and thus SARS-CoV-2 detection. We discuss the limitations of the current COVID-19 testing and suggest the design of a more sensitive COVID-19 test that can be aided by our findings.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
BMC Bioinformatics
383 papers in training set
Top 0.4%
17.5%
2
PLOS ONE
4510 papers in training set
Top 12%
14.7%
3
Bioinformatics
1061 papers in training set
Top 3%
9.1%
4
PLOS Computational Biology
1633 papers in training set
Top 6%
6.3%
5
PeerJ
261 papers in training set
Top 1%
4.9%
50% of probability mass above
6
Journal of Bioinformatics and Systems Biology
14 papers in training set
Top 0.1%
4.3%
7
Frontiers in Genetics
197 papers in training set
Top 2%
4.0%
8
F1000Research
79 papers in training set
Top 0.5%
3.6%
9
Bioinformatics Advances
184 papers in training set
Top 2%
2.7%
10
Scientific Reports
3102 papers in training set
Top 45%
2.6%
11
BMC Genomics
328 papers in training set
Top 2%
1.7%
12
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.5%
13
Frontiers in Bioinformatics
45 papers in training set
Top 0.3%
1.5%
14
Genes
126 papers in training set
Top 1%
1.3%
15
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
1.2%
16
Archives of Clinical and Biomedical Research
28 papers in training set
Top 1%
1.2%
17
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.1%
18
Biology Methods and Protocols
53 papers in training set
Top 2%
0.9%
19
Access Microbiology
22 papers in training set
Top 0.7%
0.7%
20
Viruses
318 papers in training set
Top 5%
0.7%
21
Biology
43 papers in training set
Top 3%
0.6%
22
BioData Mining
15 papers in training set
Top 1%
0.6%