Back

Repurposing the dark genome. IV - noncoding proteins

Nayak, S.; Dhar, P. K.

2023-06-29 synthetic biology
10.1101/2023.06.29.547021 bioRxiv
Show abstract

The dark genome comprising of non-expressing, non-translating, and extinct DNA sequences has remained a largely unexplored genomic space. Using computational and experimental approaches, novel insights into the dark matter genome have recently been gained, revealing the presence of a vast and unexplored resource. Non-coding RNA (ncRNA) refers to a class of RNA molecules that do not encode proteins but play important regulatory roles in the cell. We asked if it was possible to make functional peptides and proteins from ncRNA leading to a new biological insight and applications? Here we present initial computational data in support of making functional noncoding proteins (NCP) from ncRNA sequences. Different types of non-coding genomic sequences originating from Caenorhabditis elegans, Drosophila melanogaster, Arabidopsis thaliana, and Homo sapiens were studied to understand sequence composition, secondary structure, and physiochemical properties of NCPs. This work builds the foundation for experimentally characterizing the first-in-the-class non-coding proteins leading to a novel insights and applications.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
ACS Omega
90 papers in training set
Top 0.1%
12.4%
2
Scientific Reports
3102 papers in training set
Top 6%
10.1%
3
PeerJ
261 papers in training set
Top 0.3%
7.2%
4
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.5%
6.4%
5
PLOS ONE
4510 papers in training set
Top 28%
6.4%
6
International Journal of Molecular Sciences
453 papers in training set
Top 2%
4.3%
7
Biosystems
18 papers in training set
Top 0.1%
3.7%
50% of probability mass above
8
ACS Synthetic Biology
256 papers in training set
Top 1%
3.1%
9
PLOS Computational Biology
1633 papers in training set
Top 13%
2.1%
10
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
2.1%
11
RNA Biology
70 papers in training set
Top 0.2%
1.8%
12
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.7%
13
International Journal of Biological Macromolecules
65 papers in training set
Top 2%
1.7%
14
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.5%
1.7%
15
iScience
1063 papers in training set
Top 18%
1.5%
16
Frontiers in Genetics
197 papers in training set
Top 6%
1.3%
17
Communications Biology
886 papers in training set
Top 12%
1.3%
18
Nucleic Acids Research
1128 papers in training set
Top 13%
1.3%
19
Chemical Communications
24 papers in training set
Top 0.7%
1.2%
20
Computational Biology and Chemistry
23 papers in training set
Top 0.3%
1.0%
21
Genes
126 papers in training set
Top 2%
1.0%
22
Nano Letters
63 papers in training set
Top 2%
0.8%
23
Angewandte Chemie
12 papers in training set
Top 0.2%
0.8%
24
Viruses
318 papers in training set
Top 5%
0.8%
25
Life
27 papers in training set
Top 0.4%
0.7%
26
Journal of Molecular Biology
217 papers in training set
Top 4%
0.7%
27
Journal of Medical Virology
137 papers in training set
Top 5%
0.7%
28
Informatics in Medicine Unlocked
21 papers in training set
Top 1%
0.7%
29
NAR Molecular Medicine
18 papers in training set
Top 0.3%
0.7%
30
Biophysical Journal
545 papers in training set
Top 6%
0.6%