Repurposing The Dark Genome. III - Intronic Proteins

Garg, M.; Dhar, P. K.

2023-06-10 synthetic biology
10.1101/2023.06.10.544447 bioRxiv

Based on expression patterns, genomes are viewed as collections of protein-coding, RNA-coding, and non-expressing DNA sequences. Unlike gene expression in most prokaryotes, eukaryotic gene expression includes an additional step called alternative splicing. During pre-mRNA maturation, introns are spliced out and different combinations of exons are joined together, producing mRNA isoforms. After removal from the pre-mRNA, introns may be degraded by cellular exonucleases, form long non-coding RNAs (lncRNAs), or be temporarily retained in the nucleus to regulate gene expression. We asked: Do introns have an unutilized potential for encoding proteins? If introns had the opportunity to be translated, what kinds of peptides or proteins would they make? This study is based on the hypothesis that functional proteins can be made from leftover introns, and it extends the original work on making functional proteins from E. coli intergenic sequences (Dhar et al., 2009). Here, full-length introns were computationally translated into proteins to study their potential structural, physicochemical, and functional properties and their predicted cellular location. Experimental validation is underway to provide a detailed understanding of the biology of intronic proteins. A synthetic intronic protein repository would provide an opportunity to design first-in-class molecules toward functional endpoints.
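The computational translation step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which tool, reading frame, or stop-codon handling the authors used, so everything here is an assumption: the function name `translate`, the use of the standard genetic code, the `frame` and `to_stop` parameters, and the toy sequence are all hypothetical and are not taken from the paper.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq, frame=0, to_stop=False):
    """Translate a nucleotide sequence in one forward reading frame.

    frame   -- offset (0, 1, or 2) of the first codon; an assumption,
               since the abstract does not state which frame was used.
    to_stop -- if True, stop at the first stop codon; if False, read
               through and mark stops with "*".
    Codons containing characters other than A, C, G, T/U become "X".
    """
    seq = seq.upper().replace("U", "T")  # accept RNA or DNA input
    protein = []
    for i in range(frame, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    toy_sequence = "ATGGCCTAAGGT"  # hypothetical sequence, not real intron data
    for frame in range(3):
        print(frame, translate(toy_sequence, frame=frame))
```

Translating all three forward frames, as the usage loop does, is one simple way to see how strongly the choice of frame changes the resulting peptide; a real pipeline would also need to decide how to treat the reverse strand and internal stop codons.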

Matching journals

The top 5 journals account for 50% of the predicted probability mass.
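The 50% figure can be checked directly from the listed probabilities. The sketch below is only an illustration of that arithmetic; the function name `journals_to_reach` is hypothetical, and the values are the percentages shown for the six highest-ranked journals in the list.

```python
def journals_to_reach(probabilities, threshold):
    """Return how many top-ranked entries are needed for the running
    total to reach the threshold, or None if it is never reached."""
    total = 0.0
    for count, p in enumerate(probabilities, start=1):
        total += p
        if total >= threshold:
            return count
    return None


# Predicted probabilities (percent) for ranks 1 to 6, copied from the list.
top_probabilities = [14.1, 14.1, 9.9, 6.7, 6.2, 4.2]

if __name__ == "__main__":
    print(journals_to_reach(top_probabilities, 50.0))
```

The first four entries sum to 44.8% and the first five to 51.0%, so five journals are needed to pass the 50% mark.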

Rank | Journal | Papers in training set | Percentile | Predicted probability
1 | Biosystems | 18 | Top 0.1% | 14.1%
2 | Computational and Structural Biotechnology Journal | 216 | Top 0.1% | 14.1%
3 | ACS Omega | 90 | Top 0.1% | 9.9%
4 | RNA | 169 | Top 0.1% | 6.7%
5 | International Journal of Molecular Sciences | 453 | Top 0.9% | 6.2%
(50% of probability mass above)
6 | Nucleic Acids Research | 1128 | Top 5% | 4.2%
7 | Journal of Molecular Biology | 217 | Top 0.5% | 3.9%
8 | iScience | 1063 | Top 6% | 3.5%
9 | RNA Biology | 70 | Top 0.1% | 2.5%
10 | PeerJ | 261 | Top 4% | 2.3%
11 | Scientific Reports | 3102 | Top 54% | 1.8%
12 | PLOS ONE | 4510 | Top 55% | 1.7%
13 | Protein Science | 221 | Top 0.9% | 1.7%
14 | ACS Synthetic Biology | 256 | Top 2% | 1.7%
15 | Computational Biology and Chemistry | 23 | Top 0.2% | 1.5%
16 | Frontiers in Genetics | 197 | Top 6% | 1.5%
17 | Frontiers in Molecular Biosciences | 100 | Top 2% | 1.5%
18 | PLOS Computational Biology | 1633 | Top 19% | 1.3%
19 | Biotechnology and Bioengineering | 49 | Top 0.6% | 1.3%
20 | Biology of the Cell | 11 | Top 0.1% | 1.2%
21 | eLife | 5422 | Top 51% | 1.1%
22 | Frontiers in Bioengineering and Biotechnology | 88 | Top 2% | 0.9%
23 | Biology Methods and Protocols | 53 | Top 3% | 0.7%
24 | Open Biology | 95 | Top 2% | 0.7%
25 | Genes | 126 | Top 3% | 0.7%
26 | F1000Research | 79 | Top 5% | 0.7%