Back

Genome-wide annotation and analyses of bifunctional genes in the human genome

Insan, J.; Menon, M. B.; Dhamija, S.

2026-01-29 genomics
10.64898/2026.01.28.702170 bioRxiv
Show abstract

Conventional gene annotation pipelines classify eukaryotic genes into protein-coding and non-coding. Alternative splicing may generate non-coding transcript variants from protein-coding genes, that are expressed in tissue- or disease-specific manner. We and others have described the genes which transcribe both coding and non-coding transcripts as bifunctional genes. Here we present a genome-wide analyses of bifunctional genes and reannotate the genes in the human genome reference assembly into coding, non-coding and bifunctional. We identify over 4000 "bifunctional genes" in the human genome, constituting approximately 10% of the transcribed genes, and present evidence that these genes are conserved in evolution and their number correlate well with genome size and complexity. These genes are enriched in gene sets involved in vesicular transport, autophagy, RNA/DNA binding, glycosylation and splicing. By monitoring the expression of non-coding exons in long-read sequencing datasets and by quantitative RT-qPCR, we provide evidence for the expression of non-coding variants from bifunctional genes. The ncRNA transcripts from these genes might have similar or different roles from their cognate mRNA counterparts. They may act as miRNA sponges or harbour non-canonical open-reading frames that encode microproteins, while also competing for binding with RNA-binding proteins. We present evidence for establishing potential biological functions of bifunctional genes and summarise the findings in a searchable database. Further studies and functional characterization focused on this special group of genes may reveal interesting gene regulatory mechanisms relevant to physiology and pathology.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
PLOS Genetics
756 papers in training set
Top 0.7%
14.0%
2
Scientific Reports
3102 papers in training set
Top 7%
9.8%
3
International Journal of Molecular Sciences
453 papers in training set
Top 0.4%
8.2%
4
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 1%
4.2%
5
RNA Biology
70 papers in training set
Top 0.1%
4.2%
6
Frontiers in Genetics
197 papers in training set
Top 2%
4.1%
7
Genomics
60 papers in training set
Top 0.3%
3.9%
8
PLOS ONE
4510 papers in training set
Top 41%
3.5%
50% of probability mass above
9
Genes
126 papers in training set
Top 0.3%
3.5%
10
Nucleic Acids Research
1128 papers in training set
Top 6%
3.5%
11
Cells
232 papers in training set
Top 0.8%
3.5%
12
BMC Biology
248 papers in training set
Top 0.5%
2.7%
13
Human Molecular Genetics
130 papers in training set
Top 1%
2.7%
14
Biochimica et Biophysica Acta (BBA) - Molecular Basis of Disease
25 papers in training set
Top 0.1%
2.3%
15
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.6%
16
Molecular Genetics and Genomics
11 papers in training set
Top 0.2%
1.4%
17
PLOS Computational Biology
1633 papers in training set
Top 18%
1.4%
18
BMC Cancer
52 papers in training set
Top 2%
0.9%
19
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
0.9%
20
Gene
41 papers in training set
Top 2%
0.7%
21
Genome Biology and Evolution
280 papers in training set
Top 2%
0.7%
22
Molecular and Cellular Biology
40 papers in training set
Top 0.4%
0.7%
23
Biology
43 papers in training set
Top 3%
0.7%
24
Nature Communications
4913 papers in training set
Top 64%
0.7%
25
PeerJ
261 papers in training set
Top 18%
0.6%
26
iScience
1063 papers in training set
Top 38%
0.6%
27
G3 Genes|Genomes|Genetics
351 papers in training set
Top 3%
0.6%
28
Communications Biology
886 papers in training set
Top 30%
0.6%