Back

A functional annotation based integration of different similarity measures for gene expressions

Misra, S.; Roy, S.; Ray, S. S.

2026-02-24 bioinformatics
10.64898/2026.02.23.707392 bioRxiv
Show abstract

Genes with similar expression profiles often exhibit similar functional properties. An "integrated similarity score" (ISS) is developed by combining different expression similarity measures through weights, obtained using biological information, for improving gene similarity. The expression similarity measures are converted to the common framework of positive predictive value using functional annotation. A fitness function, called "fitness function using functional annotation of genes" (FFFAG), is also developed by minimizing the difference between functional similarity value and the ISS. The FFFAG is used to determine the weight combination of different similarity measures in ISS. In addition, an existing similarity measure, called TMJ (integrated similarity measure by multiplying Triangle and Jaccard similarity), is also modified to incorporate biological knowledge involving functional annotation. The results demonstrate that ISS is superior to individual similarity measure to find similar gene pairs. Further, the ISS predicts the functional categories of 40 unclassified yeast genes at p-value cutoff of 10-10 from 12 clusters. The associated code is accessible at http://www.isical.ac.in/[~]shubhra/ISS.html.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
BMC Bioinformatics
383 papers in training set
Top 0.7%
10.8%
2
PLOS ONE
4510 papers in training set
Top 17%
10.5%
3
Bioinformatics
1061 papers in training set
Top 3%
8.7%
4
Briefings in Bioinformatics
326 papers in training set
Top 0.8%
6.6%
5
PLOS Computational Biology
1633 papers in training set
Top 6%
5.0%
6
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1%
4.1%
7
Computers in Biology and Medicine
120 papers in training set
Top 0.6%
4.1%
8
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 2%
3.7%
50% of probability mass above
9
Scientific Reports
3102 papers in training set
Top 42%
3.0%
10
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 0.9%
1.8%
11
Journal of Bioinformatics and Systems Biology
14 papers in training set
Top 0.2%
1.5%
12
IEEE Access
31 papers in training set
Top 0.5%
1.4%
13
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 6%
1.1%
14
Frontiers in Genetics
197 papers in training set
Top 7%
1.1%
15
Expert Systems with Applications
11 papers in training set
Top 0.3%
1.0%
16
International Journal of Molecular Sciences
453 papers in training set
Top 12%
0.9%
17
BioSystems
11 papers in training set
Top 0.2%
0.9%
18
Journal of Computational Biology
37 papers in training set
Top 0.4%
0.9%
19
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.9%
20
Chaos, Solitons & Fractals
32 papers in training set
Top 1%
0.9%
21
Applied Sciences
24 papers in training set
Top 0.6%
0.9%
22
IEEE/ACM Transactions on Computational Biology and Bioinformatics
32 papers in training set
Top 0.4%
0.9%
23
PeerJ
261 papers in training set
Top 12%
0.9%
24
Neurocomputing
13 papers in training set
Top 0.5%
0.8%
25
Physical Biology
43 papers in training set
Top 2%
0.8%
26
Physical Review E
95 papers in training set
Top 1%
0.8%
27
Informatics in Medicine Unlocked
21 papers in training set
Top 1%
0.8%
28
Nucleic Acids Research
1128 papers in training set
Top 18%
0.7%
29
Computational Biology and Chemistry
23 papers in training set
Top 0.5%
0.7%
30
Vaccines
196 papers in training set
Top 3%
0.7%