Back

Fast Approximate IsoRank for Scalable Global Alignment of Biological Networks

Devkota, K.; Cowen, L. J.; Blumer, A.; Hu, X.

2023-03-15 bioinformatics
10.1101/2023.03.13.532445 bioRxiv
Show abstract

A well-studied approximate version of the graph matching problem is directly relevant for the study of protein-protein interaction networks. Called by the computational biology community Global Network Alignment, the two networks to be matched are derived from the protein-protein interaction (PPI) networks from organisms of two different species. If these two species evolved recently from a common ancestor, we can view the two PPI networks as a single network that evolved over time. It is the two versions of this network that we want to align using approximate graph matching. The first spectral method for the PPI global alignment problem proposed by the biological community was the IsoRank method of Singh et al. This method for global biological network alignment is still used today. However, with the advent of many more experiments, the size of the networks available to match has grown considerably, making running IsoRank unfeasible on these networks without access to state of the art computational resources. In this paper, we develop a new IsoRank approximation, which exploits the mathematical properties of IsoRanks linear system to solve the problem in quadratic time with respect to the maximum size of the two PPI networks. We further propose a computationally cheaper refinement to this initial approximation so that the updated result is even closer to the original IsoRank formulation. In experiments on synthetic and real PPI networks, we find that the results of our approximate IsoRank are not only nearly as accurate as the original IsoRank results but are also much faster, which makes the global alignment of large-scale biological networks feasible and scalable.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 1%
22.9%
2
BMC Bioinformatics
383 papers in training set
Top 0.7%
10.6%
3
Journal of Computational Biology
37 papers in training set
Top 0.1%
8.5%
4
IEEE/ACM Transactions on Computational Biology and Bioinformatics
32 papers in training set
Top 0.1%
6.4%
5
PLOS ONE
4510 papers in training set
Top 33%
4.4%
50% of probability mass above
6
PLOS Computational Biology
1633 papers in training set
Top 8%
4.4%
7
Briefings in Bioinformatics
326 papers in training set
Top 2%
4.0%
8
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 0.4%
3.6%
9
Neurocomputing
13 papers in training set
Top 0.1%
2.4%
10
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.2%
1.9%
11
Scientific Reports
3102 papers in training set
Top 57%
1.7%
12
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.5%
13
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 4%
1.5%
14
Frontiers in Genetics
197 papers in training set
Top 6%
1.4%
15
Bioinformatics Advances
184 papers in training set
Top 4%
0.9%
16
Frontiers in Computational Neuroscience
53 papers in training set
Top 2%
0.8%
17
IEEE Access
31 papers in training set
Top 1.0%
0.8%
18
Neural Networks
32 papers in training set
Top 0.8%
0.7%
19
Biostatistics
21 papers in training set
Top 0.1%
0.7%
20
Entropy
20 papers in training set
Top 0.4%
0.7%
21
Life
27 papers in training set
Top 0.6%
0.7%
22
Physical Review E
95 papers in training set
Top 1%
0.7%
23
PeerJ
261 papers in training set
Top 19%
0.5%
24
iScience
1063 papers in training set
Top 40%
0.5%
25
Journal of Bioinformatics and Systems Biology
14 papers in training set
Top 1%
0.5%