Back

Topological Data Analysis of Protein Structure Manifolds from Molecular Dynamics Computer Simulation

Sino, M.; Kamberaj, H.

2025-07-14 biophysics
10.1101/2025.07.12.664527 bioRxiv
Show abstract

The analysis of computer simulation data requires efficient statistical and computational approaches, based on well-established theoretical frameworks. This study aims to introduce such approaches for topological data analysis within the persistent homology framework and to describe the manifold of the protein structure dynamics within the differential geometry of the directed graphs framework. Furthermore, the asymmetric kernel-directed graphs determined by the transfer entropy will describe the information flow in this manifold. The primary goal is to characterise changes in the topology of the protein structure due to the mutations. Moreover, this study aims to define the embedded manifold of dimension m of the amino acid sequence interaction network using the graphs Laplacian matrix for determining the local embedded vector fields and coordinate vectors in this manifold for each amino acid as the vertices of either a directed or undirected graph. Furthermore, this study strives to show that encoding the amino acid sequence information in an m-dimensional manifold is statistically efficient by decoding that information in a much lower-dimensional space. Then, using the topological data analysis, we can observe protein structure dynamics changes in a multidimensional manifold, for example, due to amino acid mutations. The analysis showed that short equilibrium structure fluctuations at a few nanoseconds enable the construction of such a manifold. As a case study, the influence of the mutation of the two disulphide bridges on the three-dimensional structure of the Bovine Pancreatic Trypsin Inhibitor protein is investigated.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Chaos, Solitons & Fractals
32 papers in training set
Top 0.1%
14.7%
2
Physica A: Statistical Mechanics and its Applications
10 papers in training set
Top 0.1%
10.1%
3
Entropy
20 papers in training set
Top 0.1%
7.2%
4
PLOS ONE
4510 papers in training set
Top 25%
6.8%
5
Physical Biology
43 papers in training set
Top 0.2%
6.4%
6
The European Physical Journal Plus
13 papers in training set
Top 0.2%
3.6%
7
Scientific Reports
3102 papers in training set
Top 37%
3.6%
50% of probability mass above
8
PLOS Computational Biology
1633 papers in training set
Top 11%
3.3%
9
Computers in Biology and Medicine
120 papers in training set
Top 1.0%
3.3%
10
Frontiers in Molecular Biosciences
100 papers in training set
Top 0.7%
2.7%
11
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
1.7%
12
Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences
15 papers in training set
Top 0.4%
1.7%
13
Physical Review E
95 papers in training set
Top 0.8%
1.5%
14
BioSystems
11 papers in training set
Top 0.2%
1.2%
15
Journal of Theoretical Biology
144 papers in training set
Top 1%
1.2%
16
SoftwareX
15 papers in training set
Top 0.3%
0.9%
17
Life
27 papers in training set
Top 0.3%
0.8%
18
Computational and Structural Biotechnology Journal
216 papers in training set
Top 8%
0.8%
19
Mathematical Biosciences and Engineering
23 papers in training set
Top 0.6%
0.8%
20
Cognitive Neurodynamics
15 papers in training set
Top 0.4%
0.7%
21
Bioengineering
24 papers in training set
Top 1%
0.7%
22
The Journal of Chemical Physics
49 papers in training set
Top 0.4%
0.7%
23
Frontiers in Applied Mathematics and Statistics
10 papers in training set
Top 0.4%
0.7%
24
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.7%
25
Neurocomputing
13 papers in training set
Top 0.6%
0.7%
26
BMC Bioinformatics
383 papers in training set
Top 7%
0.7%
27
The Journal of Physical Chemistry B
158 papers in training set
Top 2%
0.7%
28
Frontiers in Physics
20 papers in training set
Top 1%
0.7%
29
Epidemiology and Infection
84 papers in training set
Top 4%
0.6%
30
Brain Sciences
52 papers in training set
Top 3%
0.6%