Topological Data Analysis of Protein Structure Manifolds from Molecular Dynamics Computer Simulation
Sino, M.; Kamberaj, H.
Show abstract
The analysis of computer simulation data requires efficient statistical and computational approaches, based on well-established theoretical frameworks. This study aims to introduce such approaches for topological data analysis within the persistent homology framework and to describe the manifold of the protein structure dynamics within the differential geometry of the directed graphs framework. Furthermore, the asymmetric kernel-directed graphs determined by the transfer entropy will describe the information flow in this manifold. The primary goal is to characterise changes in the topology of the protein structure due to the mutations. Moreover, this study aims to define the embedded manifold of dimension m of the amino acid sequence interaction network using the graphs Laplacian matrix for determining the local embedded vector fields and coordinate vectors in this manifold for each amino acid as the vertices of either a directed or undirected graph. Furthermore, this study strives to show that encoding the amino acid sequence information in an m-dimensional manifold is statistically efficient by decoding that information in a much lower-dimensional space. Then, using the topological data analysis, we can observe protein structure dynamics changes in a multidimensional manifold, for example, due to amino acid mutations. The analysis showed that short equilibrium structure fluctuations at a few nanoseconds enable the construction of such a manifold. As a case study, the influence of the mutation of the two disulphide bridges on the three-dimensional structure of the Bovine Pancreatic Trypsin Inhibitor protein is investigated.
Matching journals
The top 7 journals account for 50% of the predicted probability mass.