Back

Learning fragment-based segmentation of binding sites from molecular dynamics: a proof-of-concept on cardiac myosin.

Yang, Y.-Y.; Pickersgill, R. W.; Fornili, A.

2026-02-16 bioinformatics
10.64898/2026.02.13.703009 bioRxiv
Show abstract

The geometric and chemical features of protein binding sites tend to change as a consequence of conformational dynamics. In the ligand-unbound (apo) state, a binding site might be only transiently organised in a way that can accommodate a given ligand, with the relevant regions of the protein coming together in a suitable arrangement only in a subset of conformations. Ligand binding itself can also induce further changes in the binding site. Because most ligands can be decomposed into smaller fragments, we hypothesised that mapping onto the binding site surface the propensity of binding specific fragments could be used to monitor changes in the overall ability of the site to bind a ligand. This task can be formulated as semantic segmentation, which can now be performed successfully using deep learning methods. Here we introduce the Fragment-Based protein Ensemble semantic Segmentation Tool for Myosin (FragBEST-Myo), a deep learning method based on a 3D U-Net architecture, trained to partition the omecamtiv mecarbil (OM) binding site of cardiac myosin into fragment-specific regions using only local shape and physico-chemical features. The model was trained on labelled Molecular Dynamics (MD) trajectories of OM-bound myosin in both post-rigor (PR) and pre-power-stroke (PPS) states, achieving an accuracy of ~95% and a mean Intersection over Union (mIoU) > 0.75 on unseen trajectories from both states. When applied to apo trajectories, FragBEST-Myo-derived descriptors produced rankings consistent with similarity to holo conformations. Moreover, selecting apo frames based on FragBEST-Myo ranking increased the chance of recovering holo-like OM docking poses relative to randomly chosen control frames, supporting its use as a screening tool for ensemble docking. Beyond frame selection, fragment maps provide a compact representation to assess docking poses and to guide fragment-based design. Our proof-of-concept provides a basis for developing future general models applicable to a broader range of proteins and ligands, with the fragment-based formulation offering a natural route to generalisation. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=103 SRC="FIGDIR/small/703009v1_ufig1.gif" ALT="Figure 1"> View larger version (32K): org.highwire.dtl.DTLVardef@1551255org.highwire.dtl.DTLVardef@26bf73org.highwire.dtl.DTLVardef@1e3181forg.highwire.dtl.DTLVardef@44baf7_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 1 journal accounts for 50% of the predicted probability mass.