Back

Boosting performance of generative diffusion model for molecular docking by training on artificial binding pockets

Voitsitskyi, T.; Bdzhola, V.; Stratiichuk, R.; Koleiev, I.; Ostrovsky, Z.; Vozniak, V.; Khropachov, I.; Henitsoi, P.; Popryho, L.; Zhytar, R.; Yesylevskyy, S. O.; Nafiev, A.; Starosyla, S.

2023-11-22 bioinformatics
10.1101/2023.11.22.568238 bioRxiv
Show abstract

This study introduces the PocketCFDM generative diffusion model, aimed at improving the prediction of small molecule poses in the protein binding pockets. The model utilizes a novel data augmentation technique, involving the creation of numerous artificial binding pockets that mimic the statistical patterns of non-bond interactions found in actual protein-ligand complexes. An algorithmic method was developed to assess and replicate these interaction patterns in the artificial binding pockets built around small molecule conformers. It is shown that the integration of artificial binding pockets into the training process significantly enhanced the models performance. Notably, PocketCFDM surpassed DiffDock in terms of non-bond interaction quality, number of steric clashes, and inference speed. Future developments and optimizations of the model are discussed. AvailabilityThe inference code and final model weights of PocketCFDM are accessible publicly via the GitHub repository: https://github.com/vtarasv/pocket-cfdm.git.

Matching journals

The top 1 journal accounts for 50% of the predicted probability mass.