Macro-Equi-Diff (MED): Scaffold-based Macrocycles Generation Using Equivariant Diffusion

Kambampati, S. S.; Anumandla, S.; Guttula, S. L.; Kavadi, V. R.; Gogte, S.; Kondaparthi, V.

2026-02-06 bioinformatics
10.64898/2026.02.05.703948 bioRxiv
Macrocyclic compounds are essential in drug discovery as they can modulate protein-protein interactions and enhance selectivity. Their structural complexity enables access to molecular diversity beyond traditional small molecules; however, designing feasible macrocycles remains a challenging task. Current computational methods often fail to generate macrocycles with proper drug-like properties. Here, we present Macro-Equi-Diff (MED), a deep learning framework that combines transformer-based site identification with an E(3)-equivariant Diffusion Model (EDM) for linker creation, and a fragment-linker attachment module. MED transforms acyclic molecules into structurally consistent macrocycles. MED was tested on the ZINC dataset, achieving high validity (93.92%), uniqueness (99.94%), macrocyclization (99.92%), and linker novelty (82.81%). MED improves upon previous methods that lack a macrocyclic geometry context. As a case study, MED was used to macrocyclize four acyclic drugs targeting the JAK2 protein. The generated macrocycles exhibited favourable molecular descriptors and strong binding affinities, establishing MED as a reliable method for expanding the macrocyclic chemical space.
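The abstract reports validity, uniqueness, and novelty percentages without defining them. The sketch below shows one common way such generation metrics are computed; the definitions are assumptions, not confirmed from the paper. The names `generation_metrics`, `is_valid`, and `training_set` are hypothetical, and the toy strings stand in for canonical molecule representations. The paper reports novelty for linkers specifically, and its macrocyclization rate is not sketched here.

```python
def generation_metrics(generated, is_valid, training_set):
    """Compute validity, uniqueness, and novelty as fractions in [0, 1].

    Assumed definitions (not taken from the paper):
      validity   = valid items / all generated items
      uniqueness = distinct valid items / valid items
      novelty    = distinct valid items absent from training / distinct valid items
    """
    valid = [item for item in generated if is_valid(item)]
    unique = set(valid)
    novel = unique - set(training_set)
    return {
        "validity": len(valid) / len(generated) if generated else 0.0,
        "uniqueness": len(unique) / len(valid) if valid else 0.0,
        "novelty": len(novel) / len(unique) if unique else 0.0,
    }


# Toy data: placeholder strings, with "bad" standing in for an invalid output.
generated = ["CCO", "CCO", "CCN", "bad", "CCC"]
training_set = {"CCN"}
metrics = generation_metrics(generated, lambda s: s != "bad", training_set)
```

In practice the validity check would come from a cheminformatics toolkit and the strings would be canonicalized before comparison; both steps are left out of this sketch.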

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1 | Journal of Chemical Information and Modeling | 207 papers in training set | Top 0.1% | 44.8%
2 | Advanced Science | 249 papers in training set | Top 3% | 5.2%
3 | Briefings in Bioinformatics | 326 papers in training set | Top 1% | 5.2%
50% of probability mass above
4 | Bioinformatics | 1061 papers in training set | Top 5% | 4.7%
5 | Journal of Cheminformatics | 25 papers in training set | Top 0.1% | 4.3%
6 | Computational and Structural Biotechnology Journal | 216 papers in training set | Top 1% | 3.9%
7 | PLOS ONE | 4510 papers in training set | Top 50% | 1.9%
8 | Communications Chemistry | 39 papers in training set | Top 0.1% | 1.9%
9 | Chemical Science | 71 papers in training set | Top 0.8% | 1.8%
10 | Nature Communications | 4913 papers in training set | Top 49% | 1.8%
11 | Scientific Reports | 3102 papers in training set | Top 55% | 1.8%
12 | PLOS Computational Biology | 1633 papers in training set | Top 19% | 1.3%
13 | Molecules | 37 papers in training set | Top 1% | 1.3%
14 | Journal of Medicinal Chemistry | 68 papers in training set | Top 0.8% | 1.3%
15 | Acta Pharmaceutica Sinica B | 11 papers in training set | Top 0.6% | 1.0%
16 | Genomics, Proteomics & Bioinformatics | 171 papers in training set | Top 5% | 1.0%
17 | Communications Biology | 886 papers in training set | Top 22% | 0.8%
18 | International Journal of Molecular Sciences | 453 papers in training set | Top 17% | 0.7%
19 | BMC Bioinformatics | 383 papers in training set | Top 8% | 0.5%
20 | Computers in Biology and Medicine | 120 papers in training set | Top 6% | 0.5%
21 | eLife | 5422 papers in training set | Top 62% | 0.5%
22 | Artificial Intelligence in the Life Sciences | 11 papers in training set | Top 0.4% | 0.5%
23 | Nucleic Acids Research | 1128 papers in training set | Top 20% | 0.5%
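
The statement that the top 3 journals account for 50% of the predicted probability mass is a cumulative-sum calculation. A minimal sketch of that arithmetic follows, assuming the cutoff is the smallest number of top-ranked journals whose probabilities sum to at least the target; the function name `journals_to_reach` is hypothetical, and the page does not say which rule it uses. Values are held as integer tenths of a percent to avoid floating-point rounding.

```python
from itertools import accumulate


def journals_to_reach(probabilities, target):
    """Return the smallest k such that the k largest values sum to at least `target`.

    Returns None if the total never reaches the target.
    """
    ranked = sorted(probabilities, reverse=True)
    for k, running_total in enumerate(accumulate(ranked), start=1):
        if running_total >= target:
            return k
    return None


# Probabilities for ranks 1 to 5 from the list above, in tenths of a percent.
top_five_tenths = [448, 52, 52, 47, 43]
k = journals_to_reach(top_five_tenths, target=500)
```

With the rounded percentages shown, the first two entries sum to exactly 50.0%, so this sketch yields 2 under an "at least" rule. The page's count of 3 would be consistent with unrounded probabilities that fall just short of 50% after two journals, or with a strict "greater than" rule; either explanation is a guess.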