Back

Constrained Evolutionary Design of Matrixyl Analogs: Balancing Permeability and Functional Preservation Through Computational Optimization

Komianos, N.; Prakash, P.

2026-05-14 bioinformatics
10.64898/2026.05.12.724473 bioRxiv
Show abstract

Matrixyl (palmitoyl pentapeptide-4, KTTKS core) is a collagen-stimulating peptide used in topical anti-ageing products, but its in-use efficacy is limited by poor permeation through the stratum corneum. We describe a deterministic computational workflow that combines a tournament genetic algorithm and NSGA-II with exact RDKit molecular descriptors to search the fixed-length, edit-distance-2 neighbourhood of KTTKS (3,706 candidate sequences) for analogs with descriptors more favourable for passive transdermal diffusion. The search returns a 9-member Pareto frontier that quantifies the trade-off between predicted permeability and motif preservation. Five of the nine frontier members carry the same substitution, lysine to proline at position 4 (K4P). This single change lowers the topological polar surface area by 25.6%, removes the +1 charge contributed by lysine, and reduces the functional-preservation score from 1.00 (KTTKS) to 0.67. The frontier ranking is unchanged by {+/-}30% perturbations to the TPSA and Mw penalty weights and by a 30% increase in the LogP penalty; only a 30% reduction in the LogP penalty produces rank movement. The frontier matches the ground-truth Pareto set obtained by exhaustive enumeration of all 3,706 candidates (precision and recall both 100%). On the basis of these results we recommend three sequences for experimental validation: PTTPS (largest predicted gain), KTTPS (single-mutation, conservative), and KTTPP (backup). All code, results, and figures are released under MIT and CC BY 4.0.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.5%
12.4%
2
PLOS Computational Biology
1633 papers in training set
Top 3%
10.0%
3
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.2%
9.1%
4
Advanced Science
249 papers in training set
Top 3%
6.3%
5
Nature Communications
4913 papers in training set
Top 33%
4.8%
6
Bioinformatics
1061 papers in training set
Top 5%
3.6%
7
Scientific Reports
3102 papers in training set
Top 37%
3.6%
8
PLOS ONE
4510 papers in training set
Top 41%
3.2%
50% of probability mass above
9
International Journal of Molecular Sciences
453 papers in training set
Top 3%
3.0%
10
Clinical and Translational Science
21 papers in training set
Top 0.2%
2.9%
11
Journal of Medicinal Chemistry
68 papers in training set
Top 0.5%
2.3%
12
Pharmaceuticals
33 papers in training set
Top 0.7%
1.7%
13
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
14
Pharmaceutics
21 papers in training set
Top 0.2%
1.5%
15
Communications Biology
886 papers in training set
Top 11%
1.5%
16
Communications Chemistry
39 papers in training set
Top 0.4%
1.3%
17
eLife
5422 papers in training set
Top 51%
1.1%
18
Chemical Science
71 papers in training set
Top 2%
0.9%
19
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.9%
20
Journal of Cheminformatics
25 papers in training set
Top 0.5%
0.9%
21
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.8%
22
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 45%
0.7%
23
Advanced Therapeutics
15 papers in training set
Top 0.5%
0.7%
24
Biophysical Journal
545 papers in training set
Top 5%
0.7%
25
Clinical Pharmacology & Therapeutics
25 papers in training set
Top 0.8%
0.7%
26
Frontiers in Immunology
586 papers in training set
Top 8%
0.7%
27
Frontiers in Pharmacology
100 papers in training set
Top 5%
0.7%
28
iScience
1063 papers in training set
Top 37%
0.6%
29
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.6%
30
Frontiers in Molecular Biosciences
100 papers in training set
Top 6%
0.6%