Back to Search Start Over

HELM-GPT: de novo macrocyclic peptide design using generative pre-trained transformer.

Authors :
Xu, Xiaopeng
Xu, Chencheng
He, Wenjia
Wei, Lesong
Li, Haoyang
Zhou, Juexiao
Zhang, Ruochi
Wang, Yu
Xiong, Yuanpeng
Gao, Xin
Source :
Bioinformatics. Jun2024, Vol. 40 Issue 6, p1-9. 9p.
Publication Year :
2024

Abstract

Motivation Macrocyclic peptides hold great promise as therapeutics targeting intracellular proteins. This stems from their remarkable ability to bind flat protein surfaces with high affinity and specificity while potentially traversing the cell membrane. Research has already explored their use in developing inhibitors for intracellular proteins, such as KRAS, a well-known driver in various cancers. However, computational approaches for de novo macrocyclic peptide design remain largely unexplored. Results Here, we introduce HELM-GPT, a novel method that combines the strength of the hierarchical editing language for macromolecules (HELM) representation and generative pre-trained transformer (GPT) for de novo macrocyclic peptide design. Through reinforcement learning (RL), our experiments demonstrate that HELM-GPT has the ability to generate valid macrocyclic peptides and optimize their properties. Furthermore, we introduce a contrastive preference loss during the RL process, further enhanced the optimization performance. Finally, to co-optimize peptide permeability and KRAS binding affinity, we propose a step-by-step optimization strategy, demonstrating its effectiveness in generating molecules fulfilling both criteria. In conclusion, the HELM-GPT method can be used to identify novel macrocyclic peptides to target intracellular proteins. Availability and implementation The code and data of HELM-GPT are freely available on GitHub (https://github.com/charlesxu90/helm-gpt). [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
13674803
Volume :
40
Issue :
6
Database :
Academic Search Index
Journal :
Bioinformatics
Publication Type :
Academic Journal
Accession number :
178159567
Full Text :
https://doi.org/10.1093/bioinformatics/btae364