1. Plasma protein binding prediction focusing on residue-level features and circularity of cyclic peptides by deep learning
- Author
-
Yutaka Akiyama, Masahito Ohue, Yasushi Yoshikawa, Keisuke Yanagisawa, and Jianan Li
- Subjects
Statistics and Probability ,chemistry.chemical_classification ,AcademicSubjects/SCI01060 ,Correlation coefficient ,Drug discovery ,Plasma protein binding ,Original Papers ,Biochemistry ,Small molecule ,Circular convolution ,Cyclic peptide ,Computer Science Applications ,Computational Mathematics ,Residue (chemistry) ,Computational Theory and Mathematics ,chemistry ,Molecular descriptor ,Data and Text Mining ,Biological system ,Molecular Biology - Abstract
Motivation In recent years, cyclic peptide drugs have been receiving increasing attention because they can target proteins that are difficult to be tackled by conventional small-molecule drugs or antibody drugs. Plasma protein binding rate (%PPB) is a significant pharmacokinetic property of a compound in drug discovery and design. However, due to structural differences, previous computational prediction methods developed for small-molecule compounds cannot be successfully applied to cyclic peptides, and methods for predicting the PPB rate of cyclic peptides with high accuracy are not yet available. Results Cyclic peptides are larger than small molecules, and their local structures have a considerable impact on PPB; thus, molecular descriptors expressing residue-level local features of cyclic peptides, instead of those expressing the entire molecule, as well as the circularity of the cyclic peptides should be considered. Therefore, we developed a prediction method named CycPeptPPB using deep learning that considers both factors. First, the macrocycle ring of cyclic peptides was decomposed residue by residue. The residue-based descriptors were arranged according to the sequence information of the cyclic peptide. Furthermore, the circular data augmentation method was used, and the circular convolution method CyclicConv was devised to express the cyclic structure. CycPeptPPB exhibited excellent performance, with mean absolute error (MAE) of 4.79% and correlation coefficient (R) of 0.92 for the public drug dataset, compared to the prediction performance of the existing PPB rate prediction software (MAE=15.08%, R=0.63). Availability and implementation The data underlying this article are available in the online supplementary material. The source code of CycPeptPPB is available at https://github.com/akiyamalab/cycpeptppb. Supplementary information Supplementary data are available at Bioinformatics online.
- Published
- 2021
- Full Text
- View/download PDF