Back to Search Start Over

Mining channel-regulated peptides from animal venom by integrating sequence semantics and structural information.

Authors :
Wang, Jian-Ming
Cui, Rong-Kai
Qian, Zheng-Kun
Yang, Zi-Zhong
Li, Yi
Source :
Computational Biology & Chemistry. Apr2024, Vol. 109, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Channel-regulated peptides (CRPs) derived from animal venom hold great promise as potential drug candidates for numerous diseases associated with channel proteins. However, discovering and identifying CRPs using traditional bio-experimental methods is a time-consuming and laborious process. While there were a few computational studies on CRPs, they were limited to specific channel proteins, relied heavily on complex feature engineering, and lacked the incorporation of multi-source information. To address these problems, we proposed a novel deep learning model, called DeepCRPs, based on graph neural networks for systematically mining CRPs from animal venom. By combining the sequence semantic and structural information, the classification performance of four CRPs was significantly enhanced, reaching an accuracy of 0.92. This performance surpassed baseline models with accuracies ranging from 0.77 to 0.89. Furthermore, we employed advanced interpretable techniques to explore sequence and structural determinants relevant to the classification of CRPs, yielding potentially valuable bio-function interpretations. Comprehensive experimental results demonstrated the precision and interpretive capability of DeepCRPs, making it an accurate and bio-explainable suit for the identification and categorization of CRPs. Our research will contribute to the discovery and development of toxin peptides targeting channel proteins. The source data and code are freely available at https://github.com/liyigerry/DeepCRPs. [Display omitted] • Sequence semantics and structural information were fused in graph-based models to improve the performance of CRPs prediction. • The pre-trained language model extracted the semantic features of protein sequences as feature vectors of nodes in the graph, which is better than other features such as one-hot encoding. • An advanced interpretable technique was employed to explore sequence and structural determinants relevant to classification, yielding potentially valuable bio-function interpretations. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
14769271
Volume :
109
Database :
Academic Search Index
Journal :
Computational Biology & Chemistry
Publication Type :
Academic Journal
Accession number :
175981471
Full Text :
https://doi.org/10.1016/j.compbiolchem.2024.108027