1. DeepMal: Accurate prediction of protein malonylation sites by deep neural networks.
- Author
-
Wang, Minghui, Cui, Xiaowen, Li, Shan, Yang, Xinhua, Ma, Anjun, Zhang, Yusen, and Yu, Bin
- Subjects
- *
CONVOLUTIONAL neural networks , *FORECASTING , *FATTY acid oxidation , *DEEP learning , *POST-translational modification , *PROKARYOTES - Abstract
Lysine malonylation is one of the important post-translational modification (PTM) of proteins. Malonylated proteins can affect various cell functions of eukaryotes and prokaryotes, which play an important role in metabolic pathways, mitochondrial functions, fatty acid oxidation and other life activities. However, accurate identification of the malonylation sites is the key to further understand the molecular mechanism of malonylation. Currently, experimental identification is still a challenging task, which usually requires a large amount of laboratory work and considerable cost. Regarding this situation, there is an urgent need to establish useful calculation methods and develop effective predictors. We propose a new deep learning network model called DeepMal. Firstly, features are extracted by enhanced amino acid composition (EAAC), enhanced grouped amino acid composition (EGAAC), dipeptide deviation from the expected mean (DDE), K nearest neighbors (KNN) and BLOSUM62 matrix. Secondly, the linear convolutional neural network is used to extract the specific features of malonylation sites, select the relevant features and reduce the feature dimension through maximum pooling. Finally, malonylation sites and non-malonylation sites are classified by a multilayer neural network. And an independent dataset is used to assess the predictive ability of the model DeepMal. On the independent datasets Escherichia coli (E. coli), Homo sapiens (H. sapiens), Mus musculus (M. musculus), the AUC values are 0.974, 0.956 and 0.944, and the accuracies are 96.5%, 95.5% and 94.5%, respectively. Compared with other prediction models, the prediction accuracy is increased by 9.5%–18.5%, and this indicates the effectiveness of the prediction model DeepMal. Using deep learning network can enhance the robustness of DeepMal model for predicting malonylation sites. The source codes and all datasets are publicly available at https://github.com/QUST-AIBBDRC/DeepMal/. • A novel method (DeepMal) to predict the lysine malonylation sites of proteins. • Protein sequence information is extracted from sequence features, physicochemical properties, and evolutionary features. • Convolutional neural networks can learn to extract significant features associated with malonylation sites. • Deep neural networks can improve the accurate prediction of protein lysine malonylation sites of proteins. • The proposed method increases the prediction performance on independent test datasets compared with other methods. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF