Back to Search Start Over

An Efficient Deep Learning Approach for DNA-Binding Proteins Classification from Primary Sequences

Authors :
Ahmed, Nosiba Yousif
Alsanousi, Wafa Alameen
Hamid, Eman Mohammed
Elbashir, Murtada K.
Al-Aidarous, Khadija Mohammed
Mohammed, Mogtaba
Musa, Mohamed Elhafiz M.
Source :
International Journal of Computational Intelligence Systems; December 2024, Vol. 17 Issue: 1
Publication Year :
2024

Abstract

As the number of identified proteins has expanded, the accurate identification of proteins has become a significant challenge in the field of biology. Various computational methods, such as Support Vector Machine (SVM), K-nearest neighbors (KNN), and convolutional neural network (CNN), have been proposed to recognize deoxyribonucleic acid (DNA)-binding proteins solely based on amino acid sequences. However, these methods do not consider the contextual information within amino acid sequences, limiting their ability to adequately capture sequence features. In this study, we propose a novel approach to identify DNA-binding proteins by integrating a CNN with bidirectional long-short-term memory (LSTM) and gated recurrent unit (GRU) as (CNN-BiLG). The CNN-BiLG model can explore the potential contextual relationships of amino acid sequences and obtain more features than traditional models. Our experimental results demonstrate a validation set prediction accuracy of 94% for the proposed CNN-BiLG, surpassing the accuracy of machine learning models and deep learning models. Furthermore, our model is both effective and efficient, exhibiting commendable classification accuracy based on comparative analysis.

Details

Language :
English
ISSN :
18756891 and 18756883
Volume :
17
Issue :
1
Database :
Supplemental Index
Journal :
International Journal of Computational Intelligence Systems
Publication Type :
Periodical
Accession number :
ejs66059531
Full Text :
https://doi.org/10.1007/s44196-024-00462-3