Back to Search Start Over

DiCleave: a deep learning model for predicting human Dicer cleavage sites.

Authors :
Mu, Lixuan
Song, Jiangning
Akutsu, Tatsuya
Mori, Tomoya
Source :
BMC Bioinformatics; 1/9/2024, Vol. 25 Issue 1, p1-15, 15p
Publication Year :
2024

Abstract

Background: MicroRNAs (miRNAs) are a class of non-coding RNAs that play a pivotal role as gene expression regulators. These miRNAs are typically approximately 20 to 25 nucleotides long. The maturation of miRNAs requires Dicer cleavage at specific sites within the precursor miRNAs (pre-miRNAs). Recent advances in machine learning-based approaches for cleavage site prediction, such as PHDcleav and LBSizeCleav, have been reported. ReCGBM, a gradient boosting-based model, demonstrates superior performance compared with existing methods. Nonetheless, ReCGBM operates solely as a binary classifier despite the presence of two cleavage sites in a typical pre-miRNA. Previous approaches have focused on utilizing only a fraction of the structural information in pre-miRNAs, often overlooking comprehensive secondary structure information. There is a compelling need for the development of a novel model to address these limitations. Results: In this study, we developed a deep learning model for predicting the presence of a Dicer cleavage site within a pre-miRNA segment. This model was enhanced by an autoencoder that learned the secondary structure embeddings of pre-miRNA. Benchmarking experiments demonstrated that the performance of our model was comparable to that of ReCGBM in the binary classification tasks. In addition, our model excelled in multi-class classification tasks, making it a more versatile and practical solution than ReCGBM. Conclusions: Our proposed model exhibited superior performance compared with the current state-of-the-art model, underscoring the effectiveness of a deep learning approach in predicting Dicer cleavage sites. Furthermore, our model could be trained using only sequence and secondary structure information. Its capacity to accommodate multi-class classification tasks has enhanced the practical utility of our model. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
14712105
Volume :
25
Issue :
1
Database :
Complementary Index
Journal :
BMC Bioinformatics
Publication Type :
Academic Journal
Accession number :
174684416
Full Text :
https://doi.org/10.1186/s12859-024-05638-4