Back to Search Start Over

Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction.

Authors :
Liang, Ying
Wu, Yanan
Zhang, Zequn
Liu, Niannian
Peng, Jun
Tang, Jianjun
Source :
BMC Bioinformatics. 6/29/2022, Vol. 23 Issue 1, p1-18. 18p.
Publication Year :
2022

Abstract

Background: DNA N4-methylcytosine is part of the restrictive modification system, which works by regulating some biological processes, for example, the initiation of DNA replication, mismatch repair and inactivation of transposon. However, using experimental methods to detect 4mC sites is time-consuming and expensive. Besides, considering the huge differences in the number of 4mC samples among different species, it is challenging to achieve a robust multi-species 4mC site prediction performance. Hence, it is of great significance to develop effective computational tools to identify 4mC sites. Results: This work proposes a flexible deep learning-based framework to predict 4mC sites, called Hyb4mC. Hyb4mC adopts the DNA2vec method for sequence embedding, which captures more efficient and comprehensive information compared with the sequence-based feature method. Then, two different subnets are used for further analysis: Hyb_Caps and Hyb_Conv. Hyb_Caps is composed of a capsule neural network and can generalize from fewer samples. Hyb_Conv combines the attention mechanism with a text convolutional neural network for further feature learning. Conclusions: Extensive benchmark tests have shown that Hyb4mC can significantly enhance the performance of predicting 4mC sites compared with the recently proposed methods. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
14712105
Volume :
23
Issue :
1
Database :
Academic Search Index
Journal :
BMC Bioinformatics
Publication Type :
Academic Journal
Accession number :
157713027
Full Text :
https://doi.org/10.1186/s12859-022-04789-6