Back to Search
Start Over
Prediction of enhancer–promoter interactions using the cross-cell type information and domain adversarial neural network
- Source :
- BMC Bioinformatics, Vol 21, Iss 1, Pp 1-16 (2020), BMC Bioinformatics
- Publication Year :
- 2020
- Publisher :
- BMC, 2020.
-
Abstract
- Background Enhancer–promoter interactions (EPIs) play key roles in transcriptional regulation and disease progression. Although several computational methods have been developed to predict such interactions, their performances are not satisfactory when training and testing data from different cell lines. Currently, it is still unclear what extent a across cell line prediction can be made based on sequence-level information. Results In this work, we present a novel Sequence-based method (called SEPT) to predict the enhancer–promoter interactions in new cell line by using the cross-cell information and Transfer learning. SEPT first learns the features of enhancer and promoter from DNA sequences with convolutional neural network (CNN), then designing the gradient reversal layer of transfer learning to reduce the cell line specific features meanwhile retaining the features associated with EPIs. When the locations of enhancers and promoters are provided in new cell line, SEPT can successfully recognize EPIs in this new cell line based on labeled data of other cell lines. The experiment results show that SEPT can effectively learn the latent import EPIs-related features between cell lines and achieves the best prediction performance in terms of AUC (the area under the receiver operating curves). Conclusions SEPT is an effective method for predicting the EPIs in new cell line. Domain adversarial architecture of transfer learning used in SEPT can learn the latent EPIs shared features among cell lines from all other existing labeled data. It can be expected that SEPT will be of interest to researchers concerned with biological interaction prediction.
- Subjects :
- Cell type
Computer science
Convolutional neural network
Regulatory Sequences, Nucleic Acid
lcsh:Computer applications to medicine. Medical informatics
Biochemistry
Domain (software engineering)
Enhancer–promoter interactions
03 medical and health sciences
0302 clinical medicine
Structural Biology
Transcriptional regulation
Humans
Layer (object-oriented design)
Enhancer
Promoter Regions, Genetic
Molecular Biology
lcsh:QH301-705.5
030304 developmental biology
Gradient reversal layer
0303 health sciences
Artificial neural network
business.industry
Applied Mathematics
Methodology Article
Pattern recognition
Computer Science Applications
Transfer learning
ROC Curve
lcsh:Biology (General)
Cell culture
Area Under Curve
lcsh:R858-859.7
Artificial intelligence
Neural Networks, Computer
DNA microarray
Transfer of learning
business
Cell line
030217 neurology & neurosurgery
Test data
Subjects
Details
- Language :
- English
- ISSN :
- 14712105
- Volume :
- 21
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- BMC Bioinformatics
- Accession number :
- edsair.doi.dedup.....2f0a7d259c2c98a569fe0376f41b5ed2
- Full Text :
- https://doi.org/10.1186/s12859-020-03844-4