Back to Search Start Over

Splice sites detection using chaos game representation and neural network.

Authors :
Hoang, Tung
Yin, Changchuan
Yau, Stephen S.-T.
Source :
Genomics. Mar2020, Vol. 112 Issue 2, p1847-1852. 6p.
Publication Year :
2020

Abstract

A novel method is proposed to detect the acceptor and donor splice sites using chaos game representation and artificial neural network. In order to achieve high accuracy, inputs to the neural network, or feature vector, shall reflect the true nature of the DNA segments. Therefore it is important to have one-to-one numerical representation, i.e. a feature vector should be able to represent the original data. Chaos game representation (CGR) is an iterative mapping technique that assigns each nucleotide in a DNA sequence to a respective position on the plane in a one-to-one manner. Using CGR, a DNA sequence can be mapped to a numerical sequence that reflects the true nature of the original sequence. In this research, we propose to use CGR as feature input to a neural network to detect splice sites on the NN269 dataset. Computational experiments indicate that this approach gives good accuracy while being simpler than other methods in the literature, with only one neural network component. The code and data for our method can be accessed from this link: https://github.com/thoang3/portfolio/tree/SpliceSites_ANN_CGR. • We propose to use chaos game representation and neural network for splice sites detection. • DNA sequences are mapped to numerical sequences using chaos game representation. • Numerical sequences are used as feature inputs to neural networks for splice sites classification. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
08887543
Volume :
112
Issue :
2
Database :
Academic Search Index
Journal :
Genomics
Publication Type :
Academic Journal
Accession number :
141755304
Full Text :
https://doi.org/10.1016/j.ygeno.2019.10.018