Back to Search Start Over

M6A-BiNP: predicting N6-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information.

Authors :
Wang, Mingzhao
Xie, Juanying
Xu, Shengquan
Source :
RNA Biology; Dec 2021, Vol. 18 Issue 12, p2498-2512, 15p
Publication Year :
2021

Abstract

N<superscript>6</superscript>-methyladenosine (m<superscript>6</superscript>A) plays an important role in various biological processes. Identifying m<superscript>6</superscript>A site is a key step in exploring its biological functions. One of the biggest challenges in identifying m<superscript>6</superscript>A sites is how to extract features comprising rich categorical information to distinguish m<superscript>6</superscript>A and non-m<superscript>6</superscript>A sites. To address this challenge, we propose bidirectional dinucleotide and trinucleotide position-specific propensities, respectively, in this paper. Based on this, we propose two feature-encoding algorithms: Position-Specific Propensities and Pointwise Mutual Information (PSP-PMI) and Position-Specific Propensities and Pointwise Joint Mutual Information (PSP-PJMI). PSP-PMI is based on the bidirectional dinucleotide propensity and the pointwise mutual information, while PSP-PJMI is based on the bidirectional trinucleotide position-specific propensity and the proposed pointwise joint mutual information in this paper. We introduce parameters α and β in PSP-PMI and PSP-PJMI, respectively, to represent the distance from the nucleotide to its forward or backward adjacent nucleotide or dinucleotide, so as to extract features containing local and global classification information. Finally, we propose the M6A-BiNP predictor based on PSP-PMI or PSP-PJMI and SVM classifier. The 10-fold cross-validation experimental results on the benchmark datasets of non-single-base resolution and single-base resolution demonstrate that PSP-PMI and PSP-PJMI can extract features with strong capabilities to identify m<superscript>6</superscript>A and non-m<superscript>6</superscript>A sites. The M6A-BiNP predictor based on our proposed feature encoding algorithm PSP-PJMI is better than the state-of-the-art predictors, and it is so far the best model to identify m<superscript>6</superscript>A and non-m<superscript>6</superscript>A sites. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
15476286
Volume :
18
Issue :
12
Database :
Complementary Index
Journal :
RNA Biology
Publication Type :
Academic Journal
Accession number :
153736799
Full Text :
https://doi.org/10.1080/15476286.2021.1930729