Back to Search Start Over

BiPSTP: Sequence feature encoding method for identifying different RNA modifications with bidirectional position-specific trinucleotides propensities.

Authors :
Mingzhao Wang
Haider Ali
Yandi Xu
Juanying Xie
Shengquan Xu
Source :
Journal of Biological Chemistry. Apr2024, Vol. 300 Issue 4, p1-20. 20p.
Publication Year :
2024

Abstract

RNA modification, a posttranscriptional regulatory mechanism, significantly influences RNA biogenesis and function. The accurate identification of modification sites is paramount for investigating their biological implications. Methods for encoding RNA sequence into numerical data play a crucial role in developing robust models for predicting modification sites. However, existing techniques suffer from limitations, including inadequate information representation, challenges in effectively integrating positional and sequential information, and the generation of irrelevant or redundant features when combining multiple approaches. These deficiencies hinder the effectiveness of machine learning models in addressing the performance challenges associated with predicting RNA modification sites. Here, we introduce a novel RNA sequence feature representation method, named BiPSTP, which utilizes bidirectional trinucleotide position-specific propensities. We employ the parameter ΞΎ to denote the interval between the current nucleotide and its adjacent forward or backward dinucleotide, enabling the extraction of positional and sequential information from RNA sequences. Leveraging the BiPSTP method, we have developed the prediction model mRNAPred using support vector machine classifier to identify multiple types of RNA modification sites. We evaluate the performance of our BiPSTP method and mRNAPred model across 12 distinct RNA modification types. Our experimental results demonstrate the superiority of the mRNAPred model compared to state-of-art models in the domain of RNA modification sites identification. Importantly, our BiPSTP method enhances the robustness and generalization performance of prediction models. Notably, it can be applied to feature extraction from DNA sequences to predict other biological modification sites. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00219258
Volume :
300
Issue :
4
Database :
Academic Search Index
Journal :
Journal of Biological Chemistry
Publication Type :
Academic Journal
Accession number :
177095101
Full Text :
https://doi.org/10.1016/j.jbc.2024.107140