Back to Search Start Over

SubLocEP: a novel ensemble predictor of subcellular localization of eukaryotic mRNA based on machine learning.

Authors :
Li, Jing
Zhang, Lichao
He, Shida
Guo, Fei
Zou, Quan
Source :
Briefings in Bioinformatics; Sep2021, Vol. 22 Issue 5, p1-11, 11p
Publication Year :
2021

Abstract

Motivation mRNA location corresponds to the location of protein translation and contributes to precise spatial and temporal management of the protein function. However, current assignment of subcellular localization of eukaryotic mRNA reveals important limitations: (1) turning multiple classifications into multiple dichotomies makes the training process tedious; (2) the majority of the models trained by classical algorithm are based on the extraction of single sequence information; (3) the existing state-of-the-art models have not reached an ideal level in terms of prediction and generalization ability. To achieve better assignment of subcellular localization of eukaryotic mRNA, a better and more comprehensive model must be developed. Results In this paper, SubLocEP is proposed as a two-layer integrated prediction model for accurate prediction of the location of sequence samples. Unlike the existing models based on limited features, SubLocEP comprehensively considers additional feature attributes and is combined with LightGBM to generated single feature classifiers. The initial integration model (single-layer model) is generated according to the categories of a feature. Subsequently, two single-layer integration models are weighted (sequence-based: physicochemical properties = 3:2) to produce the final two-layer model. The performance of SubLocEP on independent datasets is sufficient to indicate that SubLocEP is an accurate and stable prediction model with strong generalization ability. Additionally, an online tool has been developed that contains experimental data and can maximize the user convenience for estimation of subcellular localization of eukaryotic mRNA. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
14675463
Volume :
22
Issue :
5
Database :
Complementary Index
Journal :
Briefings in Bioinformatics
Publication Type :
Academic Journal
Accession number :
152975137
Full Text :
https://doi.org/10.1093/bib/bbaa401