Back to Search Start Over

Improved multi-label classifiers for predicting protein subcellular localization.

Authors :
Chen L
Qu R
Liu X
Source :
Mathematical biosciences and engineering : MBE [Math Biosci Eng] 2024 Jan; Vol. 21 (1), pp. 214-236. Date of Electronic Publication: 2022 Dec 11.
Publication Year :
2024

Abstract

Protein functions are closely related to their subcellular locations. At present, the prediction of protein subcellular locations is one of the most important problems in protein science. The evident defects of traditional methods make it urgent to design methods with high efficiency and low costs. To date, lots of computational methods have been proposed. However, this problem is far from being completely solved. Recently, some multi-label classifiers have been proposed to identify subcellular locations of human, animal, Gram-negative bacterial and eukaryotic proteins. These classifiers adopted the protein features derived from gene ontology information. Although they provided good performance, they can be further improved by adopting more powerful machine learning algorithms. In this study, four improved multi-label classifiers were set up for identification of subcellular locations of the above four protein types. The random k-labelsets (RAKEL) algorithm was used to tackle proteins with multiple locations, and random forest was used as the basic prediction engine. All classifiers were tested by jackknife test, indicating their high performance. Comparisons with previous classifiers further confirmed the superiority of the proposed classifiers.

Details

Language :
English
ISSN :
1551-0018
Volume :
21
Issue :
1
Database :
MEDLINE
Journal :
Mathematical biosciences and engineering : MBE
Publication Type :
Academic Journal
Accession number :
38303420
Full Text :
https://doi.org/10.3934/mbe.2024010