Back to Search Start Over

Pippin: A random forest-based method for identifying presynaptic and postsynaptic neurotoxins.

Authors :
Li, Pengyu
Zhang, He
Zhao, Xuyang
Jia, Cangzhi
Li, Fuyi
Song, Jiangning
Source :
Journal of Bioinformatics & Computational Biology. Apr2020, Vol. 18 Issue 2, pN.PAG-N.PAG. 21p.
Publication Year :
2020

Abstract

Presynaptic and postsynaptic neurotoxins are two types of neurotoxins from venomous animals and functionally important molecules in the neurosciences; however, their experimental characterization is difficult, time-consuming, and costly. Therefore, bioinformatics tools that can identify presynaptic and postsynaptic neurotoxins would be very useful for understanding their functions and mechanisms. In this study, we propose Pippin, a novel machine learning-based method that allows users to rapidly and accurately identify these two types of neurotoxins. Pippin was developed using the random forest (RF) algorithm and evaluated based on an up-to-date dataset. A variety of sequence and motif features were combined, and a two-step feature-selection algorithm was employed to characterize the optimal feature subset for presynaptic and postsynaptic neurotoxin prediction. Extensive benchmark tests illustrate that Pippin significantly improved predictive performance as compared with six other commonly used machine-learning algorithms, including the naïve Bayes classifier, Multinomial Naïve Bayes classifier (MNBC), AdaBoost, Bagging, K -nearest neighbors, and XGBoost. Additionally, we developed an online webserver for Pippin to facilitate public use. To the best of our knowledge, this is the first webserver for presynaptic and postsynaptic neurotoxin prediction. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02197200
Volume :
18
Issue :
2
Database :
Academic Search Index
Journal :
Journal of Bioinformatics & Computational Biology
Publication Type :
Academic Journal
Accession number :
143189253
Full Text :
https://doi.org/10.1142/S0219720020500080