1. Classification of Parkinson’s disease utilizing multi-edit nearest-neighbor and ensemble learning algorithms with speech samples
- Author
-
Liuyang Yang, Hehua Zhang, Mingguo Qiu, Pin Wang, Yongming Li, Yan Fang, Yuchuan Liu, Yin Jun, and Xueru Zhu
- Subjects
Future studies ,Computer science ,Biomedical Engineering ,Stability (learning theory) ,02 engineering and technology ,Machine learning ,computer.software_genre ,k-nearest neighbors algorithm ,Biomaterials ,03 medical and health sciences ,0302 clinical medicine ,Optimal selection of speech samples ,Ensemble learning ,Decorrelated neural network ensembles (DNNE) ,0202 electrical engineering, electronic engineering, information engineering ,Humans ,Speech ,Radiology, Nuclear Medicine and imaging ,Instance selection ,Multi-edit-nearest-neighbor algorithm (MENN) ,Radiological and Ultrasound Technology ,Artificial neural network ,business.industry ,Research ,Random forest (RF) ,Parkinson Disease ,Pattern recognition ,General Medicine ,Classification of Parkinson disease ,Random forest ,020201 artificial intelligence & image processing ,Neural Networks, Computer ,Noise (video) ,Artificial intelligence ,business ,computer ,Algorithm ,030217 neurology & neurosurgery - Abstract
Background The use of speech based data in the classification of Parkinson disease (PD) has been shown to provide an effect, non-invasive mode of classification in recent years. Thus, there has been an increased interest in speech pattern analysis methods applicable to Parkinsonism for building predictive tele-diagnosis and tele-monitoring models. One of the obstacles in optimizing classifications is to reduce noise within the collected speech samples, thus ensuring better classification accuracy and stability. While the currently used methods are effect, the ability to invoke instance selection has been seldomly examined. Methods In this study, a PD classification algorithm was proposed and examined that combines a multi-edit-nearest-neighbor (MENN) algorithm and an ensemble learning algorithm. First, the MENN algorithm is applied for selecting optimal training speech samples iteratively, thereby obtaining samples with high separability. Next, an ensemble learning algorithm, random forest (RF) or decorrelated neural network ensembles (DNNE), is used to generate trained samples from the collected training samples. Lastly, the trained ensemble learning algorithms are applied to the test samples for PD classification. This proposed method was examined using a more recently deposited public datasets and compared against other currently used algorithms for validation. Results Experimental results showed that the proposed algorithm obtained the highest degree of improved classification accuracy (29.44%) compared with the other algorithm that was examined. Furthermore, the MENN algorithm alone was found to improve classification accuracy by as much as 45.72%. Moreover, the proposed algorithm was found to exhibit a higher stability, particularly when combining the MENN and RF algorithms. Conclusions This study showed that the proposed method could improve PD classification when using speech data and can be applied to future studies seeking to improve PD classification methods.
- Published
- 2016
- Full Text
- View/download PDF