Back to Search Start Over

Speech reconstruction using a deep partially supervised neural network

Authors :
Ian McLoughlin
Jingjie Li
Yan Song
Hamid R. Sharifzadeh
Source :
Healthcare Technology Letters (2017)
Publication Year :
2017
Publisher :
Wiley, 2017.

Abstract

Statistical speech reconstruction for larynx-related dysphonia has achieved good performance using Gaussian mixture models and, more recently, restricted Boltzmann machine arrays; however, deep neural network (DNN)-based systems have been hampered by the limited amount of training data available from individual voice-loss patients. The authors propose a novel DNN structure that allows a partially supervised training approach on spectral features from smaller data sets, yielding very good results compared with the current state-of-the-art.

Details

Language :
English
ISSN :
20533713
Database :
Directory of Open Access Journals
Journal :
Healthcare Technology Letters
Publication Type :
Academic Journal
Accession number :
edsdoj.79ee62f643954f2780c9352efd0f30bc
Document Type :
article
Full Text :
https://doi.org/10.1049/htl.2016.0103