Back to Search
Start Over
Restricted Boltzmann Machine-Based Voice Conversion for Nonparallel Corpus
- Source :
- IEEE Signal Processing Letters. 24:1103-1107
- Publication Year :
- 2017
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2017.
-
Abstract
- A large amount of parallel training corpus is necessary for robust, high-quality voice conversion. However, such parallel data may not always be available. This letter presents a new voice conversion method that needs no parallel speech corpus, and adopts a restricted Boltzmann machine (RBM) to represent the distribution of the spectral features derived from a target speaker. A linear transformation was employed to convert the spectral and delta features. A conversion function was obtained by maximizing the conditional probability density function with respect to the target RBM. A feasibility test was carried out on the OGI VOICES corpus. Results from the subjective listening tests and the objective results both showed that the proposed method outperforms the conventional GMM-based method.
- Subjects :
- Restricted Boltzmann machine
Training set
business.industry
Computer science
Applied Mathematics
Speech recognition
Feature extraction
020206 networking & telecommunications
Pattern recognition
Speech corpus
Probability density function
02 engineering and technology
Conditional probability distribution
030507 speech-language pathology & audiology
03 medical and health sciences
Distribution (mathematics)
Signal Processing
0202 electrical engineering, electronic engineering, information engineering
Artificial intelligence
Electrical and Electronic Engineering
0305 other medical science
business
Subjects
Details
- ISSN :
- 15582361 and 10709908
- Volume :
- 24
- Database :
- OpenAIRE
- Journal :
- IEEE Signal Processing Letters
- Accession number :
- edsair.doi...........923d35502c5d48235226c19cd0d28d39
- Full Text :
- https://doi.org/10.1109/lsp.2017.2713412