Back to Search Start Over

Restricted Boltzmann Machine-Based Voice Conversion for Nonparallel Corpus

Authors :
Ki-Seung Lee
Source :
IEEE Signal Processing Letters. 24:1103-1107
Publication Year :
2017
Publisher :
Institute of Electrical and Electronics Engineers (IEEE), 2017.

Abstract

A large amount of parallel training corpus is necessary for robust, high-quality voice conversion. However, such parallel data may not always be available. This letter presents a new voice conversion method that needs no parallel speech corpus, and adopts a restricted Boltzmann machine (RBM) to represent the distribution of the spectral features derived from a target speaker. A linear transformation was employed to convert the spectral and delta features. A conversion function was obtained by maximizing the conditional probability density function with respect to the target RBM. A feasibility test was carried out on the OGI VOICES corpus. Results from the subjective listening tests and the objective results both showed that the proposed method outperforms the conventional GMM-based method.

Details

ISSN :
15582361 and 10709908
Volume :
24
Database :
OpenAIRE
Journal :
IEEE Signal Processing Letters
Accession number :
edsair.doi...........923d35502c5d48235226c19cd0d28d39
Full Text :
https://doi.org/10.1109/lsp.2017.2713412