Feature Selection-based Voice Transformation
- Source :
- The Journal of the Acoustical Society of Korea. 31:39-50
- Publication Year :
- 2012
- Publisher :
- The Acoustical Society of Korea, 2012.
Abstract
- This paper describes a voice transformation (VT) method that makes a source speaker’s utterance mimic that of a target speaker. Speaker individuality is transformed by altering three feature parameters: the LPC cepstrum, the pitch period, and the gain. The main objective is to construct an optimal sequence of features selected from the target speaker’s database, so as to maximize both the correlation probabilities between the transformed and source features and the likelihood of the transformed features with respect to the target model. A set of two-pass conversion rules is proposed: in the first pass, the feature parameters are selected from a database, and in the second pass, the optimal sequence of feature parameters is constructed. The conversion rules were developed using a statistical approach based on a maximum likelihood criterion. To construct the optimal sequence, a hidden Markov model (HMM) was employed to find the most likely combination of features with respect to the target speaker’s model. The effectiveness of the proposed method was evaluated using objective tests and informal listening tests. The results confirmed that the proposed method is perceptually preferred over conventional methods.
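
The two-pass idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `first_pass` and `second_pass`, the nearest-neighbour candidate selection, the unit-variance Gaussian scores, and the `trans_weight` parameter are all assumptions made for illustration. A plain Viterbi-style dynamic program stands in for the HMM-based search.

```python
import numpy as np


def first_pass(source, database, k):
    """Pick the k database vectors closest to each source frame.

    Hypothetical stand-in for the paper's first pass.
    Returns an index array of shape (T, k).
    """
    # Squared Euclidean distance between every frame and every database entry.
    dist = ((source[:, None, :] - database[None, :, :]) ** 2).sum(axis=2)
    return np.argsort(dist, axis=1)[:, :k]


def second_pass(source, database, cand_idx, trans_weight=1.0):
    """Viterbi-style search for the best candidate sequence.

    Score = local terms (closeness to the source frame) plus transition
    terms (closeness of consecutive chosen vectors). Both are negative
    squared distances, i.e. unit-variance Gaussian log-likelihoods up to
    a constant. This scoring is an assumption of the sketch.
    """
    T, k = cand_idx.shape
    cands = database[cand_idx]                                  # (T, k, D)
    local = -((cands - source[:, None, :]) ** 2).sum(axis=2)    # (T, k)

    score = local[0].copy()
    back = np.zeros((T, k), dtype=int)
    for t in range(1, T):
        # trans[i, j]: score for moving from candidate i at t-1 to j at t.
        diff = cands[t - 1][:, None, :] - cands[t][None, :, :]
        trans = -trans_weight * (diff ** 2).sum(axis=2)
        total = score[:, None] + trans                          # (k, k)
        back[t] = total.argmax(axis=0)
        score = total.max(axis=0) + local[t]

    # Trace back the best path through the candidate lattice.
    path = np.empty(T, dtype=int)
    path[-1] = score.argmax()
    for t in range(T - 1, 0, -1):
        path[t - 1] = back[t, path[t]]
    return cand_idx[np.arange(T), path]
```

With `trans_weight=0` the search reduces to picking the locally best candidate per frame. Larger values favour sequences whose consecutive vectors lie close together, which loosely mirrors the trade-off between matching the source and fitting the target model.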
- Subjects :
- Engineering
Acoustics and Ultrasonics
Applied Mathematics
Speech recognition
Feature selection
Pattern recognition
Speaker recognition
Speaker diarisation
Set (abstract data type)
Speech and Hearing
Transformation (function)
Feature (computer vision)
Signal Processing
Cepstrum
Artificial intelligence
Hidden Markov model
Instrumentation
Details
- ISSN :
- 1225-4428
- Volume :
- 31
- Database :
- OpenAIRE
- Journal :
- The Journal of the Acoustical Society of Korea
- Accession number :
- edsair.doi...........d811bc40a04a1cdd8fb74ce654ef097c
- Full Text :
- https://doi.org/10.7776/ask.2012.31.1.039