Back to Search
Start Over
Voice activity detection in a regularized reproducing kernel Hilbert space
- Publication Year :
- 2010
-
Abstract
- Voice activity detection (VAD) is used to detect whether the acoustic signal belongs to speech or non-speech clusters based on the statistical distribution of the acoustic features. Traditional VAD algorithms are applied in a linear transformed space without any constraint relating to the special characteristics speech or noise. As a result, the VAD algorithms are not robust to noise interference. Considering that speech is a special type of acoustic signal that only occupies a small fraction of the whole acoustic space, we proposed a new speech feature extraction method by giving constraints on the processing space as a reproducing kernel Hilbert space (RKHS). In the RKHS, we regarded the speech estimation as a functional approximation problem, and estimated the approximation function via a regularized framework in the RKHS. Under this framework, we could incorporate the nonlinear mapping functions in the approximation implicitly via a kernel function. The approximation function could capture the nonlinear and high-order statistical regularities of the speech. Our VAD algorithm is designed on the basis of the power energy in this regularized RKHS. Compared with a baseline and G.729B VAD algorithms, experimental results showed the promising advantages of our proposed algorithm.
Details
- Database :
- OAIster
- Notes :
- application/pdf, English
- Publication Type :
- Electronic Resource
- Accession number :
- edsoai.ocn747420456
- Document Type :
- Electronic Resource