Comparing spectrum estimators in speaker verification under additive noise degradation

Authors :
Johan K. Sandberg
Cemal Hanilci
Figen Ertaş
Rahim Saeidi
Paavo Alku
Maria Hansson-Sandsten
Tomi Kinnunen
Jouni Pohjalainen
Uludağ University/Faculty of Engineering/Department of Electronics Engineering.
Hanilci, Cemal
Ertaş, Figen
AAH-4188-2021
S-4967-2016
Source :
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2012, pp. 4769-4772. Kyoto, Japan : IEEE
Publication Year :
2012
Publisher :
IEEE, 2012.

Abstract

This work was presented as a paper at the IEEE International Conference on Acoustics, Speech and Signal Processing, held in Kyoto, Japan, on 25-30 March 2012.

Different short-term spectrum estimators for speaker verification under additive noise are considered. Conventionally, mel-frequency cepstral coefficients (MFCCs) are computed from discrete Fourier transform (DFT) spectra of windowed speech frames. Recently, linear prediction (LP) and its temporally weighted variants have been substituted as the spectrum analysis method in speech and speaker recognition. In this paper, 12 different short-term spectrum estimation methods are compared for speaker verification under additive noise contamination. Experiments on NIST 2002 SRE show that the spectrum estimation method has a large effect on recognition performance: in terms of equal error rate (EER), the stabilized weighted LP (SWLP) and minimum variance distortionless response (MVDR) methods yield approximately 7% and 8% relative improvements over the standard DFT method at the -10 dB SNR level of factory and babble noise, respectively.

Inst Elect & Elect Engineers, Signal Processing Soc IEEE
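As an illustration of the two baseline estimator families named in the abstract, the following minimal sketch computes a DFT (periodogram) spectrum and a conventional LP all-pole spectrum for a single windowed frame. It is not the authors' implementation: the function names, the Hamming window, the 8 kHz sampling rate, the 30 ms frame length, the LP order of 20, and the synthetic test frame are all assumptions chosen for illustration, and the weighted LP variants and MVDR are not shown.

```python
import numpy as np


def dft_spectrum(frame, nfft=512):
    """Periodogram: squared magnitude of the DFT of a Hamming-windowed frame."""
    windowed = frame * np.hamming(len(frame))
    return np.abs(np.fft.rfft(windowed, nfft)) ** 2


def lp_spectrum(frame, order=20, nfft=512):
    """All-pole spectrum from linear prediction (autocorrelation method)."""
    n = len(frame)
    windowed = frame * np.hamming(n)
    # Autocorrelation at lags 0..order (zero lag sits at index n - 1).
    r = np.correlate(windowed, windowed, mode="full")[n - 1 : n + order]
    # Normal equations R a = r[1:], with R the Toeplitz autocorrelation matrix.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, r[1 : order + 1])
    # Prediction error power, used as the gain of the all-pole model.
    gain = r[0] - np.dot(a, r[1 : order + 1])
    # Inverse filter A(z) = 1 - sum_k a_k z^{-k}; spectrum is gain / |A|^2.
    inv_filter = np.concatenate(([1.0], -a))
    return gain / np.abs(np.fft.rfft(inv_filter, nfft)) ** 2


# Hypothetical test frame: a 1 kHz tone plus weak white noise (assumed values).
fs = 8000                      # assumed sampling rate in Hz
n = np.arange(240)             # assumed 30 ms frame at 8 kHz
rng = np.random.default_rng(0)
frame = np.sin(2 * np.pi * 1000 * n / fs) + 0.1 * rng.standard_normal(len(n))

p_dft = dft_spectrum(frame)
p_lp = lp_spectrum(frame)
```

In an MFCC front end, either spectrum would then be passed through the mel filterbank, log compression, and DCT; only the spectrum estimation step differs between the compared methods.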

Details

Language :
English
Database :
OpenAIRE
Journal :
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2012, pp. 4769-4772. Kyoto, Japan : IEEE
Accession number :
edsair.doi.dedup.....ada7b527c0749f5c6d8b48e1a1fa32cc