Back to Search Start Over

Predicting the effects of periodicity on the intelligibility of masked speech: An evaluation of different modelling approaches and their limitations

Authors :
Torsten Dau
Helia Relaño-Iborra
Johannes Zaar
Kurt Steinmetzger
Stuart Rosen
Source :
Steinmetzger, K, Zaar, J, Relaño-Iborra, H, Rosen, S & Dau, T 2019, ' Predicting the effects of periodicity on the intelligibility of masked speech : An evaluation of different modelling approaches and their limitations ', Journal of the Acoustical Society of America, vol. 146, no. 4, pp. 2562-2576 . https://doi.org/10.1121/1.5129050
Publication Year :
2019
Publisher :
Acoustical Society of America (ASA), 2019.

Abstract

Four existing speech intelligibility models with different theoretical assumptions were used to predict previously published behavioural data. Those data showed that complex tones with pitch-related periodicity are far less effective maskers of speech than aperiodic noise. This so-called masker-periodicity benefit (MPB) far exceeded the fluctuating-masker benefit (FMB) obtained from slow masker envelope fluctuations. In contrast, the normal-hearing listeners hardly benefitted from periodicity in the target speech. All tested models consistently underestimated MPB and FMB, while most of them also overestimated the intelligibility of vocoded speech. To understand these shortcomings, the internal signal representations of the models were analysed in detail. The best-performing model, the correlation-based version of the speech-based envelope power spectrum model (sEPSMcorr), combined an auditory processing front end with a modulation filterbank and a correlation-based back end. This model was then modified to further improve the predictions. The resulting second version of the sEPSMcorr outperformed the original model with all tested maskers and accounted for about half the MPB, which can be attributed to reduced modulation masking caused by the periodic maskers. However, as the sEPSMcorr2 failed to account for the other half of the MPB, the results also indicate that future models should consider the contribution of pitch-related effects, such as enhanced stream segregation, to further improve their predictive power.

Details

ISSN :
00014966
Volume :
146
Database :
OpenAIRE
Journal :
The Journal of the Acoustical Society of America
Accession number :
edsair.doi.dedup.....675d92ae377596a083b94686399ad342
Full Text :
https://doi.org/10.1121/1.5129050