Back to Search
Start Over
Predicting the effects of periodicity on the intelligibility of masked speech: An evaluation of different modelling approaches and their limitations
- Source :
- Steinmetzger, K, Zaar, J, Relaño-Iborra, H, Rosen, S & Dau, T 2019, ' Predicting the effects of periodicity on the intelligibility of masked speech : An evaluation of different modelling approaches and their limitations ', Journal of the Acoustical Society of America, vol. 146, no. 4, pp. 2562-2576 . https://doi.org/10.1121/1.5129050
- Publication Year :
- 2019
- Publisher :
- Acoustical Society of America (ASA), 2019.
-
Abstract
- Four existing speech intelligibility models with different theoretical assumptions were used to predict previously published behavioural data. Those data showed that complex tones with pitch-related periodicity are far less effective maskers of speech than aperiodic noise. This so-called masker-periodicity benefit (MPB) far exceeded the fluctuating-masker benefit (FMB) obtained from slow masker envelope fluctuations. In contrast, the normal-hearing listeners hardly benefitted from periodicity in the target speech. All tested models consistently underestimated MPB and FMB, while most of them also overestimated the intelligibility of vocoded speech. To understand these shortcomings, the internal signal representations of the models were analysed in detail. The best-performing model, the correlation-based version of the speech-based envelope power spectrum model (sEPSMcorr), combined an auditory processing front end with a modulation filterbank and a correlation-based back end. This model was then modified to further improve the predictions. The resulting second version of the sEPSMcorr outperformed the original model with all tested maskers and accounted for about half the MPB, which can be attributed to reduced modulation masking caused by the periodic maskers. However, as the sEPSMcorr2 failed to account for the other half of the MPB, the results also indicate that future models should consider the contribution of pitch-related effects, such as enhanced stream segregation, to further improve their predictive power.
- Subjects :
- Male
Periodicity
Sound Spectrography
Acoustics and Ultrasonics
Speech recognition
Intelligibility (communication)
01 natural sciences
Correlation
Front and back ends
030507 speech-language pathology & audiology
03 medical and health sciences
Arts and Humanities (miscellaneous)
0103 physical sciences
Humans
Speech
010301 acoustics
Mathematics
Speech Intelligibility
Spectral density
Signal Processing, Computer-Assisted
Models, Theoretical
Filter bank
Acoustic Stimulation
Aperiodic graph
Noise
0305 other medical science
Perceptual Masking
Psychoacoustics
Subjects
Details
- ISSN :
- 00014966
- Volume :
- 146
- Database :
- OpenAIRE
- Journal :
- The Journal of the Acoustical Society of America
- Accession number :
- edsair.doi.dedup.....675d92ae377596a083b94686399ad342
- Full Text :
- https://doi.org/10.1121/1.5129050