Automatic intelligibility assessment of speakers after laryngeal cancer by means of acoustic modeling.

Authors :
Bocklet T
Riedhammer K
Nöth E
Eysholdt U
Haderlein T
Source :
Journal of voice : official journal of the Voice Foundation [J Voice] 2012 May; Vol. 26 (3), pp. 390-7. Date of Electronic Publication: 2011 Aug 05.
Publication Year :
2012

Abstract

Objective: One aspect of voice and speech evaluation after laryngeal cancer is acoustic analysis. Perceptual evaluation by expert raters is a standard in the clinical environment for global criteria such as overall quality or intelligibility. So far, automatic approaches evaluate acoustic properties of pathologic voices based on voiced/unvoiced distinction and fundamental frequency analysis of sustained vowels. Because of the high amount of noisy components and the increasing aperiodicity of highly pathologic voices, a fully automatic analysis of fundamental frequency is difficult. We introduce a purely data-driven system for the acoustic analysis of pathologic voices based on recordings of a standard text.

Methods: Short-time segments of the speech signal are analyzed in the spectral domain, and speaker models based on this information are built. These speaker models act as a clustered representation of the acoustic properties of a person's voice and are thus characteristic for speakers with different kinds and degrees of pathologic conditions. The system is evaluated on two different data sets with speakers reading standardized texts. One data set contains 77 speakers after laryngeal cancer treated with partial removal of the larynx. The other data set contains 54 totally laryngectomized patients, equipped with a Provox shunt valve. Each speaker was rated by five expert listeners regarding three different criteria: strain, voice quality, and speech intelligibility.

Results/conclusion: We show correlations for each data set with r and ρ≥0.8 between the automatic system and the mean value of the five raters. The interrater correlation of one rater to the mean value of the remaining raters is in the same range. We thus assume that for selected evaluation criteria, the system can serve as a validated objective support for acoustic voice and speech analysis.

(Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.)
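
Note on the evaluation measures: the abstract reports Pearson's r and Spearman's ρ between the automatic system and the mean of the five raters, and compares these with the correlation of each rater to the mean of the remaining raters. The following is a minimal sketch of how such figures could be computed, not the authors' implementation. All function names, the `ratings[rater][speaker]` layout, and the tie handling by average ranks are assumptions made for illustration; the abstract does not specify them.

```python
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Pearson correlation coefficient r of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)


def average_ranks(values):
    """Ranks starting at 1; tied values share the average of their ranks (assumed convention)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rho, computed as Pearson's r on the rank-transformed data."""
    return pearson(average_ranks(x), average_ranks(y))


def system_vs_raters(system_scores, ratings):
    """(r, rho) between system scores and the per-speaker mean over all raters."""
    rater_mean = [mean(column) for column in zip(*ratings)]
    return pearson(system_scores, rater_mean), spearman(system_scores, rater_mean)


def leave_one_out_interrater(ratings):
    """For each rater, (r, rho) against the per-speaker mean of the remaining raters."""
    results = []
    for r, own in enumerate(ratings):
        others = [row for i, row in enumerate(ratings) if i != r]
        rest_mean = [mean(column) for column in zip(*others)]
        results.append((pearson(own, rest_mean), spearman(own, rest_mean)))
    return results
```

With five raters, `leave_one_out_interrater` returns five (r, ρ) pairs, one per held-out rater; these are the human reference values against which the output of `system_vs_raters` would be compared.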

Details

Language :
English
ISSN :
1873-4588
Volume :
26
Issue :
3
Database :
MEDLINE
Journal :
Journal of voice : official journal of the Voice Foundation
Publication Type :
Academic Journal
Accession number :
21820272
Full Text :
https://doi.org/10.1016/j.jvoice.2011.04.010