Automatic intelligibility assessment of speakers after laryngeal cancer by means of acoustic modeling.
- Source :
- Journal of voice : official journal of the Voice Foundation [J Voice] 2012 May; Vol. 26 (3), pp. 390-7. Date of Electronic Publication: 2011 Aug 05.
- Publication Year :
- 2012
Abstract
- Objective: One aspect of voice and speech evaluation after laryngeal cancer is acoustic analysis. Perceptual evaluation by expert raters is a standard in the clinical environment for global criteria such as overall quality or intelligibility. So far, automatic approaches evaluate acoustic properties of pathologic voices based on voiced/unvoiced distinction and fundamental frequency analysis of sustained vowels. Because of the high amount of noisy components and the increasing aperiodicity of highly pathologic voices, a fully automatic analysis of fundamental frequency is difficult. We introduce a purely data-driven system for the acoustic analysis of pathologic voices based on recordings of a standard text.

Methods: Short-time segments of the speech signal are analyzed in the spectral domain, and speaker models based on this information are built. These speaker models act as a clustered representation of the acoustic properties of a person's voice and are thus characteristic for speakers with different kinds and degrees of pathologic conditions. The system is evaluated on two different data sets with speakers reading standardized texts. One data set contains 77 speakers after laryngeal cancer treated with partial removal of the larynx. The other data set contains 54 totally laryngectomized patients, equipped with a Provox shunt valve. Each speaker was rated by five expert listeners regarding three different criteria: strain, voice quality, and speech intelligibility.

Results/conclusion: We show correlations for each data set with r and ρ≥0.8 between the automatic system and the mean value of the five raters. The interrater correlation of one rater to the mean value of the remaining raters is in the same range. We thus assume that for selected evaluation criteria, the system can serve as a validated objective support for acoustic voice and speech analysis.

(Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.)
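The evaluation described in the abstract compares the automatic scores with the mean of the five raters (Pearson's r and Spearman's ρ) and compares each rater with the mean of the remaining raters. The sketch below illustrates those two comparisons only. It is not the authors' implementation: the function names, the 1 to 5 rating scale, and all numbers are hypothetical, and the abstract does not state which correlation coefficient was used for the interrater comparison (Pearson's r is assumed here).

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient r of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / sqrt(sxx * syy)


def average_ranks(values):
    """Ranks starting at 1; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson's r computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def mean_rating(ratings):
    """Per-speaker mean over raters; ratings[r][s] is rater r on speaker s."""
    n_raters = len(ratings)
    return [sum(r[s] for r in ratings) / n_raters
            for s in range(len(ratings[0]))]


def leave_one_out_interrater(ratings):
    """Correlation of each rater with the mean of the remaining raters."""
    return [pearson(rater, mean_rating(ratings[:i] + ratings[i + 1:]))
            for i, rater in enumerate(ratings)]


# Hypothetical data: five raters scoring six speakers on an assumed 1-5 scale.
ratings = [
    [1, 2, 3, 4, 5, 3],
    [1, 3, 3, 4, 4, 2],
    [2, 2, 4, 5, 5, 3],
    [1, 2, 2, 4, 5, 3],
    [2, 3, 3, 3, 5, 2],
]
# Hypothetical output of an automatic system for the same six speakers.
system_scores = [1.2, 2.4, 3.1, 4.2, 4.8, 2.7]

reference = mean_rating(ratings)
print("r   (system vs. mean rater):", round(pearson(system_scores, reference), 3))
print("rho (system vs. mean rater):", round(spearman(system_scores, reference), 3))
print("rater vs. remaining raters: ",
      [round(r, 3) for r in leave_one_out_interrater(ratings)])
```

Reporting the leave-one-out interrater values next to the system's correlation is what allows a statement such as "in the same range": the system is judged against the agreement that human raters reach among themselves.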
- Subjects :
- Adult
Aged
Aged, 80 and over
Automation
Germany
Humans
Laryngeal Neoplasms complications
Laryngeal Neoplasms physiopathology
Larynx, Artificial
Middle Aged
Observer Variation
Predictive Value of Tests
Reading
Regression Analysis
Reproducibility of Results
Signal Processing, Computer-Assisted
Speech, Alaryngeal instrumentation
Time Factors
Treatment Outcome
Voice Disorders diagnosis
Voice Disorders etiology
Voice Disorders physiopathology
Laryngeal Neoplasms surgery
Laryngectomy adverse effects
Models, Statistical
Speech Acoustics
Speech Intelligibility
Speech Production Measurement methods
Voice Disorders surgery
Voice Quality
Details
- Language :
- English
- ISSN :
- 1873-4588
- Volume :
- 26
- Issue :
- 3
- Database :
- MEDLINE
- Journal :
- Journal of voice : official journal of the Voice Foundation
- Publication Type :
- Academic Journal
- Accession number :
- 21820272
- Full Text :
- https://doi.org/10.1016/j.jvoice.2011.04.010