Back to Search
Start Over
Automatic indexing of speech segments with spontaneity levels on large audio database
- Source :
- ACM Workshop on Searching Spontaneous Conversational Speech, ACM Workshop on Searching Spontaneous Conversational Speech, 2010, Firenze, Italy
- Publication Year :
- 2010
- Publisher :
- ACM, 2010.
-
Abstract
- Spontaneous speech detection from a large audio database can be useful for different applications. For example, processing spontaneous speech is one of the many challenges that Automatic Speech Recognition (ASR) systems have to deal with. Spontaneous speech detection can also be an informative descriptor for information retrieval.The main evidences characterizing spontaneous speech are disfluencies (filled pause, repetition, repair and false start) and many studies have focused on the detection and the correction of these disfluencies. In this study1 we define spontaneous speech as unprepared speech, in opposition to prepared speech where utterances contain well-formed sentences close to those that can be found in written documents. Disfluencies are of course very good indicators of unprepared speech, however they are not the only ones: ungrammaticality and language register are also important as well as prosodic patterns. This paper proposes a set of acoustic and linguistic features that can be used for characterizing and detecting spontaneous speech segments from large audio databases, and proposes a method to extract and to exploit these features in order to index audio documents with three speech spontaneity levels.
- Subjects :
- Audio mining
Speech production
Computer science
Speech recognition
Speech synthesis
02 engineering and technology
computer.software_genre
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
030507 speech-language pathology & audiology
03 medical and health sciences
0202 electrical engineering, electronic engineering, information engineering
Speech analytics
ComputingMilieux_MISCELLANEOUS
Voice activity detection
Database
business.industry
Acoustic model
020206 networking & telecommunications
Speech corpus
Speech processing
[INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL]
Artificial intelligence
0305 other medical science
business
computer
Natural language processing
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- Proceedings of the 2010 international workshop on Searching spontaneous conversational speech
- Accession number :
- edsair.doi.dedup.....73fad90bbf5f310e38ffc06ee28a200f
- Full Text :
- https://doi.org/10.1145/1878101.1878110