Back to Search Start Over

Structural and Semantic Modeling of Audio for Content-Based Querying and Browsing.

Authors :
Larsen, Henrik Legind
Pasi, Gabriella
Ortiz-Arroyo, Daniel
Andreasen, Troels
Christiansen, Henning
Sert, Mustafa
Baykal, Buyurman
Yazıcı, Adnan
Source :
Flexible Query Answering Systems (9783540346388); 2006, p319-330, 12p
Publication Year :
2006

Abstract

A typical content-based audio management system deals with three aspects namely audio segmentation and classification, audio analysis, and content-based retrieval of audio. In this paper, we integrate the three aspects of content-based audio management into a single framework and propose an efficient method for flexible querying and browsing of auditory data. More specifically, we utilize two robust feature sets namely MPEG-7 Audio Spectrum Flatness (ASF) and Mel Frequency Cepstral Coefficients (MFCC) as the underlying features in order to improve the content-based retrieval accuracy, since both features have some advantages for distinct types of audio (e.g., music and speech). The proposed system provides a wide range of opportunities to query and browse an audio data by content, such as querying and browsing for a chorus section, sound effects, and query-by-example. In addition, the clients can express their queries in the form of point, range, and k-nearest neighbor, which are particularly significant in the multimedia domain. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISBNs :
9783540346388
Database :
Complementary Index
Journal :
Flexible Query Answering Systems (9783540346388)
Publication Type :
Book
Accession number :
32903711
Full Text :
https://doi.org/10.1007/11766254_27