Back to Search Start Over

Metrics for Polyphonic Sound Event Detection

Authors :
Annamaria Mesaros
Toni Heittola
Tuomas Virtanen
Source :
Applied Sciences, Vol 6, Iss 6, p 162 (2016)
Publication Year :
2016
Publisher :
MDPI AG, 2016.

Abstract

This paper presents and discusses various metrics proposed for evaluation of polyphonic sound event detection systems used in realistic situations where there are typically multiple sound sources active simultaneously. The system output in this case contains overlapping events, marked as multiple sounds detected as being active at the same time. The polyphonic system output requires a suitable procedure for evaluation against a reference. Metrics from neighboring fields such as speech recognition and speaker diarization can be used, but they need to be partially redefined to deal with the overlapping events. We present a review of the most common metrics in the field and the way they are adapted and interpreted in the polyphonic case. We discuss segment-based and event-based definitions of each metric and explain the consequences of instance-based and class-based averaging using a case study. In parallel, we provide a toolbox containing implementations of presented metrics.

Details

Language :
English
ISSN :
20763417
Volume :
6
Issue :
6
Database :
Directory of Open Access Journals
Journal :
Applied Sciences
Publication Type :
Academic Journal
Accession number :
edsdoj.75a0b3c4043c4f23a4dda15c5db850d0
Document Type :
article
Full Text :
https://doi.org/10.3390/app6060162