Multimodal Evaluation Method for Sound Event Detection - Université Paris-Est-Créteil-Val-de-Marne Access content directly
Conference Papers Year : 2022

Multimodal Evaluation Method for Sound Event Detection


Time is an important dimension in sound event detection (SED) systems. However, evaluating the performance of SED systems is directly taken from the classical machine learning domain, and they are not well adapted to the needs of these systems such as recognizing the time, duration, detection, and uniformity of sound events. Despite its importance, it is not well-developed yet. Current methods are highly biased by their assumptions and may misleadingly present convincible results. This paper presents a novel multimodal method to evaluate SED systems from multiple perspectives such as detection, total duration, relative duration, and uniformity. Furthermore, the proposed method is simple, time-efficient, visualizable, extensible, open-source, and overcomes the limitations of existing methods. The benefits of the proposed approach are demonstrated by re-evaluating the best systems presented in a known challenge on sound event detection.
No file

Dates and versions

hal-04063311 , version 1 (08-04-2023)



Seyed M.R. Modaresi, Aomar Osmani, Mohammadreza Razzazi, Abdelghani Chibani. Multimodal Evaluation Method for Sound Event Detection. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2022, Singapore, Singapore. pp.1026-1030, ⟨10.1109/ICASSP43922.2022.9746906⟩. ⟨hal-04063311⟩


17 View
0 Download



Gmail Facebook X LinkedIn More