TY - GEN
T1 - Sound event detection with soft labels
T2 - European Signal Processing Conference
AU - Harju, Manu
AU - Martín-Morató, Irene
AU - Heittola, Toni
AU - Mesaros, Annamaria
N1 - Publisher Copyright:
© 2024 European Signal Processing Conference, EUSIPCO. All rights reserved.
PY - 2024
Y1 - 2024
N2 - Sound event detection has been an essential task in the DCASE Challenge since the beginning, with various alterations over the years. The 2023 Challenge presented for the first time a sound event detection task for which the reference labels representing sound class activity were provided as real numbers on the interval from zero to one, in addition to binary labels. In this paper we provide an overview of the sound event detection with soft labels task in DCASE 2023 Challenge, and re-evaluate the challenge submissions using a soft metric. The use of a soft metric allows computing precision, recall and F-score directly using the soft labels, and thus avoids the optimization step for binarizing both the reference and predictions using a threshold. We analyze the behavior of the soft metric on a large number of systems, and show that for the softly labeled reference data, the results obtained with the soft metrics represent very well the system’s ability to follow the data, and is a good proxy for entropy-based measures.
AB - Sound event detection has been an essential task in the DCASE Challenge since the beginning, with various alterations over the years. The 2023 Challenge presented for the first time a sound event detection task for which the reference labels representing sound class activity were provided as real numbers on the interval from zero to one, in addition to binary labels. In this paper we provide an overview of the sound event detection with soft labels task in DCASE 2023 Challenge, and re-evaluate the challenge submissions using a soft metric. The use of a soft metric allows computing precision, recall and F-score directly using the soft labels, and thus avoids the optimization step for binarizing both the reference and predictions using a threshold. We analyze the behavior of the soft metric on a large number of systems, and show that for the softly labeled reference data, the results obtained with the soft metrics represent very well the system’s ability to follow the data, and is a good proxy for entropy-based measures.
KW - soft labels
KW - sound event detection
U2 - 10.23919/EUSIPCO63174.2024.10715239
DO - 10.23919/EUSIPCO63174.2024.10715239
M3 - Conference contribution
AN - SCOPUS:85208426354
T3 - European Signal Processing Conference
SP - 66
EP - 70
BT - 2024 32nd European Signal Processing Conference (EUSIPCO)
PB - IEEE
Y2 - 26 August 2024 through 30 August 2024
ER -