TY - GEN
T1 - Crowdsourcing strong labels for sound event detection
AU - Martin Morato, Irene
AU - Harju, Manu
AU - Mesaros, Annamaria
PY - 2021/12/13
Y1 - 2021/12/13
N2 - Strong labels are a necessity for evaluation of sound event detection methods, but often scarcely available due to the high resources required by the annotation task. We present a method for estimating strong labels using crowdsourced weak labels, through a process that divides the annotation task into simple unit tasks. Based on estimations of annotators' competence, aggregation and processing of the weak labels results in a set of objective strong labels. The experiment uses synthetic audio in order to verify the quality of the resulting annotations through comparison with ground truth. The proposed method produces labels with high precision, though not all event instances are recalled. Detection metrics comparing the produced annotations with the ground truth show 80% F-score in 1 s segments, and up to 89.5% intersection-based F1-score calculated according to the polyphonic sound detection score metrics.
AB - Strong labels are a necessity for evaluation of sound event detection methods, but often scarcely available due to the high resources required by the annotation task. We present a method for estimating strong labels using crowdsourced weak labels, through a process that divides the annotation task into simple unit tasks. Based on estimations of annotators' competence, aggregation and processing of the weak labels results in a set of objective strong labels. The experiment uses synthetic audio in order to verify the quality of the resulting annotations through comparison with ground truth. The proposed method produces labels with high precision, though not all event instances are recalled. Detection metrics comparing the produced annotations with the ground truth show 80% F-score in 1 s segments, and up to 89.5% intersection-based F1-score calculated according to the polyphonic sound detection score metrics.
U2 - 10.1109/WASPAA52581.2021.9632761
DO - 10.1109/WASPAA52581.2021.9632761
M3 - Conference contribution
SN - 978-1-6654-4871-0
BT - 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
T2 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
Y2 - 17 October 2021 through 20 October 2021
ER -