Training Sound Event Detection with Soft Labels from Crowdsourced Annotations

Research output: Conference article, scientific, peer-reviewed


Abstract

In this paper, we study the use of soft labels to train a system for sound event detection (SED). Soft labels can result from annotations that account for human uncertainty about categories, or emerge as a natural representation of multiple opinions in annotation. Converting annotations to hard labels yields unambiguous categories for training, at the cost of losing the details of the label distribution. This work investigates how soft labels can be used and what benefits they bring in training a SED system. The results show that the system is capable of learning the information about sound activity that is reflected in the soft labels, and that it is able to detect sounds that are missed in the typical binary target training setup. We also release a new dataset produced through crowdsourcing, containing temporally strong labels for sound events in real-life recordings, with both soft and hard labels.
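The contrast between soft and hard labels described in the abstract can be illustrated with a minimal sketch. It assumes that a soft label is the plain fraction of annotators who marked an event as active in a frame, that hard labels come from thresholding at 0.5, and that training uses binary cross-entropy against the targets. None of these choices is stated in the abstract; the function names and the vote matrix are hypothetical and only serve the illustration.

```python
import math

def soft_labels(votes):
    """Average binary annotator votes per frame into a value in [0, 1].

    Assumption: plain averaging. The paper may aggregate opinions differently.
    """
    n_annotators = len(votes)
    n_frames = len(votes[0])
    return [sum(v[t] for v in votes) / n_annotators for t in range(n_frames)]

def hard_labels(soft, threshold=0.5):
    """Binarize soft labels. The 0.5 threshold is an assumption."""
    return [1.0 if p >= threshold else 0.0 for p in soft]

def bce(targets, predictions, eps=1e-7):
    """Mean binary cross-entropy; targets may be soft (any value in [0, 1])."""
    total = 0.0
    for y, p in zip(targets, predictions):
        p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
        total += -(y * math.log(p) + (1.0 - y) * math.log(1.0 - p))
    return total / len(targets)

# Hypothetical votes: 5 annotators (rows) x 4 time frames (columns), one event class.
votes = [
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 0],
    [1, 0, 0, 0],
    [1, 1, 0, 0],
]

soft = soft_labels(votes)  # [1.0, 0.6, 0.2, 0.0]
hard = hard_labels(soft)   # [1.0, 1.0, 0.0, 0.0]
```

In the third frame one annotator out of five heard the event. The hard target discards that opinion entirely, while the soft target keeps it as 0.2, which is the kind of label-distribution detail the abstract says is lost in conversion.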
Original language: English
Title of host publication: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Publisher: IEEE
ISBN (electronic): 978-1-7281-6327-7
Status: Published - 2023
OKM publication type: A4 Article in conference proceedings
Event: IEEE International Conference on Acoustics, Speech, and Signal Processing - Rhodes Island, Greece
Duration: 4 Jun 2023 to 10 Jun 2023

Publication series

Name: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
ISSN (electronic): 2379-190X

Conference

Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing
Country/Territory: Greece
City: Rhodes Island
Period: 4/06/23 to 10/06/23

Publication Forum level

  • Jufo level 2
