Data-efficient low-complexity acoustic scene classification in the DCASE 2024 challenge

Florian Schmid, Paul Primus, Toni Heittola, Annamaria Mesaros, Irene Martin Morato, Khaled Koutini, Gerhard Widmer

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review


Abstract

This article describes the Data-Efficient Low-Complexity Acoustic Scene Classification Task in the DCASE 2024 Challenge and the corresponding baseline system. The task setup is a continuation of previous editions (2022 and 2023), which focused on recording device mismatches and low-complexity constraints. This year’s edition introduces an additional real-world problem: participants must develop data-efficient systems for five scenarios, which progressively limit the available training data. The provided baseline system is based on an efficient, factorized CNN architecture constructed from inverted residual blocks and uses Freq-MixStyle to tackle the device mismatch problem. The task received 37 submissions from 17 teams, with the large majority of systems outperforming the baseline. The top-ranked system’s accuracy ranges from 54.3% on the smallest to 61.8% on the largest subset, corresponding to relative improvements of approximately 23% and 9% over the baseline system on the evaluation set.
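The relative improvements quoted in the abstract follow the usual definition (system accuracy minus baseline accuracy, divided by baseline accuracy). The short Python sketch below illustrates that arithmetic. The function names are hypothetical, and the baseline accuracies it derives are back-of-the-envelope values implied by the rounded figures in the abstract, not numbers reported on this page.

```python
def relative_improvement(system_acc: float, baseline_acc: float) -> float:
    """Relative gain of a system over a baseline, as a fraction (0.23 = 23%)."""
    return (system_acc - baseline_acc) / baseline_acc


def implied_baseline(system_acc: float, rel_improvement: float) -> float:
    """Invert the definition above: baseline = system / (1 + relative gain)."""
    return system_acc / (1.0 + rel_improvement)


# Figures taken from the abstract (accuracy in %, relative gains as fractions).
top_smallest, gain_smallest = 54.3, 0.23
top_largest, gain_largest = 61.8, 0.09

# ASSUMPTION: the abstract gives only approximate gains, so these baseline
# accuracies are rough estimates, not official challenge results.
approx_baseline_smallest = implied_baseline(top_smallest, gain_smallest)
approx_baseline_largest = implied_baseline(top_largest, gain_largest)

print(f"implied baseline, smallest subset: ~{approx_baseline_smallest:.1f}%")
print(f"implied baseline, largest subset:  ~{approx_baseline_largest:.1f}%")
```

Read this way, the larger relative gain on the smallest subset reflects both a lower baseline accuracy and a wider absolute gap when training data is scarce.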
Original language: English
Title of host publication: Proceedings of the 9th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2024)
Place of Publication: Tokyo, Japan
Publisher: DCASE
Pages: 136-140
ISBN (Electronic): 978-952-03-3171-9
Publication status: Published - Oct 2024
Publication type: A4 Article in conference proceedings
Event: Workshop on Detection and Classification of Acoustic Scenes and Events - Tokyo, Japan
Duration: 23 Oct 2024 to 25 Oct 2024
https://dcase.community/workshop2024/

Workshop

Workshop: Workshop on Detection and Classification of Acoustic Scenes and Events
Abbreviated title: DCASE2024
Country/Territory: Japan
City: Tokyo
Period: 23/10/24 to 25/10/24

Publication forum classification

  • Publication forum level 1
