Abstrakti
This paper presents an analysis of the Low-Complexity Acoustic Scene Classification task in DCASE 2022 Challenge. The task was a continuation from the previous years, but the low-complexity requirements were changed to the following: the maximum number of allowed parameters, including the zero-valued ones, was 128 K, with parameters being represented using INT8 numerical format; and the maximum number of multiply-accumulate operations at inference time was 30 million. Despite using the same previous year dataset, the audio samples have been shortened to 1 second instead of 10 second for this year challenge. The provided baseline system is a convolutional neural network which employs post-training quantization of parameters, resulting in 46.5 K parameters, and 29.23 million multiply-and-accumulate operations (MMACs). Its performance on the evaluation data is 44.2% accuracy and 1.532 log-loss. In comparison, the top system in the challenge obtained an accuracy of 59.6% and a log loss of 1.091, having 121 K parameters and 28 MMACs. The task received 48 submissions from 19 different teams, most of which outperformed the baseline system.
Alkuperäiskieli | Englanti |
---|---|
Otsikko | Proceedings of the 7th Workshop on Detection and Classication of Acoustic Scenes and Events (DCASE 2022) |
Toimittajat | Mathieu Lagrange, Annamaria Mesaros, Thomas Pellegrini, Gaël Richard, Romain Serizel, Dan Stowell |
Kustantaja | DCASE |
Sivut | 111-115 |
ISBN (elektroninen) | 978-952-03-2677-7 |
Tila | Julkaistu - 3 marrask. 2022 |
OKM-julkaisutyyppi | A4 Artikkeli konferenssijulkaisussa |
Tapahtuma | Workshop on Detection and Classification of Acoustic Scenes and Events - Nancy, Ranska Kesto: 3 marrask. 2022 → 4 marrask. 2022 https://dcase.community/workshop2022/ |
Conference
Conference | Workshop on Detection and Classification of Acoustic Scenes and Events |
---|---|
Lyhennettä | DCASE |
Maa/Alue | Ranska |
Kaupunki | Nancy |
Ajanjakso | 3/11/22 → 4/11/22 |
www-osoite |
Julkaisufoorumi-taso
- Jufo-taso 1