Acoustic Scene Classification Across Multiple Devices Through Incremental Learning of Device-Specific Domains

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

11 Downloads (Pure)

Abstract

In this paper, we propose using a domain-incremental learning approach for coping with different devices in acoustic scene classification. While the typical way to handle mismatched training data is through domain adaptation or specific regularization techniques, incremental learning offers a different approach. With this technique, it is possible to learn the characteristics of new devices on-the-go, adding to a previously trained model. This also means that new device data can be introduced at any time, without a need to retrain the original model. In terms of incremental learning, we propose a combination of domain-specific Low-Rank Adaptation (LoRA) parameters and running statistics of Batch Normalization (BN) layers. LoRA adds low-rank decomposition matrices to a convolutional layer with a few trainable parameters for each new device, while domain-specific BN is used to boost performance. Experiments are conducted on the TAU Urban Acoustic Scenes 2020 Mobile development dataset, containing 9 different devices; we train the system using the 40h of data available for the main device, and incrementally learn the domains of the other 8 devices based on 3h of data available for each. We show that the proposed approach outperforms other fine-tuning-based methods, and is outperformed only by joint learning with all data from all devices.
Original languageEnglish
Title of host publicationProceedings of the Detection and Classification of Acoustic Scenes and Events 2024 Workshop (DCASE2024)
PublisherDCASE
Pages81-85
Number of pages4
ISBN (Electronic)978-952-03-3171-9
Publication statusPublished - 2024
Publication typeA4 Article in conference proceedings
EventWorkshop on Detection and Classification of Acoustic Scenes and Events - Tokyo, Japan
Duration: 23 Oct 202425 Oct 2024
https://dcase.community/workshop2024/

Workshop

WorkshopWorkshop on Detection and Classification of Acoustic Scenes and Events
Abbreviated titleDCASE2024
Country/TerritoryJapan
CityTokyo
Period23/10/2425/10/24
Internet address

Publication forum classification

  • Publication forum level 1

Fingerprint

Dive into the research topics of 'Acoustic Scene Classification Across Multiple Devices Through Incremental Learning of Device-Specific Domains'. Together they form a unique fingerprint.

Cite this