Skip to main navigation Skip to search Skip to main content

Class-Incremental Learning for Multi-Label Audio Classification

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

20 Citations (Scopus)
37 Downloads (Pure)

Abstract

In this paper, we propose a method for class-incremental learning of potentially overlapping sounds for solving a sequence of multi-label audio classification tasks. We design an incremental learner that learns new classes independently of the old classes. To preserve knowledge about the old classes, we propose a cosine similarity-based distillation loss that minimizes discrepancy in the feature representations of subsequent learners, and use it along with a Kullback-Leibler divergence-based distillation loss that minimizes discrepancy in their respective outputs. Experiments are performed on a dataset with 50 sound classes, with an initial classification task containing 30 base classes and 4 incremental phases of 5 classes each. After each phase, the system is tested for multi-label classification with the entire set of classes learned so far. The proposed method obtains an average F1-score of 40.9% over the five phases, ranging from 45.2% in phase 0 on 30 classes, to 36.3% in phase 4 on 50 classes. Average performance degradation over incremental phases is only 0.7 percentage points from the initial F1-score of 45.2%.

Original languageEnglish
Title of host publication2024 IEEE International Conference on Acoustics, Speech, and Signal Processing
Subtitle of host publicationProceedings
PublisherIEEE
Pages916-920
Number of pages5
ISBN (Electronic)9798350344851
DOIs
Publication statusPublished - 2024
Publication typeA4 Article in conference proceedings
EventIEEE International Conference on Acoustics, Speech, and Signal Processing - Seoul, Korea, Republic of
Duration: 14 Apr 202419 Apr 2024

Publication series

NameProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
ISSN (Print)1520-6149

Conference

ConferenceIEEE International Conference on Acoustics, Speech, and Signal Processing
Country/TerritoryKorea, Republic of
CitySeoul
Period14/04/2419/04/24

Funding

This work was supported in part by Academy of Finland grant 332063 \u201CTeaching machines to listen\u201D. The authors wish to thank CSC-IT Centre of Science Ltd., Finland, for providing computational resources.

FundersFunder number
CSC-IT Centre of Science Ltd.
Strategic Research Council at the Research Council of Finland332063
Strategic Research Council at the Research Council of Finland

    Keywords

    • Class-incremental learning
    • independent learning
    • knowledge transfer
    • multi-label audio classification

    Publication forum classification

    • Publication forum level 2

    ASJC Scopus subject areas

    • Software
    • Signal Processing
    • Electrical and Electronic Engineering

    Fingerprint

    Dive into the research topics of 'Class-Incremental Learning for Multi-Label Audio Classification'. Together they form a unique fingerprint.

    Cite this