Sound Event Envelope Estimation in Polyphonic Mixtures

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review

10 Citations (Scopus)
22 Downloads (Pure)

Abstract

Sound event detection is the task of automatically identifying the presence and temporal boundaries of sound events within an input audio stream. In recent years, deep learning methods have established themselves as the state-of-the-art approach for the task, using binary indicators during training to denote whether an event is active or inactive. However, such binary activity indicators do not fully describe the events, and estimating the envelope of the sounds could provide more precise modeling of their activity. This paper proposes to estimate the amplitude envelopes of target sound event classes in polyphonic mixtures. For training, we use the amplitude envelopes of the target sounds, calculated from mixture signals and, for comparison, from their isolated counterparts. The model is then used to perform envelope estimation and sound event detection. Results show that envelope estimation allows good modeling of sound activity, with detection results comparable to the current state of the art.
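
As a rough illustration of the quantities the abstract refers to, the sketch below computes a frame-wise amplitude envelope of a signal and thresholds it into a binary activity indicator. It is not the method of the paper: the abstract does not say how the envelopes are calculated, so the RMS definition, the frame length, the hop size, the threshold, and the function names `rms_envelope` and `envelope_to_activity` are all assumptions made for this example. In the paper, the envelope is estimated by a trained model from the polyphonic mixture; here it is simply measured on a toy signal.

```python
import math


def rms_envelope(signal, frame_len, hop):
    """Frame-wise RMS amplitude envelope (an assumed envelope definition)."""
    envelope = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        envelope.append(math.sqrt(sum(x * x for x in frame) / frame_len))
    return envelope


def envelope_to_activity(envelope, threshold):
    """Binarize an envelope: 1 where it exceeds the threshold, else 0."""
    return [1 if value > threshold else 0 for value in envelope]


# Toy signal: silence, a 440 Hz tone burst, then silence again.
# The sample rate and amplitudes are illustrative values only.
sr = 8000
tone = [0.5 * math.sin(2 * math.pi * 440 * n / sr) for n in range(sr // 2)]
signal = [0.0] * (sr // 4) + tone + [0.0] * (sr // 4)

# Non-overlapping 50 ms frames (400 samples at 8 kHz), hypothetical settings.
env = rms_envelope(signal, frame_len=400, hop=400)
activity = envelope_to_activity(env, threshold=0.1)
```

The envelope keeps the level of the event in each frame, while the thresholded indicator only records whether the event is active, which is the distinction the abstract draws between envelope targets and binary activity targets.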
Original language: English
Title of host publication: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Publisher: IEEE
Pages: 935-939
Number of pages: 5
ISBN (Electronic): 978-1-4799-8131-1
ISBN (Print): 978-1-4799-8132-8
Publication status: Published - 17 Apr 2019
Publication type: A4 Article in conference proceedings
Event: IEEE International Conference on Acoustics, Speech and Signal Processing

Publication series

Name: IEEE International Conference on Acoustics, Speech and Signal Processing
ISSN (Print): 1520-6149
ISSN (Electronic): 2379-190X

Conference

Conference: IEEE International Conference on Acoustics, Speech and Signal Processing

Keywords

  • acoustic signal detection
  • acoustic signal processing
  • learning (artificial intelligence)
  • sound event envelope estimation
  • polyphonic mixtures
  • sound event detection
  • input audio stream
  • deep learning methods
  • binary activity indicators
  • amplitude envelopes
  • target sound event classes
  • sounds activity
  • Training
  • Estimation
  • Event detection
  • Acoustics
  • Signal to noise ratio
  • Automobiles
  • Dogs
  • Sound event detection
  • Envelope estimation
  • Deep Neural Networks

Publication forum classification

  • Publication forum level 1
