Phase reconstruction of spectrograms based on a model of repeated audio events

Paul Magron, Roland Badeau, Bertrand David

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

2 Citations (Scopus)

Abstract

Phase recovery of modified spectrograms is a major issue in audio signal processing applications, such as source separation. This paper introduces a novel technique for estimating the phases of components in complex mixtures within onset frames in the Time-Frequency (TF) domain. We propose to exploit the phase repetitions from one onset frame to another. We introduce a reference phase which characterizes a component independently of its activation times. The onset phases of a component are then modeled as the sum of this reference and an offset which is linearly dependent on the frequency. We derive a complex mixture model within onset frames and we provide two algorithms for the estimation of the model phase parameters. The model is estimated on experimental data and this technique is integrated into an audio source separation framework. The results demonstrate that this model is a promising tool for exploiting phase repetitions, and point out its potential for separating overlapping components in complex mixtures.
Original languageEnglish
Title of host publicationWorkshop on Applications of Signal Processing to Audio and Acoustics
PublisherIEEE
Number of pages5
ISBN (Electronic)978-1-4799-7450-4
Publication statusPublished - Oct 2015
Externally publishedYes
Publication typeA4 Article in conference proceedings
EventIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) - Mohonk Mountain House, New Paltz, United States
Duration: 18 Oct 201521 Oct 2015

Conference

ConferenceIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
Country/TerritoryUnited States
CityNew Paltz
Period18/10/1521/10/15

Fingerprint

Dive into the research topics of 'Phase reconstruction of spectrograms based on a model of repeated audio events'. Together they form a unique fingerprint.

Cite this