Complex NMF under phase constraints based on signal modeling: application to audio source separation

Paul Magron, Roland Badeau, Bertrand David

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

12 Citations (Scopus)

Abstract

Nonnegative Matrix Factorization (NMF) is a powerful tool for de- composing mixtures of audio signals in the Time-Frequency (TF) domain. In the source separation framework, the phase recovery for each extracted component is necessary for synthesizing time-domain signals. The Complex NMF (CNMF) model aims to jointly estimate the spectrogram and the phase of the sources, but requires to con- strain the phase in order to produce satisfactory sounding results. We propose to incorporate phase constraints based on signal models within the CNMF framework: a phase unwrapping constraint that enforces a form of temporal coherence, and a constraint based on the repetition of audio events, which models the phases of the sources within onset frames. We also provide an algorithm for estimating the model parameters. The experimental results highlight the interest of including such constraints in the CNMF framework for separating overlapping components in complex audio mixtures.
Original languageEnglish
Title of host publicationInternational Conference on Acoustics, Speech and Signal Processing
Pages46-50
Number of pages5
ISBN (Electronic)978-1-4799-9988-0
Publication statusPublished - Mar 2016
Externally publishedYes
Publication typeA4 Article in conference proceedings
Event41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 - Shanghai, China
Duration: 20 Mar 201625 Mar 2016

Conference

Conference41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016
Country/TerritoryChina
CityShanghai
Period20/03/1625/03/16

Fingerprint

Dive into the research topics of 'Complex NMF under phase constraints based on signal modeling: application to audio source separation'. Together they form a unique fingerprint.

Cite this