Abstract
Complex nonnegative matrix factorization (NMF) is a powerful tool for decomposing audio spectrograms while accounting for some phase information in the time-frequency domain. While its estimation was originally based on the Euclidean distance, in this paper we propose to extend it to any beta-divergence, a family of functions widely used in audio to estimate NMF. To this end, we introduce the beta-divergence in a heuristic fashion within a phase-aware probabilistic model. Estimating this model results in performing an NMF with Itakura-Saito (IS) divergence on a quantity called the phase-corrected posterior power of the sources, which is both phase-dependent and nonnegative-valued. Therefore, we replace IS with the beta-divergence, so that the factorization uses an optimal distortion metric and remains phase-aware. Even though by doing so we loose theoretical convergence guarantees, the resulting algorithm demonstrates its potential for an audio source separation task, where it outperforms previous complex NMFs approaches.
| Original language | English |
|---|---|
| Title of host publication | 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC) |
| Publisher | IEEE |
| Pages | 156-160 |
| ISBN (Electronic) | 978-1-5386-8151-0 |
| ISBN (Print) | 978-1-5386-8152-7 |
| DOIs | |
| Publication status | Published - Sept 2018 |
| Publication type | A4 Article in conference proceedings |
| Event | International Workshop on Acoustic Signal Enhancement - Duration: 1 Jan 1900 → … |
Conference
| Conference | International Workshop on Acoustic Signal Enhancement |
|---|---|
| Period | 1/01/00 → … |
Publication forum classification
- Publication forum level 1
Fingerprint
Dive into the research topics of 'Towards Complex Nonnegative Matrix Factorization with the Beta-Divergence'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver