Abstrakti
Complex nonnegative matrix factorization (NMF) is a powerful tool for decomposing audio spectrograms while accounting for some phase information in the time-frequency domain. While its estimation was originally based on the Euclidean distance, in this paper we propose to extend it to any beta-divergence, a family of functions widely used in audio to estimate NMF. To this end, we introduce the beta-divergence in a heuristic fashion within a phase-aware probabilistic model. Estimating this model results in performing an NMF with Itakura-Saito (IS) divergence on a quantity called the phase-corrected posterior power of the sources, which is both phase-dependent and nonnegative-valued. Therefore, we replace IS with the beta-divergence, so that the factorization uses an optimal distortion metric and remains phase-aware. Even though by doing so we loose theoretical convergence guarantees, the resulting algorithm demonstrates its potential for an audio source separation task, where it outperforms previous complex NMFs approaches.
Alkuperäiskieli | Englanti |
---|---|
Otsikko | 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC) |
Kustantaja | IEEE |
Sivut | 156-160 |
ISBN (elektroninen) | 978-1-5386-8151-0 |
ISBN (painettu) | 978-1-5386-8152-7 |
DOI - pysyväislinkit | |
Tila | Julkaistu - syysk. 2018 |
OKM-julkaisutyyppi | A4 Artikkeli konferenssijulkaisussa |
Tapahtuma | International Workshop on Acoustic Signal Enhancement - Kesto: 1 tammik. 1900 → … |
Conference
Conference | International Workshop on Acoustic Signal Enhancement |
---|---|
Ajanjakso | 1/01/00 → … |
Julkaisufoorumi-taso
- Jufo-taso 1