Abstract
In this paper, we study statistical models for the phase of the short-term Fourier transform (STFT) of audio signals. STFT phase globally appears as uniformly distributed, which has led researchers in this field to model it as a uniform random variable. However, some information about the phase can be obtained from a sinusoidal model, which reveals its local structure. Therefore, we propose to model the phase with a von Mises (VM) random variable, which enables us to favor the sinusoidal model-based phase value. We estimate the distribution parameters and we validate this model on real audio data. In particular, we observe that both models (uniform and VM) are relevant from a statistical perspective but they convey different information about the phase (global vs. local). We also apply this VM model to an audio source separation task, where it outperforms previous approaches.
Original language | English |
---|---|
Title of host publication | 16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018 |
Publisher | IEEE |
ISBN (Electronic) | 9781538681510 |
DOIs | |
Publication status | Published - 2 Nov 2018 |
Publication type | A4 Article in conference proceedings |
Event | International Workshop on Acoustic Signal Enhancement - Tokyo, Japan Duration: 17 Sept 2018 → 20 Sept 2018 |
Conference
Conference | International Workshop on Acoustic Signal Enhancement |
---|---|
Country/Territory | Japan |
City | Tokyo |
Period | 17/09/18 → 20/09/18 |
Publication forum classification
- Publication forum level 1