On modeling the STFT phase of audio signals with the von Mises distribution

Paul Magron, Tuomas Virtanen

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

    3 Citations (Scopus)

    Abstract

    In this paper, we study statistical models for the phase of the short-term Fourier transform (STFT) of audio signals. STFT phase globally appears as uniformly distributed, which has led researchers in this field to model it as a uniform random variable. However, some information about the phase can be obtained from a sinusoidal model, which reveals its local structure. Therefore, we propose to model the phase with a von Mises (VM) random variable, which enables us to favor the sinusoidal model-based phase value. We estimate the distribution parameters and we validate this model on real audio data. In particular, we observe that both models (uniform and VM) are relevant from a statistical perspective but they convey different information about the phase (global vs. local). We also apply this VM model to an audio source separation task, where it outperforms previous approaches.
    Original languageEnglish
    Title of host publication16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018
    PublisherIEEE
    ISBN (Electronic)9781538681510
    DOIs
    Publication statusPublished - 2 Nov 2018
    Publication typeA4 Article in conference proceedings
    EventInternational Workshop on Acoustic Signal Enhancement - Tokyo, Japan
    Duration: 17 Sept 201820 Sept 2018

    Conference

    ConferenceInternational Workshop on Acoustic Signal Enhancement
    Country/TerritoryJapan
    CityTokyo
    Period17/09/1820/09/18

    Publication forum classification

    • Publication forum level 1

    Fingerprint

    Dive into the research topics of 'On modeling the STFT phase of audio signals with the von Mises distribution'. Together they form a unique fingerprint.

    Cite this