Source Separation and Reconstruction of Spatial Audio Using Spectrogram Factorization

Joonas Nikunen, Tuomas Virtanen

    Tutkimustuotos: LukuScientificvertaisarvioitu

    1 Sitaatiot (Scopus)

    Abstrakti

    This chapter introduces methods for factorizing the spectrogram of multichannel audio into repetitive spectral objects and apply the introduced models to the analysis of spatial audio and modification of spatial sound through source separation. The purpose of decomposing an audio spectrogram using spectral templates is to learn the underlying structures (audio objects) from the observed data. The chapter discusses two main scenarios such as parameterization of multichannel surround sound and parameterization of microphone array signals. It explains the principles of source separation by time-frequency filtering using separation masks constructed from the spectrogram models. The chapter introduces a spatial covariance matrix model based on the directions of arrival of sound events and spectral templates, and discusses its relationship to conventional spatial audio signal processing. Source separation using spectrogram factorization models is achieved via time- frequency filtering of the original observation short-time Fourier transform (STFT) by a generalized Wiener filter obtained from the spectrogram model parameters.
    AlkuperäiskieliEnglanti
    OtsikkoParametric time-frequency-domain spatial audio
    ToimittajatVille Pulkki, Symeon Delikaris-Manias, Archontis Politis
    KustantajaJohn Wiley & Sons
    Sivut215-250
    ISBN (elektroninen)978-1-119-25263-4
    ISBN (painettu)978-1-119-25259-7
    DOI - pysyväislinkit
    TilaJulkaistu - 13 lokak. 2017
    OKM-julkaisutyyppiA3 Kirjan tai muun kokoomateoksen osa

    Julkaisufoorumi-taso

    • Jufo-taso 2

    Sormenjälki

    Sukella tutkimusaiheisiin 'Source Separation and Reconstruction of Spatial Audio Using Spectrogram Factorization'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

    Siteeraa tätä