Siirry päänavigointiin Siirry hakuun Siirry pääsisältöön

Multichannel Blind Sound Source Separation using Spatial Covariance Model with Level and Time Differences and Non-Negative Matrix Factorization

  • Julio Jose Carabias Orti
  • , Joonas Nikunen
  • , Tuomas Virtanen
  • , Pedro Vera-Candeas

    Tutkimustuotos: ArtikkeliTieteellinenvertaisarvioitu

    29 Sitaatiot (Scopus)
    12 Lataukset (Pure)

    Abstrakti

    This paper presents an algorithm for multichannel sound source separation using explicit modeling of level and time differences in source spatial covariance matrices (SCM). We propose a novel SCM model in which the spatial properties are modeled by the weighted sum of direction of arrival (DOA) kernels. DOA kernels are obtained as the combination of phase and level difference covariance matrices representing both time and level differences between microphones for a grid of predefined source directions. The proposed SCM model is combined with the NMF model for the magnitude spectrograms. Opposite to other SCM models in the literature, in this work, source localization is implicitly defined in the model and estimated during the signal factorization. Therefore, no localization pre-processing is required. Parameters are estimated using complex-valued non-negative matrix factorization (CNMF) with both Euclidean distance and Itakura Saito divergence. Separation performance of the proposed system is evaluated using the two-channel SiSEC development dataset and four channels signals recorded in a regular room with moderate reverberation. Finally, a comparison to other state-of-the-art methods is performed, showing better achieved separation performance in terms of SIR and perceptual measures.

    AlkuperäiskieliEnglanti
    Sivut1512-1527
    JulkaisuIEEE/ACM Transactions on Audio Speech and Language Processing
    Vuosikerta26
    Numero9
    Varhainen verkossa julkaisun päivämäärä26 huhtik. 2018
    DOI - pysyväislinkit
    TilaJulkaistu - syysk. 2018
    OKM-julkaisutyyppiA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä

    Julkaisufoorumi-taso

    • Jufo-taso 2

    !!ASJC Scopus subject areas

    • Computer Science (miscellaneous)
    • Acoustics and Ultrasonics
    • Computational Mathematics
    • Electrical and Electronic Engineering

    Sormenjälki

    Sukella tutkimusaiheisiin 'Multichannel Blind Sound Source Separation using Spatial Covariance Model with Level and Time Differences and Non-Negative Matrix Factorization'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

    Siteeraa tätä