Skip to main navigation Skip to search Skip to main content

Examining The Perceptual Effect of Alternative Objective Functions for Deep Learning Based Music Source Separation

  • Stylianos Ioannis Mimilakis
  • , Estefania Cano
  • , Derry FitzGerald
  • , Konstantinos Drossos
  • , Gerald Schuller

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

    2 Citations (Scopus)

    Abstract

    In this study, we examine the effect of various objective functions used to optimize the recently proposed deep learning architecture for singing voice separation MaD - Masker and Denoiser. The parameters of the MaD architecture are optimized using an objective function that contains a reconstruction criterion between predicted and true magnitude spectra of the singing voice, and a regularization term. We examine various reconstruction criteria such as the generalized Kullback-Leibler, mean squared error, and noise to mask ratio. We also explore recently proposed, for optimizing MaD, regularization terms such as sparsity and TwinNetwork regularization. Results from both objective assessment and listening tests suggest that the TwinNetwork regularization results in improved singing voice separation quality.
    Original languageEnglish
    Title of host publication2018 52nd Asilomar Conference on Signals, Systems, and Computers
    PublisherIEEE
    Pages679-683
    ISBN (Electronic)978-1-5386-9218-9
    ISBN (Print)978-1-5386-9219-6
    DOIs
    Publication statusPublished - Oct 2018
    Publication typeA4 Article in conference proceedings
    EventAsilomar Conference on Signals, Systems and Computers -
    Duration: 1 Jan 1900 → …

    Publication series

    Name
    ISSN (Print)1058-6393

    Conference

    ConferenceAsilomar Conference on Signals, Systems and Computers
    Period1/01/00 → …

    Publication forum classification

    • Publication forum level 1

    Fingerprint

    Dive into the research topics of 'Examining The Perceptual Effect of Alternative Objective Functions for Deep Learning Based Music Source Separation'. Together they form a unique fingerprint.

    Cite this