Context-dependent sound event detection


    Research output: Article, scientific, peer-reviewed

    122 Citations (Scopus)
    1 Downloads (Pure)

    Abstract

    This article studies how context information can be used in automatic sound event detection, and how a detection system can benefit from it. Humans use context information to make more accurate predictions about sound events and to rule out events that are unlikely in a given context. We propose a similar use of context information in automatic sound event detection. The proposed approach consists of two stages: an automatic context recognition stage and a sound event detection stage. Contexts are modeled using Gaussian mixture models, and sound events are modeled using three-state left-to-right hidden Markov models. In the first stage, the audio context of the tested signal is recognized. Based on the recognized context, a context-specific set of sound event classes is selected for the sound event detection stage. The event detection stage also uses context-dependent acoustic models and count-based event priors. Two alternative event detection approaches are studied. The first outputs a monophonic event sequence by detecting the most prominent sound event at each time instance using Viterbi decoding. The second introduces a new method for producing a polyphonic event sequence by detecting multiple overlapping sound events using multiple restricted Viterbi passes. A new metric is introduced to evaluate sound event detection performance at various levels of polyphony. It combines the detection accuracy and the coarse time-resolution error into a single metric, which simplifies the comparison of detection algorithms. The two-stage approach was found to improve the results substantially compared to the context-independent baseline system. At the block level, the detection accuracy can be almost doubled by using the proposed context-dependent event detection.
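The two-stage idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes 1-D features, one HMM state per event instead of three-state left-to-right models, a fixed self-transition probability, and one possible reading of "restricted" passes in which events found at a frame in earlier passes are excluded at that frame in later ones. All names, model parameters, and counts (`CONTEXT_GMMS`, `EVENTS`, `COUNTS`, `detect`) are hypothetical.

```python
import math

def log_gauss(x, mean, var):
    """Log density of a 1-D Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)

def gmm_loglik(frames, gmm):
    """Total log-likelihood of frames under a GMM of (weight, mean, var) triples."""
    total = 0.0
    for x in frames:
        comps = [math.log(w) + log_gauss(x, m, v) for w, m, v in gmm]
        top = max(comps)  # log-sum-exp for numerical stability
        total += top + math.log(sum(math.exp(c - top) for c in comps))
    return total

def recognize_context(frames, context_gmms):
    """Stage 1: pick the context whose GMM best explains the whole signal."""
    return max(context_gmms, key=lambda c: gmm_loglik(frames, context_gmms[c]))

def viterbi(frames, events, priors, stay=0.9, banned=None):
    """Most likely one-event-per-frame path; banned[t] holds events excluded at frame t."""
    names = list(events)
    switch = (1.0 - stay) / max(len(names) - 1, 1)

    def emit(t, e):
        if banned and e in banned[t]:
            return -math.inf
        return log_gauss(frames[t], *events[e])

    def trans(p, e):
        return math.log(stay if p == e else switch)

    score = {e: math.log(priors[e]) + emit(0, e) for e in names}
    back = []
    for t in range(1, len(frames)):
        new, ptr = {}, {}
        for e in names:
            best = max(names, key=lambda p: score[p] + trans(p, e))
            new[e] = score[best] + trans(best, e) + emit(t, e)
            ptr[e] = best
        score = new
        back.append(ptr)
    path = [max(names, key=score.get)]
    for ptr in reversed(back):  # follow back-pointers from the last frame
        path.append(ptr[path[-1]])
    return path[::-1]

def detect_polyphonic(frames, events, priors, passes=2):
    """Stage 2: repeated Viterbi passes; passes must not exceed the number of events."""
    banned = [set() for _ in frames]
    layers = []
    for _ in range(passes):
        path = viterbi(frames, events, priors, banned=banned)
        layers.append(path)
        for t, e in enumerate(path):
            banned[t].add(e)  # exclude this event at this frame in later passes
    return layers

# Hypothetical toy models: context GMMs and per-context event Gaussians (mean, var).
CONTEXT_GMMS = {
    "street": [(0.5, 0.0, 1.0), (0.5, 4.0, 1.0)],
    "office": [(0.5, 10.0, 1.0), (0.5, 14.0, 1.0)],
}
EVENTS = {
    "street": {"car": (0.0, 1.0), "footsteps": (4.0, 1.0), "speech": (2.0, 4.0)},
    "office": {"keyboard": (10.0, 1.0), "speech": (14.0, 1.0), "door": (12.0, 4.0)},
}
# Hypothetical event counts per context, used to form count-based priors.
COUNTS = {
    "street": {"car": 6, "footsteps": 3, "speech": 1},
    "office": {"keyboard": 5, "speech": 4, "door": 1},
}

def detect(frames, passes=2):
    """Recognize the context, then decode events with that context's models and priors."""
    context = recognize_context(frames, CONTEXT_GMMS)
    total = sum(COUNTS[context].values())
    priors = {e: n / total for e, n in COUNTS[context].items()}
    return context, detect_polyphonic(frames, EVENTS[context], priors, passes)
```

Calling `detect(frames, passes=1)` gives the monophonic case, while a larger `passes` value yields one event layer per pass. The key point of the sketch is that the recognized context selects both the event set and the priors before any decoding takes place.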
    Original language: English
    Article number: 1
    Number of pages: 13
    Journal: Eurasip Journal on Audio, Speech, and Music Processing
    Volume: 2013
    Status: Published - 2013
    OKM publication type: A1 Original article in a scientific journal

    Publication Forum level

    • Jufo level 1
