Abstract
In this paper, the use of multi label neural networks are proposed for detection of temporally overlapping sound events in realistic environments. Real-life sound recordings typically have many overlapping sound events, making it hard to recognize each event with the standard sound event detection methods. Frame-wise spectral-domain features are used as inputs to train a deep neural network for multi label classification in this work. The model is evaluated with recordings from realistic everyday environments and the obtained overall accuracy is 63.8%. The method is compared against a state-of-the-art method using non-negative matrix factorization as a pre-processing stage and hidden Markov models as a classifier. The proposed method improves the accuracy by 19% percentage points overall.
| Original language | English |
|---|---|
| Title of host publication | 2015 International Joint Conference on Neural Networks (IJCNN) |
| Publisher | IEEE |
| ISBN (Print) | 978-1-4799-1959-8 |
| DOIs | |
| Publication status | Published - Jul 2015 |
| Publication type | A4 Article in conference proceedings |
| Event | International Joint Conference on Neural Networks - Duration: 1 Jan 1900 → … |
Conference
| Conference | International Joint Conference on Neural Networks |
|---|---|
| Period | 1/01/00 → … |
Publication forum classification
- Publication forum level 1
Fingerprint
Dive into the research topics of 'Polyphonic Sound Event Detection Using Multi Label Deep Neural Networks'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver