Developing computational systems requires methods for evaluating their performance to guide development and compare alternate approaches. A reliable evaluation procedure for a classification or recognition system will involve a standard dataset of example input data along with the intended target output, and well-defined metrics to compare the systems' outputs with this ground truth. This chapter examines the important factors in the design and construction of evaluation datasets and goes through the metrics commonly used in system evaluation, comparing their properties. We include a survey of currently available datasets for environmental sound scene and event recognition and conclude with advice for designing evaluation protocols.
|Title of host publication||Computational Analysis of Sound Scenes and Events|
|Editors||Tuomas Virtanen, Mark D. Plumbley, Dan Ellis|
|Place of Publication||Cham|
|Publication status||Published - 2018|
|Publication type||A3 Book chapter|
Publication forum classification
- Publication forum level 2