TY - GEN
T1 - Active Learning for Sound Event Classification by Clustering Unlabeled Data
AU - Zhao, Shuyang
AU - Heittola, Toni
AU - Virtanen, Tuomas
PY - 2017
Y1 - 2017
N2 - This paper proposes a novel active learning method to save annotation effort when preparing material to train sound event classifiers. K-medoids clustering is performed on unlabeled sound segments, and medoids of clusters are presented to annotators for labeling. The annotated label for a medoid is used to derive predicted labels for other cluster members. The obtained labels are used to build a classifier using supervised training. The accuracy of the resulted classifier is used to evaluate the performance of the proposed method. The evaluation made on a public environmental sound dataset shows that the proposed method outperforms reference methods (random sampling, certainty-based active learning and semi-supervised learning) with all simulated labeling budgets, the number of available labeling responses. Through all the experiments, the proposed method saves 50%–60% labeling budget to achieve the same accuracy, with respect to the best reference method.
AB - This paper proposes a novel active learning method to save annotation effort when preparing material to train sound event classifiers. K-medoids clustering is performed on unlabeled sound segments, and medoids of clusters are presented to annotators for labeling. The annotated label for a medoid is used to derive predicted labels for other cluster members. The obtained labels are used to build a classifier using supervised training. The accuracy of the resulted classifier is used to evaluate the performance of the proposed method. The evaluation made on a public environmental sound dataset shows that the proposed method outperforms reference methods (random sampling, certainty-based active learning and semi-supervised learning) with all simulated labeling budgets, the number of available labeling responses. Through all the experiments, the proposed method saves 50%–60% labeling budget to achieve the same accuracy, with respect to the best reference method.
KW - active learning, sound event classification, K-medoids clustering
U2 - 10.1109/ICASSP.2017.7952256
DO - 10.1109/ICASSP.2017.7952256
M3 - Conference contribution
SP - 751
EP - 755
BT - 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
PB - IEEE
T2 - IEEE International Conference on Acoustics, Speech and Signal Processing
Y2 - 1 January 1900 through 1 January 2000
ER -