This is a dataset containing audio captions for audio files of the TAU Urban Acoustic Scenes 2019 development dataset (airport, public square, and park) for 10 cities.
Martin Morato, I. & Mesaros, A., 15 marrask. 2021, Proceedings of the 6th Workshop on Detection and Classication of Acoustic Scenes and Events (DCASE 2021). Font, F., Mesaros, A., P.W. Ellis, D., Fonseca, E., Fuentes, M. & Elizalde, B. (toim.). DCASE, s. 90-94
Tutkimustuotos: Konferenssiartikkeli › Tieteellinen › vertaisarvioitu
Open access
Tiedosto
18Lataukset
(Pure)
Siteeraa tätä
DataSetCite
Martin Morato, I. (Creator), Harju, M. (Creator), Hirvonen, M. (Creator), Mesaros, A. (Creator) (6 kesäk. 2024). SiVi-CAFE dataset - Sighted and Visually-impaired Captions for Audio in Finnish and English. Zenodo. 10.5281/zenodo.11505822