Siirry päänavigointiin Siirry hakuun Siirry pääsisältöön

BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models

  • Marvin Lavechin
  • , Yaya Sy
  • , Hadrien Titeux
  • , María Andrea Cruz Blandón
  • , Okko Räsänen
  • , Hervé Bredin
  • , Emmanuel Dupoux
  • , Alejandrina Cristia

Tutkimustuotos: KonferenssiartikkeliTieteellinenvertaisarvioitu

16 Sitaatiot (Scopus)
20 Lataukset (Pure)

Abstrakti

Self-supervised techniques for learning speech representations have been shown to develop linguistic competence from exposure to speech without the need for human labels. In order to fully realize the potential of these approaches and further our understanding of how infants learn language, simulations must closely emulate real-life situations by training on developmentally plausible corpora and benchmarking against appropriate test sets. To this end, we propose a language-acquisition-friendly benchmark to probe spoken language models at the lexical and syntactic levels, both of which are compatible with the vocabulary typical of children's language experiences. This paper introduces the benchmark and summarizes a range of experiments showing its usefulness. In addition, we highlight two exciting challenges that need to be addressed for further progress: bridging the gap between text and speech and between clean speech and in-the-wild speech.

AlkuperäiskieliEnglanti
OtsikkoProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
KustantajaInternational Speech Communication Association
Sivut4588-4592
Sivumäärä5
DOI - pysyväislinkit
TilaJulkaistu - 2023
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaAnnual Conference of the International Speech Communication Association, INTERSPEECH - Dublin, Irlanti
Kesto: 20 elok. 202324 elok. 2023

Julkaisusarja

NimiInterspeech
KustantajaInternational Speech Communication Association
ISSN (elektroninen)2958-1796

Conference

ConferenceAnnual Conference of the International Speech Communication Association, INTERSPEECH
Maa/AlueIrlanti
KaupunkiDublin
Ajanjakso20/08/2324/08/23

Julkaisufoorumi-taso

  • Jufo-taso 1

!!ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modelling and Simulation

Sormenjälki

Sukella tutkimusaiheisiin 'BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä