Skip to main navigation Skip to search Skip to main content

BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models

  • Marvin Lavechin
  • , Yaya Sy
  • , Hadrien Titeux
  • , María Andrea Cruz Blandón
  • , Okko Räsänen
  • , Hervé Bredin
  • , Emmanuel Dupoux
  • , Alejandrina Cristia

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

16 Citations (Scopus)
20 Downloads (Pure)

Abstract

Self-supervised techniques for learning speech representations have been shown to develop linguistic competence from exposure to speech without the need for human labels. In order to fully realize the potential of these approaches and further our understanding of how infants learn language, simulations must closely emulate real-life situations by training on developmentally plausible corpora and benchmarking against appropriate test sets. To this end, we propose a language-acquisition-friendly benchmark to probe spoken language models at the lexical and syntactic levels, both of which are compatible with the vocabulary typical of children's language experiences. This paper introduces the benchmark and summarizes a range of experiments showing its usefulness. In addition, we highlight two exciting challenges that need to be addressed for further progress: bridging the gap between text and speech and between clean speech and in-the-wild speech.

Original languageEnglish
Title of host publicationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
PublisherInternational Speech Communication Association
Pages4588-4592
Number of pages5
DOIs
Publication statusPublished - 2023
Publication typeA4 Article in conference proceedings
EventAnnual Conference of the International Speech Communication Association, INTERSPEECH - Dublin, Ireland
Duration: 20 Aug 202324 Aug 2023

Publication series

NameInterspeech
PublisherInternational Speech Communication Association
ISSN (Electronic)2958-1796

Conference

ConferenceAnnual Conference of the International Speech Communication Association, INTERSPEECH
Country/TerritoryIreland
CityDublin
Period20/08/2324/08/23

Keywords

  • child language
  • language acquisition
  • self-supervised learning
  • spoken language modeling

Publication forum classification

  • Publication forum level 1

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modelling and Simulation

Fingerprint

Dive into the research topics of 'BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models'. Together they form a unique fingerprint.

Cite this