TY - GEN
T1 - BabySLM
T2 - Annual Conference of the International Speech Communication Association, INTERSPEECH
AU - Lavechin, Marvin
AU - Sy, Yaya
AU - Titeux, Hadrien
AU - Blandón, María Andrea Cruz
AU - Räsänen, Okko
AU - Bredin, Hervé
AU - Dupoux, Emmanuel
AU - Cristia, Alejandrina
N1 - Publisher Copyright:
© 2023 International Speech Communication Association. All rights reserved.
PY - 2023
Y1 - 2023
N2 - Self-supervised techniques for learning speech representations have been shown to develop linguistic competence from exposure to speech without the need for human labels. In order to fully realize the potential of these approaches and further our understanding of how infants learn language, simulations must closely emulate real-life situations by training on developmentally plausible corpora and benchmarking against appropriate test sets. To this end, we propose a language-acquisition-friendly benchmark to probe spoken language models at the lexical and syntactic levels, both of which are compatible with the vocabulary typical of children's language experiences. This paper introduces the benchmark and summarizes a range of experiments showing its usefulness. In addition, we highlight two exciting challenges that need to be addressed for further progress: bridging the gap between text and speech and between clean speech and in-the-wild speech.
AB - Self-supervised techniques for learning speech representations have been shown to develop linguistic competence from exposure to speech without the need for human labels. In order to fully realize the potential of these approaches and further our understanding of how infants learn language, simulations must closely emulate real-life situations by training on developmentally plausible corpora and benchmarking against appropriate test sets. To this end, we propose a language-acquisition-friendly benchmark to probe spoken language models at the lexical and syntactic levels, both of which are compatible with the vocabulary typical of children's language experiences. This paper introduces the benchmark and summarizes a range of experiments showing its usefulness. In addition, we highlight two exciting challenges that need to be addressed for further progress: bridging the gap between text and speech and between clean speech and in-the-wild speech.
KW - child language
KW - language acquisition
KW - self-supervised learning
KW - spoken language modeling
U2 - 10.21437/Interspeech.2023-978
DO - 10.21437/Interspeech.2023-978
M3 - Conference contribution
AN - SCOPUS:85171546944
T3 - Interspeech
SP - 4588
EP - 4592
BT - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
PB - International Speech Communication Association
Y2 - 20 August 2023 through 24 August 2023
ER -