Detection of Typical Pronunciation Errors in Non-native English Speech Using Convolutional Recurrent Neural Networks

Aleksandr Diment, Eemi Fagerlund, Adrian Benfield, Tuomas Virtanen

Tutkimustuotos: KonferenssiartikkeliTieteellinenvertaisarvioitu

7 Sitaatiot (Scopus)

Abstrakti

A machine learning method for the automatic detection of pronunciation errors made by non-native speakers of English is proposed. It consists of training word-specific binary classifiers on a collected dataset of isolated words with possible pronunciation errors, typical for Finnish native speakers. The classifiers predict whether the typical error is present in the given word utterance. They operate on sequences of acoustic features, extracted from consecutive frames of an audio recording of a word utterance. The proposed architecture includes a convolutional neural network, a recurrent neural network, or a combination of the two. The optimal topology and hyperpa-rameters are obtained in a Bayesian optimisation setting using a tree-structured Parzen estimator. A dataset of 80 words uttered naturally by 120 speakers is collected. The performance of the proposed system, evaluated on a well-represented subset of the dataset, shows that it is capable of detecting pronunciation errors in most of the words (46/49) with high accuracy (mean accuracy gain over the zero rule 12.21 percent points).

AlkuperäiskieliEnglanti
Otsikko2019 International Joint Conference on Neural Networks, IJCNN 2019
KustantajaIEEE
ISBN (elektroninen)9781728119854
DOI - pysyväislinkit
TilaJulkaistu - 1 heinäk. 2019
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaInternational Joint Conference on Neural Networks - Budapest, Unkari
Kesto: 14 heinäk. 201919 heinäk. 2019

Julkaisusarja

Nimi
ISSN (elektroninen)2161-4407

Conference

ConferenceInternational Joint Conference on Neural Networks
Maa/AlueUnkari
KaupunkiBudapest
Ajanjakso14/07/1919/07/19

Julkaisufoorumi-taso

  • Jufo-taso 1

!!ASJC Scopus subject areas

  • Software
  • Artificial Intelligence

Sormenjälki

Sukella tutkimusaiheisiin 'Detection of Typical Pronunciation Errors in Non-native English Speech Using Convolutional Recurrent Neural Networks'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä