Single-Channel Speaker Distance Estimation in Reverberant Environments

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

1 Citation (Scopus)
11 Downloads (Pure)

Abstract

We introduce the novel task of continuous-valued speaker distance estimation which focuses on estimating non-discrete distances between a sound source and microphone, based on audio captured by the microphone. A novel learning-based approach for estimating speaker distance in reverberant environments from a single omnidirectional microphone is proposed. Using common acoustic features, such as the magnitude and phase of the audio spectrogram, with a convolutional recurrent neural network results in errors on the order of centimeters in noiseless audios. Experiments are carried out by means of an image-source room simulator with convolved speeches from a public dataset. An ablation study is performed to demonstrate the effectiveness of the proposed feature set. Finally, a study of the impact of real background noise, extracted from the WHAM! dataset at different signal-to-noise ratios highlights the discrepancy between noisy and noiseless scenarios, underlining the difficulty of the problem.

Original languageEnglish
Title of host publicationProceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023
PublisherIEEE
ISBN (Electronic)979-8-3503-2372-6
DOIs
Publication statusPublished - 2023
Publication typeA4 Article in conference proceedings
EventIEEE Workshop on Applications of Signal Processing to Audio and Acoustics - New Paltz, United States
Duration: 22 Oct 202325 Oct 2023

Publication series

NameIEEE Workshop on Applications of Signal Processing to Audio and Acoustics
Volume2023-October
ISSN (Print)1931-1168
ISSN (Electronic)1947-1629

Conference

ConferenceIEEE Workshop on Applications of Signal Processing to Audio and Acoustics
Country/TerritoryUnited States
CityNew Paltz
Period22/10/2325/10/23

Keywords

  • Deep Learning
  • Distance estimation
  • Reverberation
  • Single-channel

Publication forum classification

  • Publication forum level 1

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Single-Channel Speaker Distance Estimation in Reverberant Environments'. Together they form a unique fingerprint.

Cite this