Abstract
Effective employment of deep neural networks (DNNs) in mobile devices and embedded systems, like field programmable gate arrays, is hampered by requirements for memory and computational power. In this paper we propose a method that employs a non-uniform fixed-point quantization and a virtual bit shift (VBS) to improve the accuracy of the quantization of the DNN weights. We evaluate our method in a speech enhancement application, where a fully connected DNN is used to predict the clean speech spectrum from the input noisy speech spectrum. A DNN is optimized, its memory requirement is calculated, and its performance is evaluated using the short-time objective intelligibility (STOI) metric. The application of the low-bit quantization leads to a 50% reduction of the DNN memory requirement while the STOI performance drops only by 2.7%.
Original language | English |
---|---|
Title of host publication | 2020 28th European Signal Processing Conference (EUSIPCO) |
Publisher | IEEE |
Pages | 466-470 |
Number of pages | 5 |
ISBN (Electronic) | 978-9-0827-9705-3 |
DOIs | |
Publication status | Published - 2021 |
Publication type | A4 Article in conference proceedings |
Event | European Signal Processing Conference - Beurs van Berlage, Amsterdam, Netherlands Duration: 18 Jan 2021 → 22 Jan 2021 Conference number: 28 https://eusipco2020.org |
Publication series
Name | European Signal Processing Conference |
---|---|
ISSN (Electronic) | 2076-1465 |
Conference
Conference | European Signal Processing Conference |
---|---|
Abbreviated title | EUSIPCO2020 |
Country/Territory | Netherlands |
City | Amsterdam |
Period | 18/01/21 → 22/01/21 |
Internet address |
Keywords
- Quantization (signal)
- Neural networks
- Memory management
- Speech enhancement
- Logic gates
- Table lookup
- Field programmable gate arrays
- neural network quantization
- memory footprint reduction
- FPGA
- hardware accelerators
Publication forum classification
- Publication forum level 1