Mind The Structure: Adopting Structural Information For Deep Neural Network Compression

Homayun Afrabandpey, Anton Muravev, Hamed R. Tavakoli, Honglei Zhang, Francesco Cricri, Moncef Gabbouj, Emre Aksu

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

1 Citation (Scopus)

Abstract

Deep neural networks have huge number of parameters and require large number of bits for representation. This hinders their adoption in decentralized environments where model transfer among different parties is a characteristic of the environment while the communication bandwidth is limited. Parameter quantization is a compression approach to address this challenge by reducing the number of bits required to represent a model, e.g. a neural network. However, majority of existing neural network quantization methods do not exploit structural information of layers and parameters during quantization. In this paper, focusing on Convolutional Neural Networks (CNNs), we present a novel quantization approach by employing the structural information of neural network layers and their corresponding parameters. Starting from a pre-trained CNN, we categorize network parameters into different groups based on the similarity of their layers and their spatial structure. Parameters of each group are independently clustered and the centroid of each cluster is used as representative for all parameters in the cluster. Finally, the centroids and the cluster indexes of the parameters are used as a compact representation of the parameters. Experiments with two different tasks, i.e., acoustic scene classification and image compression, demonstrate the effectiveness of the proposed approach.
Original languageEnglish
Title of host publication2021 IEEE International Conference on Image Processing (ICIP)
Pages3532-3536
Number of pages5
ISBN (Electronic)978-1-6654-4115-5
DOIs
Publication statusPublished - 19 Sept 2021
Publication typeA4 Article in conference proceedings
EventIEEE International Conference on Image Processing - , United States
Duration: 19 Sept 202122 Sept 2021

Publication series

NameProceedings : International Conference on Image Processing
ISSN (Electronic)2381-8549

Conference

ConferenceIEEE International Conference on Image Processing
Country/TerritoryUnited States
Period19/09/2122/09/21

Publication forum classification

  • Publication forum level 1

Fingerprint

Dive into the research topics of 'Mind The Structure: Adopting Structural Information For Deep Neural Network Compression'. Together they form a unique fingerprint.

Cite this