Abstract
A Neural Network Compression (NNC) standard aims to define a set of coding tools for efficient compression and transmission of neural networks. This paper addresses the high-level syntax (HLS) of NNC and proposes three HLS techniques for network topology coding and payload partitioning. Our first technique provides an efficient way to code prune topology information. It removes redundancy in the bitmask and thereby improves coding efficiency by 4–99% over existing approaches. The second technique processes bitmasks in larger chunks instead of one bit at a time. It is shown to reduce the computational complexity of NNC encoding by 63% and of NNC decoding by 82%. Our third technique makes use of partial data counters to partition an NNC bitstream into uniformly sized units for more efficient data transmission. Even though the smaller partition sizes introduce some overhead, our network simulations show better throughput due to lower packet retransmission rates. To our knowledge, this is the first work to address the practical implementation aspects of HLS. The proposed techniques can be seen as key enabling factors for efficient adaptation and economical deployment of the NNC standard in a plurality of next-generation industrial and academic applications.
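The general idea behind processing a bitmask in larger chunks can be illustrated with a minimal sketch. It is not taken from the paper or from the NNC specification: the function names, the byte-sized chunk, and the lookup table are all assumptions chosen for illustration. The sketch counts the set bits of a packed mask, once by testing a single bit at a time and once by handling eight bits per step.

```python
# Hypothetical illustration only; this is neither the NNC syntax nor the
# paper's algorithm. It contrasts bit-at-a-time and chunked mask handling.

# Precomputed number of set bits for every possible byte value (0..255).
POPCOUNT = [bin(b).count("1") for b in range(256)]


def count_set_bits_bitwise(mask: bytes) -> int:
    """Count set bits by testing one bit at a time."""
    total = 0
    for byte in mask:
        for shift in range(8):
            total += (byte >> shift) & 1
    return total


def count_set_bits_chunked(mask: bytes) -> int:
    """Count set bits one byte (8 bits) at a time via a lookup table."""
    return sum(POPCOUNT[byte] for byte in mask)


if __name__ == "__main__":
    mask = bytes([0b10110000, 0b00000000, 0b11111111])
    # Both approaches must agree on the result.
    assert count_set_bits_bitwise(mask) == count_set_bits_chunked(mask)
    print(count_set_bits_chunked(mask))
```

Both functions return the same count; the chunked variant simply performs one table lookup per byte instead of eight shift-and-mask steps. The complexity reductions reported in the abstract refer to the authors' own technique and measurements, not to this sketch.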
Original language | English |
---|---|
Title of host publication | 2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) |
Publisher | IEEE |
Number of pages | 4 |
ISBN (Electronic) | 978-1-6654-7218-0 |
ISBN (Print) | 978-1-6654-7219-7 |
Publication status | Published - 18 Jul 2022 |
Publication type | A4 Article in conference proceedings |
Event | IEEE International Conference on Multimedia and Expo Workshops - Taipei City, Taiwan, Province of China; Duration: 18 Jul 2022 → 22 Jul 2022 |
Conference
Conference | IEEE International Conference on Multimedia and Expo Workshops |
---|---|
Country/Territory | Taiwan, Province of China |
City | Taipei City |
Period | 18/07/22 → 22/07/22 |
Keywords
- Neural Network Compression (NNC)
- Neural Network Representation (NNR)
- High-Level Syntax (HLS)
- neural network topology
- bitmask coding
Publication forum classification
- Publication forum level 1