Abstract
The Joint Video Expert Team (JVET) is currently developing the next-generation MPEG/ITU video coding standard called Versatile Video Coding (VVC) and their ultimate goal is to double the coding efficiency over the state-of-the-art HEVC standard.The latest version of the VVC reference encoder, VTM6.1, is able to improve the intra coding efficiency by 24 % over the HEVC reference encoder HM16.20, but at the expense of 27 times the encoding time. The complexity overhead of VVC primarily stems from its novel block partitioning scheme that complements Quad-Tree (QT) split with Multi-Type Tree (MTT) partitioning in order to better fit the local variations of the video signal. This work reduces the block partitioning complexity of VTM6.1 through the use of Convolutional Neural Networks (CNNs). For each 64 × 64 Coding Unit (CU), the CNN is trained to predict a probability vector that speeds up coding block partitioning in encoding. Our solution is shown to decrease the intra encoding complexity of VTM6.1 by 51.5% with a bitrate increase of only 1.45%.
Original language | English |
---|---|
Title of host publication | 2020 IEEE International Conference on Image Processing (ICIP) |
Publisher | IEEE |
Pages | 3139-3143 |
Number of pages | 5 |
ISBN (Print) | 9781728163956 |
DOIs | |
Publication status | Published - Oct 2020 |
Publication type | A4 Article in conference proceedings |
Event | IEEE International Conference on Image Processing - United Arab Emirates, Abu Dhabi, United Arab Emirates Duration: 25 Oct 2020 → 28 Oct 2020 https://2020.ieeeicip.org |
Publication series
Name | Proceedings - International Conference on Image Processing, ICIP |
---|---|
Volume | 2020-October |
ISSN (Print) | 1522-4880 |
Conference
Conference | IEEE International Conference on Image Processing |
---|---|
Abbreviated title | ICIP 2020 |
Country/Territory | United Arab Emirates |
City | Abu Dhabi |
Period | 25/10/20 → 28/10/20 |
Internet address |
Publication forum classification
- Publication forum level 1