Learned Image Coding for Machines: A Content-Adaptive Approach

Nam Le, Honglei Zhang, Francesco Cricri, Ramin Ghaznavi-Youvalari, Hamed R. Tavakoli, Esa Rahtu

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

40 Citations (Scopus)

Abstract

Today, according to the Cisco Annual Internet Report (2018-2023), the fastest-growing category of Internet traffic is machine-to-machine communication. In particular, machine-to-machine communication of images and videos represents a new challenge and opens up new perspectives in the context of data compression. One possible solution approach consists of adapting current human-targeted image and video coding standards to the use case of machine consumption. Another approach consists of developing completely new compression paradigms and architectures for machine-to-machine communications. In this paper, we focus on image compression and present an inference-time content-adaptive fine-tuning scheme that optimizes the latent representation of an end-to-end learned image codec, aimed at improving the compression efficiency for machine-consumption. The conducted experiments targeting instance segmentation task network show that our online finetuning brings an average bitrate saving (BD-rate) of -3.66% with respect to our pretrained image codec. In particular, at low bitrate points, our proposed method results in a significant bitrate saving of -9.85%. Overall, our pretrained-and-then-finetuned system achieves - 30.54% BD-rate over the state-of-the-art image/video codec Versatile Video Coding (VVC) on instance segmentation.
Original languageEnglish
Title of host publication2021 IEEE International Conference on Multimedia and Expo (ICME)
PublisherIEEE
Pages1-6
Number of pages6
ISBN (Electronic)978-1-6654-3864-3
DOIs
Publication statusPublished - 2021
Publication typeA4 Article in conference proceedings
EventIEEE International Conference on Multimedia and Expo - , China
Duration: 5 Jul 20219 Jul 2021

Publication series

Name
ISSN (Electronic)1945-788X

Conference

ConferenceIEEE International Conference on Multimedia and Expo
Country/TerritoryChina
Period5/07/219/07/21

Keywords

  • Video coding
  • Machine-to-machine communications
  • Image segmentation
  • Computer vision
  • Image coding
  • Codecs
  • Bit rate
  • Image coding for machines
  • learned image compression
  • content-adaptation
  • finetuning
  • video coding for machines

Publication forum classification

  • Publication forum level 1

Fingerprint

Dive into the research topics of 'Learned Image Coding for Machines: A Content-Adaptive Approach'. Together they form a unique fingerprint.

Cite this