Siirry päänavigointiin Siirry hakuun Siirry pääsisältöön

Learned Image Coding for Machines: A Content-Adaptive Approach

  • Nam Le
  • , Honglei Zhang
  • , Francesco Cricri
  • , Ramin Ghaznavi-Youvalari
  • , Hamed R. Tavakoli
  • , Esa Rahtu

Tutkimustuotos: KonferenssiartikkeliTieteellinenvertaisarvioitu

55 Sitaatiot (Scopus)

Abstrakti

Today, according to the Cisco Annual Internet Report (2018-2023), the fastest-growing category of Internet traffic is machine-to-machine communication. In particular, machine-to-machine communication of images and videos represents a new challenge and opens up new perspectives in the context of data compression. One possible solution approach consists of adapting current human-targeted image and video coding standards to the use case of machine consumption. Another approach consists of developing completely new compression paradigms and architectures for machine-to-machine communications. In this paper, we focus on image compression and present an inference-time content-adaptive fine-tuning scheme that optimizes the latent representation of an end-to-end learned image codec, aimed at improving the compression efficiency for machine-consumption. The conducted experiments targeting instance segmentation task network show that our online finetuning brings an average bitrate saving (BD-rate) of -3.66% with respect to our pretrained image codec. In particular, at low bitrate points, our proposed method results in a significant bitrate saving of -9.85%. Overall, our pretrained-and-then-finetuned system achieves - 30.54% BD-rate over the state-of-the-art image/video codec Versatile Video Coding (VVC) on instance segmentation.
AlkuperäiskieliEnglanti
Otsikko2021 IEEE International Conference on Multimedia and Expo (ICME)
KustantajaIEEE
Sivut1-6
Sivumäärä6
ISBN (elektroninen)978-1-6654-3864-3
DOI - pysyväislinkit
TilaJulkaistu - 2021
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaIEEE International Conference on Multimedia and Expo - , Kiina
Kesto: 5 heinäk. 20219 heinäk. 2021

Julkaisusarja

Nimi
ISSN (elektroninen)1945-788X

Conference

ConferenceIEEE International Conference on Multimedia and Expo
Maa/AlueKiina
Ajanjakso5/07/219/07/21

Julkaisufoorumi-taso

  • Jufo-taso 1

Sormenjälki

Sukella tutkimusaiheisiin 'Learned Image Coding for Machines: A Content-Adaptive Approach'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä