Luma Range Scaling for Enhanced VVC Efficiency in Video Coding for Machines

Tero Partanen, Alban Marie, Alexandre Mercat, Jarno Vanne, Miska M. Hannuksela, Honglei Zhang, Alireza Aminlou, Francesco Cricri

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review

Abstract

Recent years have shown significant growth in video data traffic for machine vision applications, catalyzing new standardization efforts in video coding for machines (VCM). These activities focus on compressing images and videos for machine vision tasks, rather than for human viewing. In this work, we propose a novel method that scales down the luma range to enhance the coding efficiency of Versatile Video Coding (VVC) for machine consumption. This method results in a lower bitrate after encoding and has only minimal adverse effects on the accuracy of machine vision tasks. In our experiments, we down-scale the luma channel of the input video using luma-scaling factors from 0.2 to 0.9 and evaluate coding results with optional back-scaling to the original range before machine vision tasks. Our results with the VVC Test Model (VTM) demonstrate that the proposed technique achieves coding gains of up to 37.9% and 46.1% for the same object detection and tracking accuracy, respectively.
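
The down-scaling and optional back-scaling steps described in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact scaling formula, so plain multiplication of the luma (Y) plane by the factor, with rounding and clipping, is an assumption here; the function names `scale_luma` and `backscale_luma` and the `bit_depth` parameter are hypothetical and not taken from the paper. Chroma planes are left untouched.

```python
import numpy as np


def scale_luma(y, factor, bit_depth=8):
    """Down-scale a luma plane by `factor`.

    Assumption: the scaling is a plain multiplication followed by
    rounding; the paper's exact formula may differ.
    """
    if not 0.0 < factor <= 1.0:
        raise ValueError("factor must be in (0, 1]")
    max_val = (1 << bit_depth) - 1
    scaled = np.rint(y.astype(np.float64) * factor)
    return np.clip(scaled, 0, max_val).astype(y.dtype)


def backscale_luma(y_scaled, factor, bit_depth=8):
    """Optionally restore the original luma range after decoding.

    Assumption: back-scaling is the inverse division, clipped to the
    valid sample range. Rounding makes the round trip lossy in general.
    """
    if not 0.0 < factor <= 1.0:
        raise ValueError("factor must be in (0, 1]")
    max_val = (1 << bit_depth) - 1
    restored = np.rint(y_scaled.astype(np.float64) / factor)
    return np.clip(restored, 0, max_val).astype(y_scaled.dtype)


if __name__ == "__main__":
    # Toy 8-bit luma plane; a real pipeline would read Y from a YUV file.
    y = np.array([[0, 100, 200]], dtype=np.uint8)
    y_small = scale_luma(y, 0.5)          # this plane would go to the encoder
    y_back = backscale_luma(y_small, 0.5)  # optional step before the vision task
    print(y_small, y_back)
```

In this sketch the encoder would see the reduced-range plane, and the back-scaling step is applied (or skipped) on the decoded output before the machine vision task, mirroring the two evaluation variants mentioned in the abstract.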
Original language: English
Title of host publication: 2024 IEEE 26th International Workshop on Multimedia Signal Processing (MMSP)
Publisher: IEEE
Pages: 1-6
Number of pages: 6
ISBN (Electronic): 979-8-3503-8725-4
Publication status: Published - Oct 2024
Publication type: A4 Article in conference proceedings
Event: IEEE International Workshop on Multimedia Signal Processing - West Lafayette, United States
Duration: 2 Oct 2024 to 4 Oct 2024

Publication series

Name
ISSN (Electronic): 2473-3628

Conference

Conference: IEEE International Workshop on Multimedia Signal Processing
Country/Territory: United States
City: West Lafayette
Period: 2/10/24 to 4/10/24

Keywords

  • Video coding
  • Video Coding for Machines (VCM)
  • Machine vision
  • Bit rate
  • Object tracking
  • Object detection
  • Standardization
  • Streaming media
  • Encoding
  • Common Test Conditions (CTC)
  • Versatile Video Coding (VVC)

Publication forum classification

  • Publication forum level 1
