Self-Organized Variational Autoencoders (Self-Vae) For Learned Image Compression

M. Akın Yílmaz, Onur Kelesş, Hilal Güven, A. Murat Tekalp, Junaid Malik, Serkan Kíranyaz

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

20 Citations (Scopus)

Abstract

In end-to-end optimized learned image compression, it is standard practice to use a convolutional variational autoencoder with generalized divisive normalization (GDN) to transform images into a latent space. Recently, Operational Neural Networks (ONNs) that learn the best non-linearity from a set of alternatives, and their “self-organized” variants, Self-ONNs, that approximate any non-linearity via Taylor series have been proposed to address the limitations of convolutional layers and a fixed nonlinear activation. In this paper, we propose to replace the convolutional and GDN layers in the variational autoencoder with self-organized operational layers, and propose a novel self-organized variational autoencoder (Self-VAE) architecture that benefits from stronger non-linearity. The experimental results demonstrate that the proposed Self-VAE yields improvements in both rate-distortion performance and perceptual image quality.
Original languageEnglish
Title of host publication2021 IEEE International Conference on Image Processing (ICIP)
Pages3732-3736
Number of pages5
ISBN (Electronic)978-1-6654-4115-5
DOIs
Publication statusPublished - 2021
Publication typeA4 Article in conference proceedings
EventIEEE International Conference on Image Processing - , United States
Duration: 19 Sept 202122 Sept 2021

Publication series

NameProceedings : International Conference on Image Processing
ISSN (Electronic)2381-8549

Conference

ConferenceIEEE International Conference on Image Processing
Country/TerritoryUnited States
Period19/09/2122/09/21

Keywords

  • Convolutional codes
  • Measurement
  • Visualization
  • Image coding
  • Codecs
  • Neurons
  • Rate-distortion
  • end-to-end learned image compression
  • variational autoencoder
  • self-organized operational layer
  • rate-distortion performance
  • perceptual quality metrics

Publication forum classification

  • Publication forum level 1

Fingerprint

Dive into the research topics of 'Self-Organized Variational Autoencoders (Self-Vae) For Learned Image Compression'. Together they form a unique fingerprint.

Cite this