Tailored AVX2 Transform Kernels for Versatile Video Coding

Tutkimustuotos: KonferenssiartikkeliTieteellinenvertaisarvioitu

1 Sitaatiot (Scopus)
61 Lataukset (Pure)

Abstrakti

Transform coding tools play an integral part in video codecs due to their substantial impact on coding efficiency. The latest video coding standard, Versatile Video Coding (VVC), makes the most of these tools by introducing new DST7, DCT8, and non-square transforms alongside the conventional DCT2 transform. This paper proposes optimized AVX2 kernels for all these transforms to speed up VVC coding. Unlike existing solutions, our kernels are specially tailored for each VVC transform type and block size. Accelerating our open-source uvg266 VVC encoder with the proposed kernels yields up to a 1.1× speedup under all intra (AI) coding condition without any coding overhead. Our implementations make forward DCT2 and DST7/DCT8 transforms 4.0× and 6.7× as fast as their respective scalar implementations in the VTM reference encoder. They also outpace the AVX2 kernels of the practical VVenC encoder by factors of 3.0× and 2.8×. The respective speedups rise up to 5.3×, 11.1×, 3.4×, and 3.0× with inverse transforms.
AlkuperäiskieliEnglanti
Otsikko2023 IEEE Nordic Circuits and Systems Conference (NorCAS)
KustantajaIEEE
Sivumäärä6
ISBN (elektroninen)979-8-3503-3757-0
ISBN (painettu)979-8-3503-3758-7
DOI - pysyväislinkit
TilaJulkaistu - 31 lokak. 2023
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaIEEE Nordic Circuits and Systems Conference - Aalborg, Tanska
Kesto: 30 lokak. 20231 marrask. 2023

Conference

ConferenceIEEE Nordic Circuits and Systems Conference
Maa/AlueTanska
KaupunkiAalborg
Ajanjakso30/10/231/11/23

Julkaisufoorumi-taso

  • Jufo-taso 1

Sormenjälki

Sukella tutkimusaiheisiin 'Tailored AVX2 Transform Kernels for Versatile Video Coding'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä