AVX2-optimized Kvazaar HEVC intra encoder

Ari Lemmetti, Ari Koivula, Marko Viitanen, Jarno Vanne, Timo Hämäläinen

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

    15 Citations (Scopus)
    255 Downloads (Pure)

    Abstract

    This paper presents efficient SIMD optimizations for the open-source Kvazaar HEVC intra encoder. The C implementation of Kvazaar is accelerated by Intel AVX2 instructions whose effect on Kvazaar ultrafast preset is profiled. According to our profiling results, C functions of SATD, DCT, quantization, and intra prediction account for over 60% of the total intra coding time of Kvazaar ultrafast preset. This work shows that optimizing primarily these functions doubles the coding speed of a single-threaded Kvazaar intra encoder for the same rate-distortion performance. The highest performance boost is obtained by deploying the proposed optimizations jointly with multithreading. On the Intel 8-core i7 processor, the AVX2-optimized 16-threaded Kvazaar ultrafast preset achieves real-time (30 fps) intra coding speed up to 1080p resolution. Compared to AVX2-optimized ultrafast preset of x265, Kvazaar is 20% times faster and still obtains 9.1% bit rate gain for the same quality. These results justify that Kvazaar is currently the leading open-source HEVC intra encoder in terms of real-time coding speed and efficiency.
    Original languageEnglish
    Title of host publication2016 IEEE International Conference on Image Processing (ICIP)
    PublisherIEEE
    Pages549-553
    ISBN (Electronic)978-1-4673-9961-6
    DOIs
    Publication statusPublished - 2016
    Publication typeA4 Article in a conference publication
    EventIEEE International Conference on Image Processing -
    Duration: 1 Jan 1900 → …

    Publication series

    Name
    ISSN (Electronic)2381-8549

    Conference

    ConferenceIEEE International Conference on Image Processing
    Period1/01/00 → …

    Publication forum classification

    • Publication forum level 1

    Cite this