Object Detection in Equirectangular Panorama

Wenyan Yang, Yanlin Qian, Joni-Kristian Kämäräinen, Francesco Cricri, Lixin Fan

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

    87 Citations (Scopus)

    Abstract

    We introduce a high-resolution equirectangular panorama (aka 360-degree, virtual reality, VR) dataset for object detection and propose a multi-projection variant of the YOLO detector. The main challenges with equirectangular panorama images are i) the lack of annotated training data, ii) high-resolution imagery and iii) severe geometric distortions of objects near the panorama projection poles. In this work, we solve the challenges by I) using training examples available in the “conventional datasets” (ImageNet and COCO), II) employing only low resolution images that require only moderate GPU computing power and memory, and III) our multi-projection YOLO handles projection distortions by making multiple stereographic sub-projections. In our experiments, YOLO outperforms the other state-of-the-art detector, Faster R-CNN, and our multi-projection YOLO achieves the best accuracy with low-resolution input.
    Original languageEnglish
    Title of host publication2018 24th International Conference on Pattern Recognition (ICPR)
    PublisherIEEE
    Pages2190-2195
    Number of pages6
    ISBN (Electronic)978-1-5386-3788-3
    ISBN (Print)978-1-5386-3789-0
    DOIs
    Publication statusPublished - Aug 2018
    Publication typeA4 Article in conference proceedings
    EventInternational Conference on Pattern Recognition -
    Duration: 1 Jan 1900 → …

    Publication series

    Name
    ISSN (Print)1051-4651

    Conference

    ConferenceInternational Conference on Pattern Recognition
    Period1/01/00 → …

    Keywords

    • image resolution
    • neural nets
    • object detection
    • virtual reality
    • YOLO detector
    • equirectangular panorama images
    • annotated training data
    • high-resolution imagery
    • panorama projection poles
    • training examples
    • conventional datasets
    • low resolution images
    • multiprojection YOLO
    • projection distortions
    • multiple stereographic sub-projections
    • low-resolution input
    • high-resolution equirectangular panorama dataset
    • geometric distortions
    • moderate GPU computing power
    • CNN
    • Detectors
    • Distortion
    • Object detection
    • Cameras
    • Image resolution
    • Virtual reality
    • Graphics processing units

    Publication forum classification

    • Publication forum level 1

    Fingerprint

    Dive into the research topics of 'Object Detection in Equirectangular Panorama'. Together they form a unique fingerprint.

    Cite this