OVE6D: Object Viewpoint Encoding for Depth-based 6D Object Pose Estimation

Dingding Cai, Janne Heikkilä, Esa Rahtu

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

21 Citations (Scopus)
8 Downloads (Pure)

Abstract

This paper proposes a universal framework, called OVE6D, for model-based 6D object pose estimation from a single depth image and a target object mask. Our model is trained using purely synthetic data rendered from ShapeNet, and, unlike most of the existing methods, it generalizes well on new real-world objects without any fine-tuning. We achieve this by decomposing the 6D pose into viewpoint, in-plane rotation around the camera optical axis and translation, and introducing novel lightweight modules for estimating each component in a cascaded manner. The resulting network contains less than 4M parameters while demon-strating excellent performance on the challenging T-LESS and Occluded LINEMOD datasets without any dataset-specific training. We show that OVE6D outperforms some contemporary deep learning-based pose estimation methods specifically trained for individual objects or datasets with real-world training data. The implementation is available at https://github.com/dingdingcai/OVE6D-pose.

Original languageEnglish
Title of host publication2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
PublisherIEEE
Pages6793-6803
Number of pages11
ISBN (Electronic)9781665469463
DOIs
Publication statusPublished - 2022
Publication typeA4 Article in conference proceedings
EventIEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) - New Orleans, United States
Duration: 18 Jun 202224 Jun 2022

Publication series

NameIEEE Computer Society Conference on Computer Vision and Pattern Recognition
ISSN (Print)1063-6919
ISSN (Electronic)2575-7075

Conference

ConferenceIEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Country/TerritoryUnited States
CityNew Orleans
Period18/06/2224/06/22

Keywords

  • categorization
  • Pose estimation and tracking
  • Recognition: detection
  • retrieval
  • RGBD sensors and analytics

Publication forum classification

  • Publication forum level 2

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'OVE6D: Object Viewpoint Encoding for Depth-based 6D Object Pose Estimation'. Together they form a unique fingerprint.

Cite this