Abstract
In this paper, we propose an encoder-decoder convolutional neural network (CNN) architecture for estimating camera pose (orientation and location) from a single RGB-image. The architecture has a hourglass shape consisting of a chain of convolution and up-convolution layers followed by a regression part. The up-convolution layers are introduced to preserve the fine-grained information of the input image. Following the common practice, we train our model in end-to-end manner utilizing transfer learning from large scale classification data. The experiments demonstrate the performance of the approach on data exhibiting different lighting conditions, reflections, and motion blur The results indicate a clear improvement over the previous state-of-the-art even when compared to methods that utilize sequence of test frames instead of a single frame.
| Original language | English |
|---|---|
| Title of host publication | 2017 IEEE International Conference on Computer Vision Workshops, ICCVW 2017 |
| Publisher | IEEE |
| Pages | 870-877 |
| Number of pages | 8 |
| ISBN (Electronic) | 9781538610343 |
| DOIs | |
| Publication status | Published - 19 Jan 2018 |
| Publication type | A4 Article in conference proceedings |
| Event | IEEE International Conference on Computer Vision Workshops - Duration: 1 Jan 1900 → … |
Conference
| Conference | IEEE International Conference on Computer Vision Workshops |
|---|---|
| Period | 1/01/00 → … |
Publication forum classification
- Publication forum level 1
ASJC Scopus subject areas
- Computer Science Applications
- Computer Vision and Pattern Recognition
Fingerprint
Dive into the research topics of 'Image-Based Localization Using Hourglass Networks'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver