Abstrakti
The amount of digital media created annually has recently increased exponentially due to the ubiquitousness of digital cameras. This increase has also made manual analysis of the data impossible, hence, computer vision algorithms have been adopted to automatically analyze and extract meaningful information. One particular field of computer vision is generic visual object tracking, where the location of the target is only known in the first frame and the algorithm tries to detect its location in the following frames. Most of the existing literature has focused on tracking using traditional RGB cameras however, lately, cheap and reliable depth sensors have become easily accessible. This has opened up new possibilities for many computer vision algorithms including object tracking.
Depth information allows algorithms to leverage 3D understanding of the surroundings. Adding this new dimension can be particularly helpful in tracking where occlusions and shape changes are common. This thesis, focuses on generic visual object tracking on RGBD cameras and contributes to the field in two different ways. Firstly, it develops novel object tracking algorithms using RGBD data. For achieving this, it builds on top of elegant yet powerful fundamentals of spatially constrained correlation filters. To the best of our knowledge, this is the first attempt to use depth inherently in a correlation filters framework. Additionally, it also takes advantage of depth data for detecting self and external occlusions. This allows dynamic adjustment of filter updates to avoid model corruption. Evaluations on publicly available benchmarks validate the proposed algorithms by showing that they achieve state of the art results.
Second contribution of this thesis is the creation of a novel RGBD tracking dataset. This dataset addresses the shortcomings of previous RGBD benchmarks and provides a new challenge to facilitate new RGBD trackers.
Depth information allows algorithms to leverage 3D understanding of the surroundings. Adding this new dimension can be particularly helpful in tracking where occlusions and shape changes are common. This thesis, focuses on generic visual object tracking on RGBD cameras and contributes to the field in two different ways. Firstly, it develops novel object tracking algorithms using RGBD data. For achieving this, it builds on top of elegant yet powerful fundamentals of spatially constrained correlation filters. To the best of our knowledge, this is the first attempt to use depth inherently in a correlation filters framework. Additionally, it also takes advantage of depth data for detecting self and external occlusions. This allows dynamic adjustment of filter updates to avoid model corruption. Evaluations on publicly available benchmarks validate the proposed algorithms by showing that they achieve state of the art results.
Second contribution of this thesis is the creation of a novel RGBD tracking dataset. This dataset addresses the shortcomings of previous RGBD benchmarks and provides a new challenge to facilitate new RGBD trackers.
Alkuperäiskieli | Englanti |
---|---|
Julkaisupaikka | Tampere |
Kustantaja | Tampere University |
ISBN (elektroninen) | 978-952-03-2058-4 |
ISBN (painettu) | 978-952-03-2057-7 |
Tila | Julkaistu - 2021 |
OKM-julkaisutyyppi | G5 Artikkeliväitöskirja |
Julkaisusarja
Nimi | Tampere University Dissertations - Tampereen yliopiston väitöskirjat |
---|---|
Vuosikerta | 452 |
ISSN (painettu) | 2489-9860 |
ISSN (elektroninen) | 2490-0028 |