Abstract
Sequential decision making under uncertainty is studied in a mixed observability domain. The goal is to maximize the amount of information obtained on a partially observable stochastic process under constraints imposed by a fully observable internal state. An upper bound for the optimal value function is derived by relaxing constraints. We identify conditions under which the relaxed problem is a multi-armed bandit whose optimal policy is easily computable. The upper bound is applied to prune the search space in the original problem, and the effect on solution quality is assessed via simulation experiments. Empirical results show effective pruning of the search space in a target monitoring domain.
| Original language | English |
|---|---|
| Title of host publication | 2015 IEEE International Conference on Robotics and Automation (ICRA), 26-30 May 2015, Seattle, WA |
| Pages | 4807-4812 |
| Number of pages | 6 |
| Volume | 2015-June |
| DOIs | |
| Publication status | Published - 29 Jun 2015 |
| Publication type | A4 Article in conference proceedings |
| Event | IEEE International Conference on Robotics and Automation - Duration: 1 Jan 1900 → 1 Jan 2000 |
Conference
| Conference | IEEE International Conference on Robotics and Automation |
|---|---|
| Period | 1/01/00 → 1/01/00 |
Publication forum classification
- Publication forum level 1
ASJC Scopus subject areas
- Software
- Artificial Intelligence
- Control and Systems Engineering
- Electrical and Electronic Engineering
Fingerprint
Dive into the research topics of 'Optimal sensing via multi-armed bandit relaxations in mixed observability domains'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver