Abstract
In this paper, we consider online planning in par-tially observable domains. Solving the corresponding POMDP problem is a very challenging task, particularly in an online setting. Our key contribution is a novel algorithmic approach, Simplified Information Theoretic Belief Space Planning (SITH-BSP), which aims to speed up POMDP planning considering belief-dependent rewards, without compromising the solution's accuracy. We do so by mathematically relating the simplified el-ements of the problem to the corresponding counterparts of the original problem. Specifically, we focus on belief simplification and use it to formulate bounds on the corresponding original belief-dependent rewards. These bounds in turn are used to perform branch pruning over the belief tree, in the process of extracting the optimal policy from this existing belief tree. We further introduce the notion of adaptive simplification, while re-using calculations between different simplification levels, and exploit it to prune, at each level in the belief tree, all branches but one. Therefore, our approach is guaranteed to find the optimal solution (policy) that corresponds to the given belief tree but with substantial speedup. As a second key contribution, we derive novel analytical bounds for differential entropy, considering a sampling-based belief representation, which we believe are of interest on their own. We validate our approach in simulation using these bounds and where simplification corresponds to reducing the number of samples, exhibiting a significant computational speedup while yielding the optimal solution for the given belief tree.
| Original language | English |
|---|---|
| Title of host publication | 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2022 |
| Pages | 7174-7181 |
| Number of pages | 8 |
| ISBN (Electronic) | 9781665479271 |
| DOIs | |
| State | Published - 2022 |
| Event | 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2022 - Kyoto, Japan Duration: 23 Oct 2022 → 27 Oct 2022 |
Conference
| Conference | 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2022 |
|---|---|
| Country/Territory | Japan |
| City | Kyoto |
| Period | 23/10/22 → 27/10/22 |
ASJC Scopus subject areas
- Control and Systems Engineering
- Software
- Computer Vision and Pattern Recognition
- Computer Science Applications
Fingerprint
Dive into the research topics of 'Speeding up POMDP Planning via Simplification'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver