Abstract
In this paper, we consider online planning in par-tially observable domains. Solving the corresponding POMDP problem is a very challenging task, particularly in an online setting. Our key contribution is a novel algorithmic approach, Simplified Information Theoretic Belief Space Planning (SITH-BSP), which aims to speed up POMDP planning considering belief-dependent rewards, without compromising the solution's accuracy. We do so by mathematically relating the simplified el-ements of the problem to the corresponding counterparts of the original problem. Specifically, we focus on belief simplification and use it to formulate bounds on the corresponding original belief-dependent rewards. These bounds in turn are used to perform branch pruning over the belief tree, in the process of extracting the optimal policy from this existing belief tree. We further introduce the notion of adaptive simplification, while re-using calculations between different simplification levels, and exploit it to prune, at each level in the belief tree, all branches but one. Therefore, our approach is guaranteed to find the optimal solution (policy) that corresponds to the given belief tree but with substantial speedup. As a second key contribution, we derive novel analytical bounds for differential entropy, considering a sampling-based belief representation, which we believe are of interest on their own. We validate our approach in simulation using these bounds and where simplification corresponds to reducing the number of samples, exhibiting a significant computational speedup while yielding the optimal solution for the given belief tree.
Original language | English |
---|---|
Title of host publication | IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2022 |
Pages | 7174-7181 |
Number of pages | 8 |
ISBN (Electronic) | 9781665479271 |
DOIs | |
State | Published - 2022 |
Event | 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2022 - Kyoto, Japan Duration: 23 Oct 2022 → 27 Oct 2022 |
Conference
Conference | 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2022 |
---|---|
Country/Territory | Japan |
City | Kyoto |
Period | 23/10/22 → 27/10/22 |
All Science Journal Classification (ASJC) codes
- Control and Systems Engineering
- Software
- Computer Vision and Pattern Recognition
- Computer Science Applications