Probabilistic goal Markov decision processes

Huan Xu, Shie Mannor

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The Markov decision process model is a powerful tool in planing tasks and sequential decision making problems. The randomness of state transitions and rewards implies that the performance of a policy is often stochastic. In contrast to the standard approach that studies the expected performance, we consider the policy that maximizes the probability of achieving a pre-determined target performance, a criterion we term probabilistic goal Markov decision processes. We show that this problem is NP-hard, but can be solved using a pseudo-polynomial algorithm. We further consider a variant dubbed "chance-constraint Markov decision problems," that treats the probability of achieving target performance as a constraint instead of the maximizing objective. This variant is NP-hard, but can be solved in pseudo-polynomial time.

Original languageEnglish
Title of host publicationIJCAI 2011 - 22nd International Joint Conference on Artificial Intelligence
Pages2046-2052
Number of pages7
DOIs
StatePublished - 2011
Event22nd International Joint Conference on Artificial Intelligence, IJCAI 2011 - Barcelona, Catalonia, Spain
Duration: 16 Jul 201122 Jul 2011

Publication series

NameIJCAI International Joint Conference on Artificial Intelligence

Conference

Conference22nd International Joint Conference on Artificial Intelligence, IJCAI 2011
Country/TerritorySpain
CityBarcelona, Catalonia
Period16/07/1122/07/11

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Probabilistic goal Markov decision processes'. Together they form a unique fingerprint.

Cite this