Maximizing the Conditional Expected Reward for Reaching the Goal
Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review
Contributors
Abstract
The paper addresses the problem of computing maximal conditional expected accumulated rewards until reaching a target state (briefly called maximal conditional expectations) in finite-state Markov decision processes where the condition is given as a reachability constraint. Conditional expectations of this type can, e.g., stand for the maximal expected termination time of probabilistic programs with non-determinism, under the condition that the program eventually terminates, or for the worst-case expected penalty to be paid, assuming that at least three deadlines are missed. The main results of the paper are (i) a polynomial-time algorithm to check the finiteness of maximal conditional expectations, (ii) PSPACE-completeness for the threshold problem in acyclic Markov decision processes where the task is to check whether the maximal conditional expectation exceeds a given threshold, (iii) a pseudo-polynomial-time algorithm for the threshold problem in the general (cyclic) case, and (iv) an exponential-time algorithm for computing the maximal conditional expectation and an optimal scheduler.
Details
| Original language | English |
|---|---|
| Title of host publication | Tools and Algorithms for the Construction and Analysis of Systems |
| Editors | Axel Legay, Tiziana Margaria |
| Publisher | Springer, Berlin [u. a.] |
| Pages | 269–285 |
| ISBN (print) | 978-3-662-54579-9 |
| Publication status | Published - 2017 |
| Peer-reviewed | Yes |
Publication series
| Series | Lecture Notes in Computer Science, Volume 10206 |
|---|---|
| ISSN | 0302-9743 |
Conference
| Title | 23rd International Conference on Tools and Algorithms for the Construction and Analysis of Systems |
|---|---|
| Abbreviated title | TACAS 2017 |
| Conference number | |
| Duration | 22 - 29 April 2017 |
| Degree of recognition | International event |
| Location | |
| City | Uppsala |
| Country | Sweden |
External IDs
| ORCID | /0000-0002-5321-9343/work/142236722 |
|---|---|
| Scopus | 85017515216 |
| ORCID | /0000-0003-1724-2586/work/165453592 |
Keywords
Keywords
- Conditional Expected Reward, Goal State, Markov Decision Process, Threshold Algorithm, Maximal Path