Maximizing the Conditional Expected Reward for Reaching the Goal

Christel Baier; Joachim Klein; Sascha Klüppelholz; Sascha Wunderlich

doi:10.1007/978-3-662-54580-5_16

Maximizing the Conditional Expected Reward for Reaching the Goal

Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review

Contributors

Christel Baier - , Chair of Algebraic and Logical Foundations of Computer Science (Author)
Joachim Klein - , Chair of Algebraic and Logical Foundations of Computer Science (Author)
Sascha Klüppelholz - , Chair of Algebraic and Logical Foundations of Computer Science (Author)
Sascha Wunderlich - , Chair of Algebraic and Logical Foundations of Computer Science (Author)

Abstract

The paper addresses the problem of computing maximal conditional expected accumulated rewards until reaching a target state (briefly called maximal conditional expectations) in finite-state Markov decision processes where the condition is given as a reachability constraint. Conditional expectations of this type can, e.g., stand for the maximal expected termination time of probabilistic programs with non-determinism, under the condition that the program eventually terminates, or for the worst-case expected penalty to be paid, assuming that at least three deadlines are missed. The main results of the paper are (i) a polynomial-time algorithm to check the finiteness of maximal conditional expectations, (ii) PSPACE-completeness for the threshold problem in acyclic Markov decision processes where the task is to check whether the maximal conditional expectation exceeds a given threshold, (iii) a pseudo-polynomial-time algorithm for the threshold problem in the general (cyclic) case, and (iv) an exponential-time algorithm for computing the maximal conditional expectation and an optimal scheduler.

Details

Original language	English
Title of host publication	Tools and Algorithms for the Construction and Analysis of Systems
Editors	Axel Legay, Tiziana Margaria
Publisher	Springer, Berlin [u. a.]
Pages	269–285
ISBN (print)	978-3-662-54579-9
Publication status	Published - 2017
Peer-reviewed	Yes

Publication series

Series	Lecture Notes in Computer Science, Volume 10206
ISSN	0302-9743

Conference

Title	23rd International Conference on Tools and Algorithms for the Construction and Analysis of Systems
Abbreviated title	TACAS 2017
Conference number
Duration	22 - 29 April 2017
Degree of recognition	International event
Location
City	Uppsala
Country	Sweden

External IDs

ORCID	/0000-0002-5321-9343/work/142236722
Scopus	85017515216
ORCID	/0000-0003-1724-2586/work/165453592

Keywords

Conditional Expected Reward, Goal State, Markov Decision Process, Threshold Algorithm, Maximal Path

Library keywords

004 Computer science

Research Portal of the TU Dresden

Maximizing the Conditional Expected Reward for Reaching the Goal

Contributors

Abstract

Details

Publication series

Conference

External IDs

Keywords

Keywords

Library keywords

Related content

Maximizing the Conditional Expected Reward for Reaching the Goal