Maximizing the Conditional Expected Reward for Reaching the Goal

Christel Baier; Joachim Klein; Sascha Klüppelholz; Sascha Wunderlich

doi:10.1007/978-3-662-54580-5_16

Maximizing the Conditional Expected Reward for Reaching the Goal

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/Gutachten › Beitrag in Konferenzband › Beigetragen › Begutachtung

Beitragende

Christel Baier - , Professur für Algebraische und logische Grundlagen der Informatik (Autor:in)
Joachim Klein - , Professur für Algebraische und logische Grundlagen der Informatik (Autor:in)
Sascha Klüppelholz - , Professur für Algebraische und logische Grundlagen der Informatik (Autor:in)
Sascha Wunderlich - , Professur für Algebraische und logische Grundlagen der Informatik (Autor:in)

Abstract

The paper addresses the problem of computing maximal conditional expected accumulated rewards until reaching a target state (briefly called maximal conditional expectations) in finite-state Markov decision processes where the condition is given as a reachability constraint. Conditional expectations of this type can, e.g., stand for the maximal expected termination time of probabilistic programs with non-determinism, under the condition that the program eventually terminates, or for the worst-case expected penalty to be paid, assuming that at least three deadlines are missed. The main results of the paper are (i) a polynomial-time algorithm to check the finiteness of maximal conditional expectations, (ii) PSPACE-completeness for the threshold problem in acyclic Markov decision processes where the task is to check whether the maximal conditional expectation exceeds a given threshold, (iii) a pseudo-polynomial-time algorithm for the threshold problem in the general (cyclic) case, and (iv) an exponential-time algorithm for computing the maximal conditional expectation and an optimal scheduler.

Details

Originalsprache	Englisch
Titel	Tools and Algorithms for the Construction and Analysis of Systems
Redakteure/-innen	Axel Legay, Tiziana Margaria
Herausgeber (Verlag)	Springer, Berlin [u. a.]
Seiten	269–285
ISBN (Print)	978-3-662-54579-9
Publikationsstatus	Veröffentlicht - 2017
Peer-Review-Status	Ja

Publikationsreihe

Reihe	Lecture Notes in Computer Science, Volume 10206
ISSN	0302-9743

Konferenz

Titel	23rd International Conference on Tools and Algorithms for the Construction and Analysis of Systems
Kurztitel	TACAS 2017
Veranstaltungsnummer
Dauer	22 - 29 April 2017
Bekanntheitsgrad	Internationale Veranstaltung
Ort
Stadt	Uppsala
Land	Schweden

Externe IDs

ORCID	/0000-0002-5321-9343/work/142236722
Scopus	85017515216
ORCID	/0000-0003-1724-2586/work/165453592

Schlagworte

Schlagwörter

Conditional Expected Reward, Goal State, Markov Decision Process, Threshold Algorithm, Maximal Path

Bibliotheksschlagworte

004 Informatik

Forschungsportal der TU Dresden

Maximizing the Conditional Expected Reward for Reaching the Goal

Beitragende

Abstract

Details

Publikationsreihe

Konferenz

Externe IDs

Schlagworte

Schlagwörter

Bibliotheksschlagworte

Verknüpfte Inhalte

Maximizing the Conditional Expected Reward for Reaching the Goal