Maximizing the Conditional Expected Reward for Reaching the Goal

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/GutachtenBeitrag in KonferenzbandBeigetragenBegutachtung

Abstract

The paper addresses the problem of computing maximal conditional expected accumulated rewards until reaching a target state (briefly called maximal conditional expectations) in finite-state Markov decision processes where the condition is given as a reachability constraint. Conditional expectations of this type can, e.g., stand for the maximal expected termination time of probabilistic programs with non-determinism, under the condition that the program eventually terminates, or for the worst-case expected penalty to be paid, assuming that at least three deadlines are missed. The main results of the paper are (i) a polynomial-time algorithm to check the finiteness of maximal conditional expectations, (ii) PSPACE-completeness for the threshold problem in acyclic Markov decision processes where the task is to check whether the maximal conditional expectation exceeds a given threshold, (iii) a pseudo-polynomial-time algorithm for the threshold problem in the general (cyclic) case, and (iv) an exponential-time algorithm for computing the maximal conditional expectation and an optimal scheduler.

Details

OriginalspracheEnglisch
TitelTools and Algorithms for the Construction and Analysis of Systems
Redakteure/-innenAxel Legay, Tiziana Margaria
Herausgeber (Verlag)Springer, Berlin [u. a.]
Seiten269–285
ISBN (Print)978-3-662-54579-9
PublikationsstatusVeröffentlicht - 2017
Peer-Review-StatusJa

Publikationsreihe

ReiheLecture Notes in Computer Science, Volume 10206
ISSN0302-9743

Konferenz

Titel23rd International Conference on Tools and Algorithms for the Construction and Analysis of Systems
KurztitelTACAS 2017
Veranstaltungsnummer
Dauer22 - 29 April 2017
BekanntheitsgradInternationale Veranstaltung
Ort
StadtUppsala
LandSchweden

Externe IDs

ORCID /0000-0002-5321-9343/work/142236722

Schlagworte

Schlagwörter

  • Conditional Expected Reward, Goal State, Markov Decision Process, Threshold Algorithm, Maximal Path

Bibliotheksschlagworte