The variance-penalized stochastic shortest path problem
Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/Gutachten › Beitrag in Konferenzband › Beigetragen › Begutachtung
Beitragende
Abstract
The stochastic shortest path problem (SSPP) asks to resolve the non-deterministic choices in a Markov decision process (MDP) such that the expected accumulated weight before reaching a target state is maximized. This paper addresses the optimization of the variance-penalized expectation (VPE) of the accumulated weight, which is a variant of the SSPP in which a multiple of the variance of accumulated weights is incurred as a penalty. It is shown that the optimal VPE in MDPs with non-negative weights as well as an optimal deterministic finite-memory scheduler can be computed in exponential space. The threshold problem whether the maximal VPE exceeds a given rational is shown to be EXPTIME-hard and to lie in NEXPTIME. Furthermore, a result of interest in its own right obtained on the way is that a variance-minimal scheduler among all expectation-optimal schedulers can be computed in polynomial time.
Details
Originalsprache | Englisch |
---|---|
Titel | 49th EATCS International Conference on Automata, Languages, and Programming, ICALP 2022 |
Redakteure/-innen | Mikołaj Bojańczyk, Emanuela Merelli, David P. Woodruff |
Herausgeber (Verlag) | Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing |
Seiten | 129:1–129:19 |
Seitenumfang | 19 |
ISBN (elektronisch) | 9783959772358 |
ISBN (Print) | 978-3-95977-235-8 |
Publikationsstatus | Veröffentlicht - 1 Juli 2022 |
Peer-Review-Status | Ja |
Publikationsreihe
Reihe | 49th EATCS International Conference on Automata, Languages, and Programming (ICALP 2022) ; Vol. 229 |
---|---|
Band | 229 |
ISSN | 1868-8969 |
Konferenz
Titel | 49th EATCS International Colloquium on Automata, Languages and Programming |
---|---|
Kurztitel | ICALP 2022 |
Dauer | 4 - 8 Juli 2022 |
Bekanntheitsgrad | Internationale Veranstaltung |
Ort | hybrid |
Stadt | Paris |
Land | Frankreich |
Externe IDs
Scopus | 85133468318 |
---|---|
dblp | conf/icalp/PiribauerSB22 |
Mendeley | 4b556c64-0cd5-3b56-a6aa-bf19263a8017 |
ORCID | /0000-0002-5321-9343/work/142236696 |
ORCID | /0000-0003-4829-0476/work/165453933 |
Schlagworte
Forschungsprofillinien der TU Dresden
DFG-Fachsystematik nach Fachkollegium
Fächergruppen, Lehr- und Forschungsbereiche, Fachgebiete nach Destatis
ASJC Scopus Sachgebiete
Schlagwörter
- Markov decision process, variance, stochastic shortest path problem