The variance-penalized stochastic shortest path problem
Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review
Contributors
Abstract
The stochastic shortest path problem (SSPP) asks to resolve the non-deterministic choices in a Markov decision process (MDP) such that the expected accumulated weight before reaching a target state is maximized. This paper addresses the optimization of the variance-penalized expectation (VPE) of the accumulated weight, which is a variant of the SSPP in which a multiple of the variance of accumulated weights is incurred as a penalty. It is shown that the optimal VPE in MDPs with non-negative weights as well as an optimal deterministic finite-memory scheduler can be computed in exponential space. The threshold problem whether the maximal VPE exceeds a given rational is shown to be EXPTIME-hard and to lie in NEXPTIME. Furthermore, a result of interest in its own right obtained on the way is that a variance-minimal scheduler among all expectation-optimal schedulers can be computed in polynomial time.
Details
| Original language | English |
|---|---|
| Title of host publication | 49th EATCS International Conference on Automata, Languages, and Programming, ICALP 2022 |
| Editors | Mikołaj Bojańczyk, Emanuela Merelli, David P. Woodruff |
| Publisher | Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing |
| Pages | 129:1–129:19 |
| Number of pages | 19 |
| ISBN (electronic) | 9783959772358 |
| ISBN (print) | 978-3-95977-235-8 |
| Publication status | Published - 1 Jul 2022 |
| Peer-reviewed | Yes |
Publication series
| Series | 49th EATCS International Conference on Automata, Languages, and Programming (ICALP 2022) ; Vol. 229 |
|---|---|
| Volume | 229 |
| ISSN | 1868-8969 |
Conference
| Title | 49th EATCS International Colloquium on Automata, Languages and Programming |
|---|---|
| Abbreviated title | ICALP 2022 |
| Conference number | 49 |
| Duration | 4 - 8 July 2022 |
| Website | |
| Degree of recognition | International event |
| Location | Université de Paris & Online |
| City | Paris |
| Country | France |
External IDs
| Scopus | 85133468318 |
|---|---|
| dblp | conf/icalp/PiribauerSB22 |
| Mendeley | 4b556c64-0cd5-3b56-a6aa-bf19263a8017 |
| ORCID | /0000-0002-5321-9343/work/142236696 |
| ORCID | /0000-0003-4829-0476/work/165453933 |
Keywords
Research priority areas of TU Dresden
DFG Classification of Subject Areas according to Review Boards
Subject groups, research areas, subject areas according to Destatis
ASJC Scopus subject areas
Keywords
- Markov decision process, variance, stochastic shortest path problem