资源论文Computational Approaches for Stochastic Shortest Path on Succinct MDPs

Computational Approaches for Stochastic Shortest Path on Succinct MDPs

2019-11-05 | |  79 |   46 |   0
Abstract We consider the stochastic shortest path (SSP) problem for succinct Markov decision processes (MDPs), where the MDP consists of a set of variables, and a set of nondeterministic rules that update the variables. First, we show that several examples from the AI literature can be modeled as succinct MDPs. Then we present computational approaches for upper and lower bounds for the SSP problem: (a) for computing upper bounds, our method is polynomial-time in the implicit description of the MDP; (b) for lower bounds, we present a polynomial-time (in the size of the implicit description) reduction to quadratic programming. Our approach is applicable even to infinite-state MDPs. Finally, we present experimental results to demonstrate the effectiveness of our approach on several classical examples from the AI literature.‡

上一篇:Expectation Optimization with Probabilistic Guarantees in POMDPs with Discounted-sum Objectives?

下一篇:Local Minima, Heavy Tails, and Search Effort for GBFS Eldan Cohen and J. Christopher Beck

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...