资源论文Structure Learning in Ergodic Factored MDPs without Knowledge of the Transition Function’s In-Degree

Structure Learning in Ergodic Factored MDPs without Knowledge of the Transition Function’s In-Degree

2020-02-27 | |  61 |   34 |   0

Abstract

This paper introduces Learn Structure and Exploit RMax (LSE-RMax), a novel model based structure learning algorithm for ergodic factored-state MDPs. Given a planning horizon that satisfies a condition, LSE-RMax provably guarantees a return very close to the optimal return, with a high certainty, without requiring any prior knowledge of the in-degree of the transition function as input. LSE-RMax is fully implemented with a thorough analysis of its sample complexity. We also present empirical results demonstrating its effectiveness compared to prior approaches to the problem.

上一篇:Parsing Natural Scenes and Natural Language with Recursive Neural Networks

下一篇:A Co-training Approach for Multi-view Spectral Clustering

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to learn...

    The move from hand-designed features to learned...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...