资源论文Linear Bayesian Reinforcement Learning Nikolaos Tziortziotis Christos Dimitrakakis Konstantinos Blekas

Linear Bayesian Reinforcement Learning Nikolaos Tziortziotis Christos Dimitrakakis Konstantinos Blekas

2019-11-11 | |  96 |   48 |   0
Abstract This paper proposes a simple linear Bayesian approach to reinforcement learning. We show that with an appropriate basis, a Bayesian linear Gaussian model is sufficient for accurately estimating the system dynamics, and in particular when we allow for correlated noise. Policies are estimated by first sampling a transition model from the current posterior, and then performing approximate dynamic programming on the sampled model. This form of approximate Thompson sampling results in good exploration in unknown environments. The approach can also be seen as a Bayesian generalisation of least-squares policy iteration, where the empirical transition matrix is replaced with a sample from the posterior.

上一篇:Non-Negative Multiple Matrix Factorization

下一篇:Multi Class Learning with Individual Sparsity Ben Zion Vatashsky and Koby Crammer

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...