资源论文Path Integral Control by Reproducing Kernel Hilbert Space Embedding Konrad Rawlik Marc Toussaint Sethu Vijayakumar

Path Integral Control by Reproducing Kernel Hilbert Space Embedding Konrad Rawlik Marc Toussaint Sethu Vijayakumar

2019-11-11 | |  42 |   31 |   0
Abstract We present an embedding of stochastic optimal control problems, of the so called path integral form, into reproducing kernel Hilbert spaces. Using consistent, sample based estimates of the embedding leads to a model-free, non-parametric approach for calculation of an approximate solution to the control problem. This formulation admits a decomposition of the problem into an invariant and task dependent component. Consequently, we make much more efficient use of the sample data compared to previous sample based approaches in this domain, e.g., by allowing sample re-use across tasks. Numerical examples on test problems, which illustrate the sample efficiency, are provided.

上一篇:Active Learning from Relative Queries

下一篇:Machine-Learning-Based Circuit Synthesis Lior Rokach1 and Meir Kalech1 and Gregory Provan2 and Alexander Feldman2

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...