资源论文LEARNING TO PLAN IN HIGH DIMENSIONS VIAN EURAL EXPLORATION -E XPLOITATION TREES

LEARNING TO PLAN IN HIGH DIMENSIONS VIAN EURAL EXPLORATION -E XPLOITATION TREES

2020-01-02 | |  77 |   46 |   0

Abstract

We propose a meta path planning algorithm named Neural Exploration-Exploitation Trees (NEXT) for learning from prior experience for solving new path planning problems in high dimensional continuous state and action spaces. Compared to more classical sampling-based methods like RRT, our approach achieves much better sample efficiency in high-dimensions and can benefit from prior experience of planning in similar environments. More specifically, NEXT exploits a novel neural architecture which can learn promising search directions from problem structures. The learned prior is then integrated into a UCB-type algorithm to achieve an online balance between exploration and exploitation when solving a new problem. We conduct thorough experiments to show that NEXT accomplishes new planning problems with more compact search trees and significantly outperforms state-of-the-art methods on several benchmarks.

上一篇:HARNESSING THE POWER OF INFINITELY WIDE DEEPN ETS ON SMALL -DATA TASKS

下一篇:THE FUNCTION OF CONTEXTUALILLUSIONS RECURRENT NEURAL CIRCUITS FOR CONTOUR DETECTION

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...