Constructing Skill Trees for Reinforcement Learning Agents from Demonstration Trajectories

2020-01-08

Abstract
We introduce CST, an algorithm for constructing skill trees from demonstration trajectories in continuous reinforcement learning domains. CST uses a changepoint detection method to segment each trajectory into a skill chain, placing a boundary where the appropriate abstraction changes or where a segment is too complex to model as a single skill. The skill chains from each trajectory are then merged to form a skill tree. We demonstrate that CST constructs an appropriate skill tree that can be further refined through learning in a challenging continuous domain, and that it can be used to segment demonstration trajectories on a mobile manipulator into chains of skills where each skill is assigned an appropriate abstraction.
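To make the segmentation idea concrete, here is a minimal sketch of changepoint detection on a one-dimensional signal. It is not CST itself: the abstract does not specify the detection method, so the sketch swaps in a generic penalized least-squares partitioning, where each segment is modeled by a straight line and a new segment is opened only when the drop in fitting error outweighs a fixed penalty. The function names, the `penalty` and `min_len` parameters, and the linear per-segment model are all assumptions made for illustration. Abstraction selection and the merging of chains into a tree are not covered.

```python
import numpy as np

def segment_cost(t, y):
    """Sum of squared residuals of a least-squares line fitted to (t, y)."""
    if len(t) < 2:
        return 0.0
    A = np.column_stack([t, np.ones_like(t)])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    return float(resid @ resid)

def find_changepoints(y, penalty=1.0, min_len=3):
    """Hypothetical helper: penalized optimal partitioning by dynamic programming.

    Returns the sorted indices at which a new segment starts (index 0 excluded).
    This stands in for CST's changepoint detector and is not the authors' method.
    """
    y = np.asarray(y, dtype=float)
    n = len(y)
    t = np.arange(n, dtype=float)
    best = np.full(n + 1, np.inf)   # best[e]: minimal cost of segmenting y[:e]
    best[0] = -penalty              # so that the first segment carries no penalty
    prev = np.zeros(n + 1, dtype=int)
    for end in range(min_len, n + 1):
        for start in range(0, end - min_len + 1):
            if not np.isfinite(best[start]):
                continue            # y[:start] cannot be segmented validly
            c = best[start] + segment_cost(t[start:end], y[start:end]) + penalty
            if c < best[end]:
                best[end] = c
                prev[end] = start
    # Walk the back-pointers from the end to recover the segment boundaries.
    cps = []
    end = n
    while end > 0:
        start = int(prev[end])
        if start > 0:
            cps.append(start)
        end = start
    return sorted(cps)

# A toy "trajectory": a rising ramp followed by an abrupt jump and a falling ramp.
signal = np.concatenate([np.arange(20.0), 50.0 - 2.0 * np.arange(20.0)])
changepoints = find_changepoints(signal)
```

Each returned index marks where one segment ends and the next begins, so a trajectory with k changepoints yields a chain of k + 1 segments. The double loop makes this an O(n²) offline procedure, which is fine for short illustrative signals but would need pruning or an online formulation for long trajectories.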
