Active Imitation Learning of Hierarchical Policies

资源分类

2019-11-20 |

63 |

37 |

Abstract In this paper, we study the problem of imitation learning of hierarchical policies from demonstrations. The main dif?culty in learning hierarchical policies by imitation is that the high level intention structure of the policy, which is often critical for understanding the demonstration, is unobserved. We formulate this problem as active learning of Probabilistic State-Dependent Grammars (PSDGs) from demonstrations. Given a set of expert demonstrations, our approach learns a hierarchical policy by actively selecting demonstrations and using queries to explicate their intentional structure at selected points. Our contributions include a new algorithm for imitation learning of hierarchical policies and principled heuristics for the selection of demonstrations and queries. Experimental results in ?ve different domains exhibit successful learning using fewer queries than a variety of alternatives.

上一篇：Robust Subspace Segmentation by Simultaneously Learning Data Representations and Their Affinity Matrix

下一篇：Identification of Time-Dependent Causal Model: A Gaussian Process Treatment

用户评价

全部评价

还没有评论，说两句吧！

热门资源

Learning to Predi...

Much of model-based reinforcement learning invo...
Stratified Strate...

In this paper we introduce Stratified Strategy ...
The Variational S...

Unlike traditional images which do not offer in...
A Mathematical Mo...

Direct democracy, where each voter casts one vo...
Rating-Boosted La...

The performance of a recommendation system reli...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com