资源论文Bayesian Nonparametric Feature Construction for Inverse Reinforcement Learning

Bayesian Nonparametric Feature Construction for Inverse Reinforcement Learning

2019-11-11 | |  81 |   35 |   0
Abstract Most of the algorithms for inverse reinforcement learning (IRL) assume that the reward function is a linear function of the pre-defined state and action features. However, it is often difficult to manually specify the set of features that can make the true reward function representable as a linear function. We propose a Bayesian nonparametric approach to identifying useful composite features for learning the reward function. The composite features are assumed to be the logical conjunctions of the predefined atomic features so that we can represent the reward function as a linear function of the composite features. We empirically show that our approach is able to learn composite features that capture important aspects of the reward function on synthetic domains, and predict taxi drivers’ behaviour with high accuracy on a real GPS trace dataset.

上一篇:Domain Adaptation with Topical Correspondence Learning Zheng Chen and Weixiong Zhang

下一篇:A Lossy Counting Based Approach for Learning on Streams of Graphs on a Budget

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...