Meta Reinforcement Learning with Task Embedding and Shared Policy

资源分类

2019-10-09 |

160 |

62 |

Abstract Despite signifificant progress, deep reinforcement learning (RL) suffers from data-ineffificiency and limited generalization. Recent efforts apply metalearning to learn a meta-learner from a set of RL tasks such that a novel but related task could be solved quickly. Though specifific in some ways, different tasks in meta-RL are generally similar at a high level. However, most meta-RL methods do not explicitly and adequately model the specifific and shared information among different tasks, which limits their ability to learn training tasks and to generalize to novel tasks. In this paper, we propose to capture the shared information on the one hand and meta-learn how to quickly abstract the specifific information about a task on the other hand. Methodologically, we train an SGD meta-learner to quickly optimize a task encoder for each task, which generates a task embedding based on past experience. Meanwhile, we learn a policy which is shared across all tasks and conditioned on task embeddings. Empirical results1 on four simulated tasks demonstrate that our method has better learning capacity on both training and novel tasks and attains up to 3 to 4 times higher returns compared to baselines

上一篇：Incremental Learning of Planning Actions in Model-Based Reinforcement Learning

下一篇：Playing FPS Games With Environment-Aware Hierarchical Reinforcement Learning

用户评价

全部评价

还没有评论，说两句吧！

热门资源

The Variational S...

Unlike traditional images which do not offer in...
Learning to Predi...

Much of model-based reinforcement learning invo...
Stratified Strate...

In this paper we introduce Stratified Strategy ...
A Mathematical Mo...

Direct democracy, where each voter casts one vo...
Rating-Boosted La...

The performance of a recommendation system reli...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com