Implicit Quantile Networks for Distributional Reinforcement Learning

2020-03-16

Abstract

In this work, we build on recent advances in distributional reinforcement learning to give a generally applicable, flexible, and state-of-the-art distributional variant of DQN. We achieve this by using quantile regression to approximate the full quantile function for the state-action return distribution. By reparameterizing a distribution over the sample space, this yields an implicitly defined return distribution and gives rise to a large class of risk-sensitive policies. We demonstrate improved performance on the 57 Atari 2600 games in the ALE, and use our algorithm's implicitly defined distributions to study the effects of risk-sensitive policies in Atari games.
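The two ideas in the abstract, quantile regression and sampling from an implicitly defined return distribution, can be illustrated with a small sketch. This is not the paper's implementation: the abstract does not specify the network or the exact loss, so the code below assumes a plain pinball (quantile regression) loss and uses a hypothetical `empirical_quantile` lookup over a sorted sample as a stand-in for a learned quantile function. The names `distorted_value`, `risk_neutral`, and `cvar` are likewise illustrative. Drawing tau uniformly from [0, 1) and passing it through the quantile function yields return samples; distorting tau before the lookup is one way a risk-sensitive value could be obtained.

```python
import random


def pinball_loss(tau, target, estimate):
    """Quantile-regression (pinball) loss for one quantile level tau in (0, 1).

    Under-estimates are weighted by tau, over-estimates by (1 - tau).
    """
    error = target - estimate
    return max(tau * error, (tau - 1.0) * error)


def empirical_quantile(sorted_returns, tau):
    """Hypothetical stand-in for a learned quantile function.

    Looks up the tau-quantile in an already sorted list of returns.
    """
    index = min(int(tau * len(sorted_returns)), len(sorted_returns) - 1)
    return sorted_returns[index]


def distorted_value(sorted_returns, distortion, num_samples=1000, seed=0):
    """Average the quantile function over distorted uniform samples of tau."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_samples):
        tau = rng.random()  # tau ~ U[0, 1)
        total += empirical_quantile(sorted_returns, distortion(tau))
    return total / num_samples


def risk_neutral(tau):
    """Identity distortion: recovers an estimate of the mean return."""
    return tau


def cvar(eta):
    """Assumed CVaR-style distortion: only the lowest eta fraction is sampled."""
    return lambda tau: eta * tau


if __name__ == "__main__":
    returns = [float(i) for i in range(100)]  # toy sorted return sample
    print("risk-neutral value:", distorted_value(returns, risk_neutral))
    print("risk-averse value: ", distorted_value(returns, cvar(0.25)))
```

With the toy sample above, the risk-averse estimate comes out lower than the risk-neutral one, because the distortion restricts sampling to the lower tail of the return distribution.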
