资源论文EXPLAIN YOUR MOVE :U NDERSTANDING AGENT AC -TIONS USING FOCUSED FEATURE SALIENCY

EXPLAIN YOUR MOVE :U NDERSTANDING AGENT AC -TIONS USING FOCUSED FEATURE SALIENCY

2020-01-02 | |  58 |   38 |   0

Abstract

As deep reinforcement learning (RL) is applied to more tasks, there is a need to visualize and understand the behavior of learned agents. Saliency maps explain agent behavior by highlighting the features of the input state that are most relevant for the agent in taking an action. Existing perturbation-based approaches to compute saliency often highlight regions of the input that are not relevant to the action taken by the agent. Our approach generates more focused saliency maps by balancing two aspects (specificity and relevance) that capture different desiderata of saliency. The first captures the impact of perturbation on the relative expected reward of the action to be explained. The second downweights irrelevant features that alter the relative expected rewards of actions other than the action to be explained. We compare our approach with existing approaches on agents trained to play board games (Chess and Go) and Atari games (Breakout, Pong and Space Invaders). We show through illustrative examples (Chess, Atari, Go), human studies (Chess), and automated evaluation methods (Chess) that our approach generates saliency maps that are more interpretable for humans than existing approaches.

上一篇:JACOBIAN ADVERSARIALLY REGULARIZEDN ETWORKS FOR ROBUSTNESS

下一篇:ADVERSARIALLY ROBUST REPRESENTATIONS WITHS MOOTH ENCODERS

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...