资源论文Approximating Value Equivalence in Interactive Dynamic Influence Diagrams Using Behavioral Coverage

Approximating Value Equivalence in Interactive Dynamic Influence Diagrams Using Behavioral Coverage

2019-11-22 | |  65 |   53 |   0
Abstract Interactive dynamic influence diagrams (I-DIDs) provide an explicit way of modeling how a subject agent solves decision making problems in the presence of other agents in a common setting. To optimize its decisions, the subject agent needs to predict the other agents’ behavior, that is generally obtained by solving their candidate models. This becomes extremely difficult since the model space may be rather large, and grows when the other agents act and observe over the time. A recent proposal for solving I-DIDs lies in a concept of value equivalence (VE) that shows potential advances on significantly reducing the model space. In this paper, we establish a principled framework to implement the VE techniques and propose an approximate method to compute VE of candidate models. The development offers ample opportunity of exploiting VE to further improve the scalability of IDID solutions. We theoretically analyze properties of the approximate techniques and show empirical results in multiple problem domains.

上一篇:Better Strategyproof Mechanisms without Payments or Prior — An Analytic Approach

下一篇:Selective Norm Monitoring

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...