资源论文Efficient Pure Exploration in Adaptive Round model

Efficient Pure Exploration in Adaptive Round model

2020-02-20 | |  64 |   39 |   0

Abstract

In the adaptive setting, many multi-armed bandit applications allow the learner to adaptively draw samples and adjust sampling strategy in rounds. In many real applications, not only the query complexity but also the round complexity need to be optimized. In this paper, we study both PAC and exact top-k arm identification problems and design efficient algorithms considering both round complexity and query complexity. For PAC problem, we achieve optimal query complexity and use only 图片.png rounds, which matches the lower bound of round complexity, while most of existing works need 图片.png rounds. For exact top-k arm identification, we improve the round complexity factor from log n to 图片.png and achieve near optimal query complexity. In experiments, our algorithms conduct far fewer rounds, and outperform state of the art by orders of magnitude with respect to query cost.

上一篇:Primal-Dual Block Generalized Frank-Wolfe

下一篇:Tensor Monte Carlo: Particle Methods for the GPU era

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • Learning to learn...

    The move from hand-designed features to learned...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...