资源论文Stochastic convex optimization with bandit feedback

Stochastic convex optimization with bandit feedback

2020-01-08 | |  102 |   43 |   0

Abstract

This paper addresses the problem of minimizing a convex, Lipschitz function f over a convex, compact set X under a stochastic bandit feedback model. In this model, the algorithm is allowed to observe noisy realizations of the function value f (x) at any query point 图片.png We demonstrate  a generalization of the ellipsoid algorithm that  incurs 图片.png re-gret. Since any algorithm has regret at least 图片.png on this problem, our algorithm is optimal in terms of the scaling with T .

上一篇:Multiclass Boosting: Theory and Algorithms

下一篇:N EWTRON: an Efficient Bandit algorithm for Online Multiclass Prediction

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • Learning to learn...

    The move from hand-designed features to learned...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...