资源论文Online Learning with Abstention

Online Learning with Abstention

2020-03-16 | |  57 |   43 |   0

Abstract

We present an extensive study of a key problem in online learning where the learner can opt to ab stain from making a prediction, at a certain cost. In the adversarial setting, we show how existing online algorithms and guarantees can be adapted to this problem. In the stochastic setting, we fir point out a bias problem that limits the straightf ward extension of algorithms such as UCB N to this context. Next, we give a new algorithm, UCB GT, that exploits historical data and time-varying feedback graphs. We show that this algorithm benefits from more favorable regret guarantees than a natural extension of UCB N. We further report th results of a series of experiments demonstrating that UCB GT largely outperforms that extension of UCB N, as well as other standard baselines.

上一篇:Efficient Gradient-Free Variational Inference using Policy Search

下一篇:Fair and Diverse DPP-Based Data Summarization

用户评价
全部评价

热门资源

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to learn...

    The move from hand-designed features to learned...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...