Beyond Online Balanced Descent: An Optimal Algorithm for Smoothed Online Optimization

资源分类

2020-02-20 |

93 |

80 |

Abstract

We study online convex optimization in a setting where the learner seeks to minimize the sum of a per-round hitting cost and a movement cost which is incurred when changing decisions between rounds. We prove a new lower bound on the competitive ratio of any online algorithm in the setting where the costs are mstrongly convex and the movement costs are the squared `2 norm. This lower bound shows that no algorithm can achieve a competitive ratio that is 图片.png as m tends to zero. No existing algorithms have competitive ratios matching this bound, and we show that the state-of-the-art algorithm, Online Balanced Decent (OBD), has a competitive ratio that is We additionally propose two new algorithms, Greedy OBD (G-OBD) and Regularized OBD (R-OBD) and prove that both algorithms have an 图片.png competitive ratio. The result for G-OBD holds when the hitting costs are quasiconvex and the movement costs are the squared `2 norm, while the result for R-OBD holds when the hitting costs are m-strongly convex and the movement costs are Bregman Divergences. Further, we show that R-OBD simultaneously achieves constant, dimension-free competitive ratio and sublinear regret when hitting costs are strongly convex.

上一篇：Balancing Efficiency and Fairness in On-Demand Ridesourcing

下一篇：Generalized Block-Diagonal Structure Pursuit: Learning Soft Latent Task Assignment against Negative Transfer

用户评价

全部评价

还没有评论，说两句吧！

热门资源

A Mathematical Mo...

Direct democracy, where each voter casts one vo...
Supervised Descen...

Many computer vision problems (e.
Learning to Predi...

Much of model-based reinforcement learning invo...
The Variational S...

Unlike traditional images which do not offer in...
Depth Super Resol...

We tackle the problem of jointly increasing the...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com