Contextual bandits with surrogate losses: Margin bounds and efficient algorithms

登录免费注册

资源分类

论文
算法
数据集
经验分享
技术动态
行业动态

论文
学习
研究领域

算法
学习
研究领域

数据集
自动驾驶
图片

经验分享
学习
研究领域

技术动态
计算机视觉
自然语言处理

行业动态
教育
语音识别

》资源》论文》Contextual bandits with surrogate losses: Margin bounds and efficient algorithms

Contextual bandits with surrogate losses: Margin bounds and efficient algorithms

2020-02-17 |

37 |

34 |

Contextual bandits with surrogate losses: Margin bounds and efficient algorithms
论文

Abstract

We use surrogate losses to obtain several new regret bounds and new algorithms for contextual bandit learning. Using the ramp loss, we derive new margin-based regret bounds in terms of standard sequential complexity measures of a benchmark class of real-valued regression functions. Using the hinge loss, we derive an efficient algorithm with a -type mistake bound against benchmark policies induced by d-dimensional regressors. Under realizability assumptions, our results also yield classical regret bounds.