Online Continuous Submodular Maximization: From Full-Information to Bandit Feedback

2020-02-20

Abstract

In this paper, we propose three online algorithms for submodular maximization. The first one, Mono-Frank-Wolfe, reduces the number of per-function gradient evaluations from $T^{1/2}$ [18] and $T^{3/2}$ [17] to one, and achieves a $(1-1/e)$-regret bound of $O(T^{4/5})$. The second one, Bandit-Frank-Wolfe, is the first bandit algorithm for continuous DR-submodular maximization, and achieves a $(1-1/e)$-regret bound of $O(T^{8/9})$. Finally, we extend Bandit-Frank-Wolfe to a bandit algorithm for discrete submodular maximization, Responsive-Frank-Wolfe, which attains a $(1-1/e)$-regret bound of $O(T^{8/9})$ in the responsive bandit setting.
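
The algorithms named in the abstract are all built on Frank-Wolfe-style updates: a linear maximization over the feasible set against a (possibly estimated) gradient, followed by a small step toward the chosen vertex. The sketch below is only a minimal, self-contained illustration of that generic structure for a monotone continuous DR-submodular objective with exact gradients; it is not the paper's Mono-Frank-Wolfe or Bandit-Frank-Wolfe, and the toy objective, budget constraint, and function names are illustrative assumptions rather than anything taken from the paper.

# Illustrative sketch only: generic Frank-Wolfe ascent for a monotone
# continuous DR-submodular function over {x in [0,1]^d : sum(x) <= budget}.
# The paper's algorithms replace the exact gradient used here with one
# stochastic gradient per function (Mono-Frank-Wolfe) or a one-point
# estimate from bandit feedback (Bandit-Frank-Wolfe).

import numpy as np


def linear_maximization(grad, budget):
    """Maximize <grad, v> over {v in [0,1]^d : sum(v) <= budget}.

    The maximizer puts mass 1 on the coordinates with the largest
    positive gradient entries, up to the budget.
    """
    v = np.zeros_like(grad)
    order = np.argsort(-grad)              # coordinates by descending gradient
    for idx in order[: int(budget)]:
        if grad[idx] > 0:
            v[idx] = 1.0
    return v


def frank_wolfe(grad_fn, dim, budget, num_steps=100):
    """Generic Frank-Wolfe ascent starting from the origin.

    grad_fn: callable returning the gradient at a point x in [0,1]^d.
    Each step moves 1/num_steps of the way toward a vertex, so the final
    iterate stays inside the feasible set.
    """
    x = np.zeros(dim)
    for _ in range(num_steps):
        v = linear_maximization(grad_fn(x), budget)
        x = x + v / num_steps              # convex averaging keeps x feasible
    return x


if __name__ == "__main__":
    # Toy monotone DR-submodular objective: f(x) = log(1 + w . x) with w >= 0.
    rng = np.random.default_rng(0)
    w = rng.uniform(0.1, 1.0, size=10)
    grad = lambda x: w / (1.0 + w @ x)
    x_hat = frank_wolfe(grad, dim=10, budget=3)
    print("fractional solution:", np.round(x_hat, 3))
    print("objective value:", np.log1p(w @ x_hat))

Running this returns a fractional point that concentrates mass on the coordinates with the largest weights, which is the expected behavior of the linear-maximization-plus-averaging loop that the online and bandit variants in the paper refine.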

