Upper Confidence Weighted Learning for Efficient Exploration in Multiclass Prediction with Binary Feedback


2019-11-11


Abstract: We introduce a novel algorithm called Upper Confidence Weighted Learning (UCWL) for online multiclass learning from binary feedback. UCWL combines the Upper Confidence Bound (UCB) framework with the Soft Confidence Weighted (SCW) online learning scheme. UCWL achieves state-of-the-art performance (especially on noisy and nonseparable data) with low computational cost. Estimated confidence intervals are used for informed exploration, which enables faster learning than uninformed exploration or no exploration at all. The targeted application setting is human-robot interaction (HRI), in which a robot learns to classify its observations while a human teaches it by providing only binary feedback (e.g., right/wrong). Results from an HRI experiment and two benchmark datasets show that UCWL outperforms other algorithms in the online binary feedback setting and, surprisingly, sometimes even beats state-of-the-art algorithms that receive full feedback, while UCWL receives only binary feedback on the same data.
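The abstract does not state the UCWL update rule, so the following is only a rough sketch of the general idea it describes: keep a per-class score together with a confidence width, pick the class with the highest upper confidence bound, and learn from a right/wrong signal on that single prediction. It is not the authors' algorithm. In particular, the SCW update is replaced here by a simple diagonal regularized least-squares update, and the class name, the `eta` exploration parameter, and the +1/-1 target encoding are all assumptions made for illustration.

```python
import math


class UCBBinaryFeedbackClassifier:
    """Illustrative stand-in, NOT the UCWL/SCW update from the paper.

    Each class keeps a diagonal precision vector and a target-weighted
    feature sum (a diagonal ridge-regression approximation, assumed here).
    """

    def __init__(self, n_classes, n_features, eta=1.0):
        self.eta = eta  # hypothetical exploration weight
        self.precision = [[1.0] * n_features for _ in range(n_classes)]
        self.b = [[0.0] * n_features for _ in range(n_classes)]

    def _score_and_width(self, k, x):
        # Mean score uses weights w_i = b_i / precision_i.
        mean = sum(b / p * xi
                   for b, p, xi in zip(self.b[k], self.precision[k], x))
        # Width shrinks as the class accumulates feedback along x.
        width = math.sqrt(sum(xi * xi / p
                              for p, xi in zip(self.precision[k], x)))
        return mean, width

    def predict(self, x):
        # Informed exploration: choose the class with the highest
        # upper confidence bound (ties go to the lowest class index).
        ucb = []
        for k in range(len(self.b)):
            mean, width = self._score_and_width(k, x)
            ucb.append(mean + self.eta * width)
        return max(range(len(ucb)), key=ucb.__getitem__)

    def update(self, x, k, correct):
        # Binary feedback only: the teacher says whether class k was right.
        target = 1.0 if correct else -1.0
        for i, xi in enumerate(x):
            self.precision[k][i] += xi * xi
            self.b[k][i] += target * xi
```

In use, each round would call `predict(x)`, show the guess to the teacher, and pass the right/wrong answer to `update(x, k, correct)`. A class that was marked wrong loses both score and confidence width for similar inputs, so the next guess moves to a class that is still uncertain rather than being chosen at random.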
