Abstract
Heuristic-based active learning (AL) methods are limited when the data distributions of the underlying learning problems vary. Recent data-driven AL policy learning methods are also restricted to learning from closely related domains. We introduce a new sample-efficient method that learns the AL policy directly on the target domain of interest through wake and dream cycles. Our approach interleaves querying annotations for the selected data points to update the underlying student learner with improving the AL policy in simulation, where the current student learner acts as an imperfect annotator. We evaluate our method on cross-domain and cross-lingual text classification and named entity recognition tasks. Experimental results show that our dream-based AL policy training strategy is more effective than applying a pretrained policy without further fine-tuning, and better than existing strong baseline methods that use heuristics or reinforcement learning.
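The wake/dream interleaving described above can be caricatured with a toy sketch. Everything here is a stand-in assumption, not the paper's method: a 1-D binary task, a nearest-centroid `Student`, and a policy reduced to a single exploration parameter `eps`. The point is only the control flow: in the wake phase the policy queries the real annotator and updates the student; in the dream phase the policy is tuned in simulation, with the current student supplying (imperfect) pseudo-labels.

```python
import random

random.seed(0)

# Toy task: 1-D points, true label = 1 if x > 0 (the human annotator).
pool = [random.uniform(-1, 1) for _ in range(60)]
oracle = lambda x: int(x > 0)

class Student:
    """Nearest-centroid binary classifier (stand-in for the student learner)."""
    def __init__(self):
        self.data = {0: [], 1: []}
    def update(self, x, y):
        self.data[y].append(x)
    def predict(self, x):
        cents = {y: sum(v) / len(v) for y, v in self.data.items() if v}
        if len(cents) < 2:                      # too little data: default class
            return next(iter(cents), 0)
        return min(cents, key=lambda y: abs(x - cents[y]))
    def boundary(self):
        cents = [sum(v) / len(v) for v in self.data.values() if v]
        return sum(cents) / len(cents) if cents else 0.0

def select(eps, student, candidates):
    """Policy: with prob. eps explore randomly, else pick the most uncertain
    point (closest to the student's current decision boundary)."""
    if random.random() < eps:
        return random.choice(candidates)
    b = student.boundary()
    return min(candidates, key=lambda x: abs(x - b))

def dream_update(student, candidates, eps_grid=(0.0, 0.3, 0.7)):
    """Dream phase: score candidate policies in simulated AL episodes, with
    the current student acting as an imperfect annotator (pseudo-labels)."""
    def simulate(eps):
        sim, cand = Student(), list(candidates)
        for _ in range(min(8, len(cand))):
            x = select(eps, sim, cand)
            cand.remove(x)
            sim.update(x, student.predict(x))   # pseudo-label from the student
        # Agreement with the student on held-out points as a proxy reward.
        return sum(sim.predict(x) == student.predict(x) for x in candidates[:10])
    return max(eps_grid, key=simulate)

# Wake/dream loop: query real annotations, then refine the policy in dreams.
student, eps, budget = Student(), 0.5, 12
for _ in range(budget):
    x = select(eps, student, pool)              # wake: policy picks a point
    pool.remove(x)
    student.update(x, oracle(x))                # wake: human annotates it
    eps = dream_update(student, pool)           # dream: improve the AL policy

accuracy = sum(student.predict(x) == oracle(x) for x in pool) / len(pool)
print(round(accuracy, 2))
```

This is sample-efficient in spirit only: the real annotator is called exactly `budget` times, while the dream simulations reuse the student's own predictions for free.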