Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection

资源分类

2019-12-26 |

47 |

44 |

Abstract

The status quo approach to training object detectors re-quires expensive bounding box annotations. Our frameworktakes a markedly different direction: we transfer tracked ob-ject boxes from weakly-labeled videos to weakly-labeled im-ages to automatically generate pseudo ground-truth boxes, which replace manually annotated bounding boxes. Wefirst mine discriminative regions in the weakly-labeled im-age collection that frequently/rarely appear in the positive/negative images. We then match those regions to videosand retrieve the corresponding tracked object boxes. Finally, we design a hough transform algorithm to vote for the best box to serve as the pseudo GT for each image, and use them to train an object detector. Together, these lead to state-of-the-art weakly-supervised detection results on the PASCAL 2007 and 2010 datasets.

上一篇：Convolutional Two-Stream Network Fusion for Video Action Recognition

下一篇：Efficient Intersection of Three Quadrics and Applications in Computer Vision

用户评价

全部评价

还没有评论，说两句吧！

热门资源

The Variational S...

Unlike traditional images which do not offer in...
Stratified Strate...

In this paper we introduce Stratified Strategy ...
Learning to learn...

The move from hand-designed features to learned...
A Mathematical Mo...

Direct democracy, where each voter casts one vo...
Learning to Predi...

Much of model-based reinforcement learning invo...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com