Object Detection from Video Tubelets with Convolutional Neural Networks

资源分类

2019-12-26 |

46 |

48 |

Abstract

Deep Convolution Neural Networks (CNNs) have shown impressive performance in various vision tasks such as im-age classification, object detection and semantic segmentation. For object detection, particularly in still images, the performance has been significantly increased last year thanks to powerful deep networks (e.g. GoogleNet) and detection frameworks (e.g. Regions with CNN features (RCNN)). The lately introduced ImageNet [6] task on object detection from video (VID) brings the object detection task into the video domain, in which objects’ locations at each frame are required to be annotated with bounding boxes. In this work, we introduce a complete framework for the VID task based on still-image object detection and general object tracking. Their relations and contributions in the VID task are thoroughly studied and evaluated. In addition, a temporal convolution network is proposed to incorporate temporal information to regularize the detection results and shows its effectiveness forthe task. Code is available at https://github.com/ myfavouritekk/vdetlib.

上一篇：Answer-Type Prediction for Visual Question Answering

下一篇：Face2Face: Real-time Face Capture and Reenactment of RGB Videos

用户评价

全部评价

还没有评论，说两句吧！

热门资源

Learning to Predi...

Much of model-based reinforcement learning invo...
Stratified Strate...

In this paper we introduce Stratified Strategy ...
The Variational S...

Unlike traditional images which do not offer in...
A Mathematical Mo...

Direct democracy, where each voter casts one vo...
Learning to learn...

The move from hand-designed features to learned...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com