Expanding Object Detector’s HORIZON: Incremental Learning Framework for Object Detection in Videos

资源分类

2019-12-25 |

138 |

130 |

Abstract

Over the last several years it has been shown that image-based object detectors are sensitive to the training data andoften fail to generalize to examples that fall outside the original training sample domain (e.g., videos). A number ofdomain adaptation (DA) techniques have been proposed toaddress this problem. DA approaches are designed to adapta fixed complexity model to the new (e.g., video) domain.We posit that unlabeled data should not only allow adap-tation, but also improve (or at least maintain) performanceon the original and other domains by dynamically adjust-ing model complexity and parameters. We call this notiondomain expansion. To this end, we develop a new scalableand accurate incremental object detection algorithm, basedon several extensions of large-margin embedding (LME).Our detection model consists of an embedding space andmultiple class prototypes in that embedding space, that rep-resent object classes; distance to those prototypes allowsus to reason about multi-class detection. By incrementallydetecting object instances in video and adding confident de-tections into the model, we are able to dynamically adjustthe complexity of the detector over time by instantiating new prototypes to span all domains the model has seen. We test performance of our approach by expanding an object detector trained on ImageNet to detect objects in egocentric videos of Activity Daily Living (ADL) dataset and challenging videos from YouTube Objects (YTO) dataset.

上一篇：Semantic Alignment of LiDAR Data at City Scale

下一篇：Zero-Shot Object Recognition by Semantic Manifold Distance

用户评价

全部评价

还没有评论，说两句吧！

热门资源

Regularizing RNNs...

Recently, caption generation with an encoder-de...
Deep Cross-media ...

Cross-media retrieval is a research hotspot in ...
Learning Expressi...

Facial expression is temporally dynamic event w...
Compact MDDs for ...

Pseudo-Boolean (PB) constraints are usually en...
Supervised Descen...

Many computer vision problems (e.

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com