资源论文HOPE: Hierarchical Object Prototype Encoding for Efficient Object Instance Search in Videos

HOPE: Hierarchical Object Prototype Encoding for Efficient Object Instance Search in Videos

2019-12-10 | |  51 |   36 |   0

Abstract

This paper tackles the problem of object instance search in videos. To effectively capture the relevance between a query and video frames and precisely localize the particular object, we leverage the object proposals to improve the quality of object instance search in videos. However, hundreds of object proposals obtained from each frame could result in unaffordable memory and computational cost. To this end, we present a simple yet effective hierarchical object prototype encoding (HOPE) model to accelerate the object instance search without sacrifificing accuracy, which exploits both the spatial and temporal self-similarity property existing in object proposals generated from video frames. We design two types of sphere k-means methods, i.e., spatially-constrained sphere k-means and temporallyconstrained sphere k-means to learn frame-level object prototypes and dataset-level object prototypes, respectively. In this way, the object instance search problem is cast to the sparse matrix-vector multiplication problem. Thanks to the sparsity of the codes, both the memory and computational cost are signifificantly reduced. Experimental results on two video datasets demonstrate that our approach signifificantly improves the performance of video object instance search over other state-of-the-art fast search schemes

上一篇:Fully Convolutional Instance-aware Semantic Segmentation

下一篇:Indoor Scene Parsing with Instance Segmentation, Semantic Labeling and Support Relationship Inference

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...