资源论文Video Event Detection by Inferring Temporal Instance Labels

Video Event Detection by Inferring Temporal Instance Labels

2019-12-16 | |  63 |   37 |   0

Abstract

Video event detection allows intelligent indexing of video content based on events. Traditional approaches extract features from video frames or shots, then quantize and pool the features to form a single vector representation for the entire video. Though simple and effificient, the fifinal pooling step may lead to loss of temporally local information, which is important in indicating which part in a long video signi- fifies presence of the event. In this work, we propose a novel instance-based video event detection approach. We represent each video as multiple instances, defifined as video segments of different temporal intervals. The objective is to learn an instance-level event detection model based on only video-level labels. To solve this problem, we propose a large-margin formulation which treats the instance labels as hidden latent variables, and simultaneously infers the instance labels as well as the instance-level classifification model. Our framework infers optimal solutions that assume positive videos have a large number of positive instances while negative videos have the fewest ones. Extensive experiments on large-scale video event datasets demonstrate signifificant performance gains. The proposed method is also useful in explaining the detection results by localizing the temporal segments in a video which is responsible for the positive detection

上一篇:Pyramid-based Visual Tracking Using Sparsity Represented Mean Transform

下一篇:Predicting User Annoyance Using Visual Attributes

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...