资源论文Combining Per-frame and Per-track Cues for Multi-person Action Recognition

Combining Per-frame and Per-track Cues for Multi-person Action Recognition

2020-04-02 | |  69 |   44 |   0

Abstract

We propose a model to combine per-frame and per-track cues for action recognition. With multiple targets in a scene, our model simultaneously captures the natural harmony of an individual’s action in a scene and the flow of actions of an individual in a video sequence, inferring valid tracks in the process. Our motivation is based on the unlikely discordance of an action in a structured scene, both at the track level and the frame level (e.g ., a person dancing in a crowd of joggers). While we can utilize sampling approaches for inference in our model, we instead devise a global inference algorithm by decomposing the problem and solving the subproblems exactly and efficiently, recovering a globally optimal joint solution in several cases. Finally, we improve on the state- of-the-art action recognition results for two publicly available datasets.

上一篇:Age Invariant Face Verification with Relative Craniofacial Growth Model

下一篇:Separability Oriented Preprocessing for Illumination-Insensitive Face Recognition

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • Learning to learn...

    The move from hand-designed features to learned...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...