Abstract
This paper tackles the problem of spatio-temporal action localization in a video, without assuming the availability of multiple videos or any prior annotations. The action is localized by employing images downloaded from the internet using the action name. Given web images, we first dampen image noise using a random walk and evade distracting backgrounds within images using image action proposals. Then, given a video, we generate multiple spatio-temporal action proposals. We suppress camera- and background-generated proposals by exploiting optical flow gradients within the proposals. To obtain the most action-representative proposals, we propose to reconstruct the action proposals in the video by leveraging the action proposals in images. Moreover, we preserve the temporal smoothness of the video and reconstruct all proposal bounding boxes jointly using constraints that push the coefficients for each bounding box toward a common consensus, thus enforcing coefficient similarity across multiple frames. We solve this optimization problem using a variant of the two-metric projection algorithm. Finally, the video proposal that has the lowest reconstruction cost and is motion salient is used to localize the action. Our method is not only applicable to trimmed videos, but can also be used for action localization in untrimmed videos, which is a very challenging problem. We present extensive experiments on trimmed as well as untrimmed datasets to validate the effectiveness of the proposed approach.