Beyond Short Snippets: Deep Networks for Video Classification


2019-12-25 |


Abstract

Convolutional neural networks (CNNs) have been extensively applied for image recognition problems giving state-of-the-art results on recognition, detection, segmentation and retrieval. In this work we propose and evaluate several deep neural network architectures to combine image information across a video over longer time periods than previously attempted. We propose two methods capable of handling full length videos. The first method explores various convolutional temporal feature pooling architectures, examining the various design choices which need to be made when adapting a CNN for this task. The second proposed method explicitly models the video as an ordered sequence of frames. For this purpose we employ a recurrent neural network that uses Long Short-Term Memory (LSTM) cells which are connected to the output of the underlying CNN. Our best networks exhibit significant performance improvements over previously published results on the Sports 1 million dataset (73.1% vs. 60.9%) and the UCF-101 datasets with (88.6% vs. 88.0%) and without additional optical flow information (82.6% vs. 73.0%).
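
The two ideas named in the abstract can be illustrated with a minimal NumPy sketch. It is not the paper's architecture: the abstract does not specify layer sizes, where pooling is placed, or how the LSTM is configured, so everything below is an assumption made for illustration. The random `features` array stands in for per-frame CNN outputs, and the names `temporal_max_pool` and `lstm_last_hidden`, the gate ordering, and all dimensions are hypothetical.

```python
import numpy as np


def temporal_max_pool(frame_features):
    """Collapse a (num_frames, feature_dim) array into one video-level vector
    by taking the element-wise maximum over the time axis."""
    return frame_features.max(axis=0)


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def lstm_last_hidden(frame_features, W, U, b):
    """Run a single-layer LSTM over the frames in order; return the final hidden state.

    Shapes (assumed for this sketch):
      W: (4 * hidden, feature_dim), U: (4 * hidden, hidden), b: (4 * hidden,)
    Gate order inside the stacked weights: input, forget, output, candidate.
    """
    hidden = U.shape[1]
    h = np.zeros(hidden)  # hidden state
    c = np.zeros(hidden)  # cell state
    for x in frame_features:
        z = W @ x + U @ h + b
        i = sigmoid(z[0 * hidden:1 * hidden])   # input gate
        f = sigmoid(z[1 * hidden:2 * hidden])   # forget gate
        o = sigmoid(z[2 * hidden:3 * hidden])   # output gate
        g = np.tanh(z[3 * hidden:4 * hidden])   # candidate cell update
        c = f * c + i * g
        h = o * np.tanh(c)
    return h


# Stand-in data: 30 frames, each with an 8-dimensional "CNN feature" (random here).
rng = np.random.default_rng(0)
num_frames, feature_dim, hidden_dim = 30, 8, 4
features = rng.standard_normal((num_frames, feature_dim))

# Randomly initialised (untrained) LSTM parameters, for illustration only.
W = 0.1 * rng.standard_normal((4 * hidden_dim, feature_dim))
U = 0.1 * rng.standard_normal((4 * hidden_dim, hidden_dim))
b = np.zeros(4 * hidden_dim)

pooled = temporal_max_pool(features)             # order-independent summary
h_final = lstm_last_hidden(features, W, U, b)    # order-dependent summary
```

The design difference the sketch tries to surface is that max pooling over time ignores frame order, whereas the recurrent pass consumes frames as an ordered sequence, which matches the abstract's description of the second method. In either case the resulting vector would then be fed to a classifier, which is omitted here.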

