资源论文Deep Bilinear Learning for RGB-D Action Recognition

Deep Bilinear Learning for RGB-D Action Recognition

2019-10-23 | |  51 |   41 |   0
Abstract. In this paper, we focus on exploring modality-temporal mutual information for RGB-D action recognition. In order to learn timevarying information and multi-modal features jointly, we propose a novel deep bilinear learning framework. In the framework, we propose bilinear blocks that consist of two linear pooling layers for pooling the input cube features from both modality and temporal directions, separately. To capture rich modality-temporal information and facilitate our deep bilinear learning, a new action feature called modality-temporal cube is presented in a tensor structure for characterizing RGB-D actions from a comprehensive perspective. Our method is extensively tested on two public datasets with four different evaluation settings, and the results show that the proposed method outperforms the state-of-the-art approaches

上一篇:Toward Scale-Invariance and Position-Sensitive Region Proposal Networks

下一篇:Self-Supervised Relative Depth Learning for Urban Scene Understanding

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...