资源论文Modality and Component Aware Feature Fusion for RGB-D Scene Classification

Modality and Component Aware Feature Fusion for RGB-D Scene Classification

2019-12-20 | |  59 |   39 |   0

Abstract

While convolutional neural networks (CNN) have beenexcellent for object recognition, the greater spatial vari-ability in scene images typically meant that the standardfull-image CNN features are suboptimal for scene classifi-cation. In this paper, we investigate a framework allowinggreater spatial flexibility, in which the Fisher vector (FV)encoded distribution of local CNN features, obtained froma multitude of region proposals per image, is considered in-stead. The CNN features are computed from an augment-ed pixel-wise representation comprising multiple modali-ties of RGB, HHA and surface normals, as extracted fromRGB-D data. More significantly, we make two postulates: (1) component sparsity — that only a small variety of region proposals and their corresponding FV GMM components contribute to scene discriminability, and (2) modal non-sparsity — within these discriminative components, allmodalities have important contribution. In our framework,these are implemented through regularization terms applying group lasso to GMM components and exclusive grouplasso across modalities. By learning and combining regres-sors for both proposal-based FV features and global CNN features, we were able to achieve state-of-the-art sceneclassification performance on the SUNRGBD Dataset and NYU Depth Dataset V2.

上一篇:Macroscopic Interferometry: Rethinking Depth Estimation with Frequency-Domain Time-of-Flight

下一篇:Multivariate Regression on the Grassmannian for Predicting Novel Domains

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...