资源论文Pulling Things out of Perspective

Pulling Things out of Perspective

2019-12-11 | |  73 |   56 |   0

Abstract

The limitations of current state-of-the-art methods for single-view depth estimation and semantic segmentations are closely tied to the property of perspective geometry, that the perceived size of the objects scales inversely with the distance. In this paper, we show that we can use this property to reduce the learning of a pixel-wise depth classififier to a much simpler classififier predicting only the likelihood of a pixel being at an arbitrarily fifixed canonical depth. The likelihoods for any other depths can be obtained by applying the same classififier after appropriate image manipulations. Such transformation of the problem to the canonical depth removes the training data bias towards certain depths and the effect of perspective. The approach can be straight-forwardly generalized to multiple semantic classes, improving both depth estimation and semantic segmentation performance by directly targeting the weaknesses of independent approaches. Conditioning the semantic label on the depth provides a way to align the data to their physical scale, allowing to learn a more discriminative classififier. Conditioning depth on the semantic class helps the classififier to distinguish between ambiguities of the otherwise ill-posed problem. We tested our algorithm on the KITTI road scene dataset and NYU2 indoor dataset and obtained obtained results that signifificantly outperform current state-of-the-art in both single-view depth and semantic segmentation domain

上一篇:Distance Encoded Product Quantization

下一篇:Deformable Object Matching via Deformation Decomposition based 2D Label MRF

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...