资源论文Monocular 3D Scene Modeling and Inference: Understanding Multi-Object Traffic Scenes

Monocular 3D Scene Modeling and Inference: Understanding Multi-Object Traffic Scenes

2020-03-31 | |  53 |   34 |   0

Abstract

Scene understanding has (again) become a focus of computer vision research, leveraging advances in detection, context modeling, and tracking. In this paper, we present a novel probabilistic 3D scene model that encompasses multi-class object detection, object tracking, scene labeling, and 3D geometric relations. This integrated 3D model is able to represent complex interactions like inter-object occlusion, physical exclusion between objects, and geometric context. Inference allows to recover 3D scene context and perform 3D multiob- ject tracking from a mobile observer, for objects of multiple categories, using only monocular video as input. In particular, we show that a joint scene track- let model for the evidence collected over multiple frames substantially improves performance. The approach is evaluated for two different types of challenging on- board sequences. We first show a substantial improvement to the state-of-the-art in 3D multi-people tracking. Moreover, a similar performance gain is achieved for multi-class 3D tracking of cars and trucks on a new, challenging dataset.

上一篇:A Minimal Case Solution to the Calibrated Relative Pose Problem for the Case of Two Known Orientation Angles*

下一篇:Blocks World Revisited: Image Understanding Using Qualitative Geometry and Mechanics

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...