资源论文Deep Convolutional Inverse Graphics Network

Deep Convolutional Inverse Graphics Network

2020-02-04 | |  62 |   48 |   0

Abstract 

This paper presents the Deep Convolution Inverse Graphics Network (DCIGN), a model that aims to learn an interpretable representation of images, disentangled with respect to three-dimensional scene structure and viewing transformations such as depth rotations and lighting variations. The DCIGN model is composed of multiple layers of convolution and de-convolution operators and is trained using the Stochastic Gradient Variational Bayes (SGVB) algorithm [10]. We propose a training procedure to encourage neurons in the graphics code layer to represent a specific transformation (e.g. pose or light). Given a single input image, our model can generate new images of the same object with variations in pose and lighting. We present qualitative and quantitative tests of the model’s efficacy at learning a 3D rendering engine for varied object classes including faces and chairs.

上一篇:Halting in Random Walk Kernels

下一篇:Associative Memory via a Sparse Recovery Model

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...