Abstract
Deep networks are extremely hungry for data. They devour hundreds of thousands of labeled images to learn robust and semantically meaningful feature representations.
Current networks are so data-hungry that collecting labeled
data has become as important as designing the networks
themselves. Unfortunately, manual data collection is both
expensive and time-consuming. We present an alternative,
and show how ground truth labels for many vision tasks
are easily extracted from video games in real time as we
play them. We interface with the popular Microsoft DirectX
rendering API and inject specialized rendering code into
the game as it is running. This code produces ground truth
labels for instance segmentation, semantic labeling, depth
estimation, optical flow, intrinsic image decomposition, and
instance tracking. Instead of labeling images, a researcher
now simply plays video games all day long. Our method
is general and works on a wide range of video games. We
collect a dataset of 220k training images and 60k test
images across three video games, and evaluate state-of-the-art
optical flow, depth estimation, and intrinsic image decomposition algorithms. Our video game data is visually closer to
real-world images than other synthetic datasets.
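
To make the injection step above concrete, the following is a minimal sketch of one common interception technique on Windows: patching the COM vtable of a Direct3D swap chain so that every call to IDXGISwapChain::Present first passes through our code before the game's frame is displayed. The names HookedPresent, CaptureGroundTruth, and InstallHook are hypothetical illustrations, not the paper's actual instrumentation.

```cpp
// Hypothetical sketch: intercepting IDXGISwapChain::Present via vtable patching.
#include <windows.h>
#include <dxgi.h>

using PresentFn = HRESULT (STDMETHODCALLTYPE*)(IDXGISwapChain*, UINT, UINT);
static PresentFn g_realPresent = nullptr;

// Placeholder: in a real system this would read back render targets
// (depth, object IDs, motion vectors, ...) before the frame is shown.
static void CaptureGroundTruth(IDXGISwapChain* /*swapChain*/) {}

// Replacement Present: record ground truth for this frame,
// then forward to the game's original Present so rendering proceeds normally.
static HRESULT STDMETHODCALLTYPE HookedPresent(IDXGISwapChain* swapChain,
                                               UINT syncInterval, UINT flags) {
    CaptureGroundTruth(swapChain);
    return g_realPresent(swapChain, syncInterval, flags);
}

// Patch slot 8 of the swap chain's COM vtable, which holds
// IDXGISwapChain::Present (slots 0-7 are IUnknown/IDXGIObject methods).
static void InstallHook(IDXGISwapChain* swapChain) {
    void** vtable = *reinterpret_cast<void***>(swapChain);
    DWORD oldProtect;
    VirtualProtect(&vtable[8], sizeof(void*), PAGE_EXECUTE_READWRITE, &oldProtect);
    g_realPresent = reinterpret_cast<PresentFn>(vtable[8]);
    vtable[8] = reinterpret_cast<void*>(&HookedPresent);
    VirtualProtect(&vtable[8], sizeof(void*), oldProtect, &oldProtect);
}
```

In practice a hook like this would be installed from a DLL loaded into the running game process, so that ground truth can be captured on every frame while the game is played normally.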