资源论文Visually Indicated Sounds

Visually Indicated Sounds

2019-12-26 | |  50 |   35 |   0

Abstract

Objects make distinctive sounds when they are hit or scratched. These sounds reveal aspects of an object’s material properties, as well as the actions that produced them. In this paper, we propose the task of predicting what sound an object makes when struck as a way of studying physical interactions within a visual scene. We present an algorithm that synthesizes sound from silent videos of people hitting and scratching objects with a drumstick. This algorithm uses a recurrent neural network to predict sound features from videos and then produces a waveform from these features with an example-based synthesis procedure. We show that the sounds predicted by our model are realistic enoughto fool participants in a “real or fake” psychophysical experiment, and that they convey significant information about material properties and physical interactions.

上一篇:Using Self-Contradiction to Learn Confidence Measures in Stereo Vision

下一篇:Part-Stacked CNN for Fine-Grained Visual Categorization

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • Learning to learn...

    The move from hand-designed features to learned...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...