Probabilistic Models and Informative Subspaces for Audiovisual Correspondence


2020-03-23


Abstract

We propose a probabilistic model of single-source multimodal generation and show how algorithms for maximizing mutual information can find the correspondences between components of each signal. We show how non-parametric techniques for finding informative subspaces can capture the complex statistical relationship between signals in different modalities. We extend a previous technique for finding informative subspaces to include new priors on the projection weights, yielding more robust results. Applied to human speakers, our model can find the relationship between audio speech and video of facial motion, and partially segment out background events in both channels. We present new results on the problem of audio-visual verification, and show how the audio and video of a speaker can be matched even when no prior model of the speaker’s voice or appearance is available.
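The idea of scoring audio-video correspondence by the mutual information between low-dimensional projections can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the synthetic "audio" and "video" features, the mixing vectors, the histogram-based mutual information estimate, and the crude random search over projection weights (which stands in for the paper's non-parametric optimization and its priors on the weights).

```python
import numpy as np

def histogram_mi(x, y, bins=8):
    """Plug-in estimate of mutual information (in nats) between two 1-D samples."""
    joint, _, _ = np.histogram2d(x, y, bins=bins)
    pxy = joint / joint.sum()
    px = pxy.sum(axis=1, keepdims=True)   # marginal of x, shape (bins, 1)
    py = pxy.sum(axis=0, keepdims=True)   # marginal of y, shape (1, bins)
    nz = pxy > 0                          # skip empty cells to avoid log(0)
    return float(np.sum(pxy[nz] * np.log(pxy[nz] / (px @ py)[nz])))

rng = np.random.default_rng(0)
n = 2000

# Hypothetical single source driving both modalities (e.g. a speaker).
source = rng.standard_normal(n)
mix_a = np.linspace(1.0, 2.0, 5)          # assumed audio mixing vector
mix_v = np.linspace(-1.5, 1.5, 8)         # assumed video mixing vector
audio = np.outer(source, mix_a) + 0.5 * rng.standard_normal((n, 5))
video = np.outer(np.tanh(source), mix_v) + 0.5 * rng.standard_normal((n, 8))

# Crude random search for unit-norm projection weights that maximize the
# estimated mutual information between the two 1-D projections.
best_mi, best_w = -np.inf, None
for _ in range(500):
    wa = rng.standard_normal(5)
    wa /= np.linalg.norm(wa)
    wv = rng.standard_normal(8)
    wv /= np.linalg.norm(wv)
    mi = histogram_mi(audio @ wa, video @ wv)
    if mi > best_mi:
        best_mi, best_w = mi, (wa, wv)

wa, wv = best_w
mi_matched = best_mi
# Mismatched pair: same weights, but video frames shuffled in time,
# which destroys the dependence on the shared source.
mi_mismatched = histogram_mi(audio @ wa, rng.permutation(video) @ wv)

print(f"matched MI:    {mi_matched:.3f} nats")
print(f"mismatched MI: {mi_mismatched:.3f} nats")
```

In this toy setting the matched pair should score noticeably higher than the shuffled pair, which is the intuition behind using the learned projections for verification without a prior model of the speaker.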

