资源论文Predicting the Visual Focus of Attention in Multi-Person Discussion Videos

Predicting the Visual Focus of Attention in Multi-Person Discussion Videos

2019-10-09 | |  87 |   43 |   0

 Abstract Visual focus of attention in multi-person discussions is a crucial nonverbal indicator in tasks such as inter-personal relation inference, speech transcription, and deception detection. However, predicting the focus of attention remains a challenge because the focus changes rapidly, the discussions are highly dynamic, and the people’s behaviors are inter-dependent. Here we propose ICAF (Iterative Collective Attention Focus), a collective classififi- cation model to jointly learn the visual focus of attention of all people. Every person is modeled using a separate classififier. ICAF models the people collectively—the predictions of all other people’s classififiers are used as inputs to each person’s classififier. This explicitly incorporates interdependencies between all people’s behaviors. We evaluate ICAF on a novel dataset of 5 videos (35 people, 109 minutes, 7604 labels in all) of the popular Resistance game and a widely-studied meeting dataset with supervised prediction. ICAF outperforms the strongest baseline by 1%–5% accuracy in predicting the people’s visual focus of attention. Further, we propose a lightly supervised technique to train models in the absence of training labels. We show that light-supervised ICAF performs similar to the supervised ICAF, thus showing its effectiveness and generality to previously unseen videos

上一篇:Predicting Dominance in Multi-person Videos

下一篇:Recurrent Generative Networks for Multi-Resolution Satellite Data: An Application in Cropland Monitoring

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...