资源论文The Language of Actions: Recovering the Syntax and Semantics of Goal-Directed Human Activities

The Language of Actions: Recovering the Syntax and Semantics of Goal-Directed Human Activities

2019-12-17 | |  44 |   39 |   0

Abstract

This paper describes a framework for modeling human activities as temporally structured processes. Our approach is motivated by the inherently hierarchical nature of human activities and the close correspondence between human actions and speech: We model action units using Hidden Markov Models, much like words in speech. These action units then form the building blocks to model complex human activities as sentences using an action grammar. To evaluate our approach, we collected a large dataset of daily cooking activities: The dataset includes a total of 52 participants, each performing a total of 10 cooking activities in multiple real-life kitchens, resulting in over 77 hours of video footage. We evaluate the HTK toolkit, a stateof-the-art speech recognition engine, in combination with multiple video feature descriptors, for both the recognition of cooking activities (e.g., making pancakes) as well as the semantic parsing of videos into action units (e.g., cracking eggs). Our results demonstrate the benefifits of structured temporal generative approaches over existing discriminative approaches in coping with the complexity of human daily life activities.

上一篇:Multi-target Tracking with Motion Context in Tensor Power Iteration

下一篇:Enriching Visual Knowledge Bases via Object Discovery and Segmentation

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...