资源论文Unsupervised Structure Discovery for Semantic Analysis of Audio

Unsupervised Structure Discovery for Semantic Analysis of Audio

2020-01-16 | |  85 |   42 |   0

Abstract
Approaches to audio classification and retrieval tasks largely rely on detectionbased discriminative models. We submit that such models make a simplistic assumption in mapping acoustics directly to semantics, whereas the actual process is likely more complex. We present a generative model that maps acoustics in a hierarchical manner to increasingly higher-level semantics. Our model has two layers with the first layer modeling generalized sound units with no clear semantic associations, while the second layer models local patterns over these sound units. We evaluate our model on a large-scale retrieval task from TRECVID 2011, and report significant improvements over standard baselines.

上一篇:Semi-Supervised Domain Adaptation with Non-Parametric Copulas

下一篇:Fine-grained recognition refers to a subordinate level of recognition, such as recognizing different species of animals and plants.

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...