资源论文Discriminatively Trained Recurrent Neural Networks for Continuous Dimensional Emotion Recognition from Audio

Discriminatively Trained Recurrent Neural Networks for Continuous Dimensional Emotion Recognition from Audio

2019-11-26 | |  63 |   39 |   0

Abstract Continuous dimensional emotion recognition from audio is a sequential regression problem, where the goal is to maximize correlation between sequences of regression outputs and continuous-valued emotion contours, while minimizing the average deviation. As in other domains, deep neural networks trained on simple acoustic features achieve good performance on this task. Yet, the usual squared error objective functions for neural network training do not fully take into account the above-named goal. Hence, in this paper we introduce a technique for the discriminative training of deep neural networks using the concordance correlation coeffificient as cost function, which unites both correlation and mean squared error in a single differentiable function. Results on the MediaEval 2013 and AV+EC 2015 Challenge data sets show that the proposed method can signifificantly improve the evaluation criteria compared to standard mean squared error training, both in the music and speech domains.

上一篇:Multi-Grained Role Labeling Based on Multi-Modality Information for Real Customer Service Telephone Conversation

下一篇:i, Poet: Automatic Poetry Composition through Recurrent Neural Networks with Iterative Polishing Schema

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • Learning to learn...

    The move from hand-designed features to learned...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...