Hierarchical Text Generation and Planning for Strategic Dialogue


2020-03-11


Abstract

End-to-end models for goal-orientated dialogue are challenging to train, because linguistic and strategic aspects are entangled in latent state vectors. We introduce an approach to learning representations of messages in dialogues by maximizing the likelihood of subsequent sentences and actions, which decouples the semantics of the dialogue utterance from its linguistic realization. We then use these latent sentence representations for hierarchical language generation, planning and reinforcement learning. Experiments show that our approach increases the end-task reward achieved by the model, improves the effectiveness of long-term planning using rollouts, and allows self-play reinforcement learning to improve decision making without diverging from human language. Our hierarchical latent-variable model outperforms previous work both linguistically and strategically.
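
The idea of long-term planning using rollouts over message representations can be illustrated with a minimal sketch. This is not the authors' implementation: the function `plan_with_rollouts`, the discrete candidate codes, and the `simulate` callback (a stand-in for playing a dialogue to completion and returning the end-task reward) are all hypothetical names introduced here for illustration. The sketch assumes a small set of candidate message codes and simply picks the one with the highest average simulated reward.

```python
import random
from typing import Callable, Hashable, Sequence


def plan_with_rollouts(
    candidates: Sequence[Hashable],
    simulate: Callable[[Hashable, random.Random], float],
    n_rollouts: int = 20,
    seed: int = 0,
) -> Hashable:
    """Return the candidate with the highest mean rollout reward.

    `candidates` are hypothetical message codes (assumed discrete here).
    `simulate` is a hypothetical stand-in: it commits to one candidate,
    plays the rest of the dialogue, and returns the end-task reward.
    """
    if not candidates:
        raise ValueError("need at least one candidate")
    rng = random.Random(seed)  # seeded so repeated calls give the same plan
    best, best_score = None, float("-inf")
    for z in candidates:
        # Average the reward over several simulated continuations.
        total = sum(simulate(z, rng) for _ in range(n_rollouts))
        mean_reward = total / n_rollouts
        if mean_reward > best_score:
            best, best_score = z, mean_reward
    return best


def toy_simulate(z: int, rng: random.Random) -> float:
    """Toy environment (assumption): fixed reward per code plus small noise."""
    base = {0: 2.0, 1: 5.0, 2: 3.0}[z]
    return base + rng.uniform(-0.5, 0.5)


if __name__ == "__main__":
    print(plan_with_rollouts([0, 1, 2], toy_simulate))
```

Because planning here operates on a handful of message codes rather than on individual words, the search space per turn stays small. In a real system the candidates and the simulator would both come from trained models, which this sketch does not attempt to reproduce.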
