Know What You Don’t Know: Modeling a Pragmatic Speaker that Refers to Objects of Unknown Categories


2019-09-22


Abstract: Zero-shot learning in Language & Vision (L&V) is the task of correctly labelling (or naming) objects of novel categories. Another strand of work in L&V aims at pragmatically informative rather than “correct” object descriptions, e.g. in reference games. We combine these lines of research and model zero-shot reference games, where a speaker needs to successfully refer to a novel object in an image. Inspired by models of “rational speech acts”, we extend a neural generator to become a pragmatic speaker that reasons about uncertain object categories. As a result of this reasoning, the generator produces fewer nouns and fewer names of distractor categories than a literal speaker. We show that this conversational strategy for dealing with novel objects often improves communicative success, measured as the resolution accuracy of an automatic listener.
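The contrast between a literal and a pragmatic speaker can be illustrated with a toy, table-based version of rational-speech-acts reasoning. This is only a sketch under stated assumptions, not the paper's neural generator: the two utterances, the two objects, the score matrix `SCORES`, and the rationality parameter `alpha` are all hypothetical. The scores assume the speaker is unsure whether the novel target is a dog, while the distractor clearly is one.

```python
import numpy as np

# Hypothetical toy setup (not taken from the paper).
UTTERANCES = ["dog", "brown"]
OBJECTS = ["target", "distractor"]

# SCORES[u, o]: assumed literal fit of utterance u to object o.
# The novel target looks somewhat dog-like; the distractor is clearly a dog.
SCORES = np.array([
    [0.6, 0.9],   # "dog"
    [0.5, 0.05],  # "brown"
])

def literal_speaker(scores):
    # P_S0(u | o): normalise over utterances for each object.
    return scores / scores.sum(axis=0, keepdims=True)

def literal_listener(scores):
    # P_L0(o | u): normalise over objects for each utterance.
    return scores / scores.sum(axis=1, keepdims=True)

def pragmatic_speaker(scores, alpha=1.0):
    # P_S1(u | o) is proportional to P_L0(o | u) ** alpha:
    # prefer utterances that lead a literal listener to the intended object.
    utility = literal_listener(scores) ** alpha
    return utility / utility.sum(axis=0, keepdims=True)

def best_utterance(speaker_probs, obj):
    # Most probable utterance for the given object.
    column = speaker_probs[:, OBJECTS.index(obj)]
    return UTTERANCES[int(np.argmax(column))]

literal_choice = best_utterance(literal_speaker(SCORES), "target")
pragmatic_choice = best_utterance(pragmatic_speaker(SCORES), "target")
print(literal_choice, pragmatic_choice)
```

With these assumed scores, the literal speaker names the target with the uncertain noun that also fits the distractor, whereas the pragmatic speaker falls back on the attribute that singles the target out. This mirrors, in miniature, the abstract's observation that pragmatic reasoning leads to fewer names of distractor categories.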
