Context-aware Captions from Context-agnostic Supervision

资源分类

2019-12-02 |

53 |

50 |

Abstract

We introduce an inference technique to produce discriminative context-aware image captions (captions that describe differences between images or visual concepts) using only generic context-agnostic training data (captions that describe a concept or an image in isolation). For example, given images and captions of “siamese cat” and “tiger cat”, we generate language that describes the “siamese cat” in a way that distinguishes it from “tiger cat”. Our key novelty is that we show how to do joint inference over a language model that is context-agnostic and a listener which distinguishes closely-related concepts. We fifirst apply our technique to a justifification task, namely to describe why an image contains a particular fifine-grained category as opposed to another closely-related category of the CUB- 200-2011 dataset. We then study discriminative image captioning to generate language that uniquely refers to one of two semantically-similar images in the COCO dataset. Evaluations with discriminative ground truth for justifification and human studies for discriminative image captioning reveal that our approach outperforms baseline generative and speaker-listener approaches for discrimination.

上一篇：Consensus Maximization with Linear Matrix Inequality Constraints

下一篇：Context-Aware Correlation Filter Tracking

用户评价

全部评价

还没有评论，说两句吧！

热门资源

A Mathematical Mo...

Direct democracy, where each voter casts one vo...
Stratified Strate...

In this paper we introduce Stratified Strategy ...
Learning to Predi...

Much of model-based reinforcement learning invo...
dynamical system ...

allows to preform manipulations of heavy or bul...
The Variational S...

Unlike traditional images which do not offer in...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com