资源算法Mulimodal Compact Bilinear Pooling for VQA

Mulimodal Compact Bilinear Pooling for VQA

2019-09-20 | |  41 |   0 |   0

The current state-of-the-art model for visual question answering, as described in the following paper:

<br/>@article{fukui16mcb,<br/> title={Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding},<br/> author={Fukui, Akira and Park, Dong Huk and Yang, Daylen and Rohrbach, Anna and Darrell, Trevor and Rohrbach, Marcus},<br/> journal={arXiv:1606.01847},<br/> year={2016},<br/>}<br/>

[[arXiv](https://arxiv.org/abs/1606.01847)] [[GitHub repo](https://github.com/akirafukui/vqa-mcb/)]

无链接

上一篇:Mixture DCNN

下一篇:Pose-Aware CNN Models (PAMs) for Face Recognition

用户评价
全部评价

热门资源

  • seetafaceJNI

    项目介绍 基于中科院seetaface2进行封装的JAVA...

  • spark-corenlp

    This package wraps Stanford CoreNLP annotators ...

  • Keras-ResNeXt

    Keras ResNeXt Implementation of ResNeXt models...

  • capsnet-with-caps...

    CapsNet with capsule-wise convolution Project ...

  • inferno-boilerplate

    This is a very basic boilerplate example for pe...