
Multimodal Compact Bilinear Pooling for VQA

2019-09-20

Multimodal Compact Bilinear Pooling (MCB) is a model for visual question answering (VQA), reported as state of the art when it was published in 2016. It is described in the following paper:

@article{fukui16mcb,
  title={Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding},
  author={Fukui, Akira and Park, Dong Huk and Yang, Daylen and Rohrbach, Anna and Darrell, Trevor and Rohrbach, Marcus},
  journal={arXiv:1606.01847},
  year={2016},
}

[[arXiv](https://arxiv.org/abs/1606.01847)] [[GitHub repo](https://github.com/akirafukui/vqa-mcb/)]
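For readers new to the pooling step named in the title, here is a minimal NumPy sketch. It assumes the Count Sketch plus FFT formulation of compact bilinear pooling: each input vector is projected to `d` dimensions with a random hash and random signs, and the two projections are combined by circular convolution, computed as an elementwise product in the frequency domain. The function names, toy dimensions, and random inputs are illustrative assumptions and are not taken from the vqa-mcb repository.

```python
import numpy as np


def make_sketch_params(n, d, rng):
    """Draw a random hash h: [n] -> [d] and random signs s: [n] -> {-1, +1}."""
    h = rng.integers(0, d, size=n)
    s = rng.choice(np.array([-1.0, 1.0]), size=n)
    return h, s


def count_sketch(v, h, s, d):
    """Project v (length n) to length d via out[h[i]] += s[i] * v[i]."""
    out = np.zeros(d)
    np.add.at(out, h, s * v)  # unbuffered add, so repeated indices accumulate
    return out


def mcb_pool(x, q, params_x, params_q, d):
    """Sketch of the flattened outer product of x and q, in d dimensions."""
    sx = count_sketch(x, *params_x, d)
    sq = count_sketch(q, *params_q, d)
    # An elementwise product of spectra equals circular convolution of sx and sq.
    return np.fft.irfft(np.fft.rfft(sx) * np.fft.rfft(sq), n=d)


rng = np.random.default_rng(0)
n_x, n_q, d = 8, 6, 16            # toy sizes, chosen only for illustration
x = rng.standard_normal(n_x)      # stand-in for an image feature vector
q = rng.standard_normal(n_q)      # stand-in for a question feature vector
params_x = make_sketch_params(n_x, d, rng)
params_q = make_sketch_params(n_q, d, rng)
pooled = mcb_pool(x, q, params_x, params_q, d)
```

The result has length `d` regardless of the input sizes, so the full `n_x * n_q` outer product is never materialized. In a trained model the hashes and signs would be drawn once and then held fixed.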
