ronghanghu / n2nmn
Code release for Hu et al., "Learning to Reason: End-to-End Module Networks for Visual Question Answering", ICCV 2017.
☆271 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for n2nmn
- Neural Module Network for VQA in PyTorch ☆108 · Updated 6 years ago
- PyTorch implementation of the winner of the VQA Challenge Workshop at CVPR'17 ☆164 · Updated 5 years ago
- Visual Question Answering project with state-of-the-art single-model performance. ☆132 · Updated 6 years ago
- Neural module networks ☆403 · Updated 7 years ago
- PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning ☆170 · Updated 6 years ago
- Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018) ☆497 · Updated 3 years ago
- Visual dialog model in PyTorch ☆110 · Updated 6 years ago
- Starter code in PyTorch for the Visual Dialog challenge ☆192 · Updated last year
- [CVPR 2017] Torch code for Visual Dialog ☆228 · Updated 5 years ago
- ☆349 · Updated 6 years ago
- Code for Stacked Attention Networks for Image Question Answering ☆108 · Updated 7 years ago
- The source code for "An Actor Critic Algorithm for Structured Prediction" ☆167 · Updated 7 years ago
- Simple Baseline for Visual Question Answering ☆186 · Updated 7 years ago
- TensorFlow implementation of Deeper LSTM + normalized CNN for Visual Question Answering ☆100 · Updated 7 years ago
- Visual Q&A reading list ☆435 · Updated 6 years ago
- Re-implementation of the m-RNN model using TensorFlow ☆109 · Updated 8 years ago
- [ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering ☆205 · Updated 5 years ago
- ☆222 · Updated 8 years ago
- The toolbox for the Google Refexp dataset proposed in this paper: http://arxiv.org/abs/1511.02283 ☆162 · Updated 7 years ago
- Attention-based Visual Question Answering in Torch ☆101 · Updated 7 years ago
- Mixed Incremental Cross-Entropy REINFORCE ICLR 2016 ☆332 · Updated 7 years ago
- Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017 ☆148 · Updated 5 years ago
- [CVPR 2017] AMT chat interface code used to collect the Visual Dialog dataset ☆80 · Updated 2 years ago
- Generate captions for an image using PyTorch ☆128 · Updated 7 years ago
- Toolkit for Visual7W visual question answering dataset ☆76 · Updated 5 years ago
- An implementation of key-value memory networks in TensorFlow ☆248 · Updated 6 years ago
- TensorFlow implementations of Relational Networks and a VQA dataset named Sort-of-CLEVR proposed by DeepMind. ☆322 · Updated 5 years ago
- Image Caption and Text to Image papers. ☆68 · Updated 6 years ago
- Memory, Attention and Composition (MAC) Network for CLEVR implemented in PyTorch ☆85 · Updated 5 years ago
- Implementation of CVPR 2016 paper ☆76 · Updated 3 years ago