ronghanghu / n2nmn
Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017
☆271Updated 4 years ago
Alternatives and similar repositories for n2nmn:
Users that are interested in n2nmn are comparing it to the libraries listed below
- Neural module networks☆405Updated 7 years ago
- Neural Module Network for VQA in Pytorch☆108Updated 7 years ago
- Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17☆164Updated 5 years ago
- Visual Question Answering Project with state of the art single Model performance.☆132Updated 6 years ago
- PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning☆170Updated 6 years ago
- Attention-based Visual Question Answering in Torch☆101Updated 7 years ago
- Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)☆498Updated 3 years ago
- visual dialog model in pytorch☆110Updated 6 years ago
- Starter code in PyTorch for the Visual Dialog challenge☆192Updated last year
- [CVPR 2017] Torch code for Visual Dialog☆228Updated 6 years ago
- ☆349Updated 6 years ago
- Tensorflow implementations of Relational Networks and a VQA dataset named Sort-of-CLEVR proposed by DeepMind.☆322Updated 6 years ago
- code for Stacked attention networks for image question answering☆108Updated 8 years ago
- Review Network for Caption Generation☆182Updated 7 years ago
- The source code for "An Actor Critic Algorithm for Structured Prediction"☆167Updated 7 years ago
- Simple Baseline for Visual Question Answering☆186Updated 8 years ago
- Visual Q&A reading list☆435Updated 6 years ago
- [ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering☆205Updated 5 years ago
- ☆221Updated 8 years ago
- Porting of Skip-Thoughts pretrained models from Theano to PyTorch & Torch7☆148Updated 5 years ago
- Train embodied agents that can answer questions in environments☆302Updated last year
- Re-implementation of the m-RNN model using TensorFLow☆109Updated 8 years ago
- Tensorflow Implementation of Deeper LSTM+ normalized CNN for Visual Question Answering☆100Updated 7 years ago
- Implementation of CVPR 2016 paper☆76Updated 3 years ago
- [CVPR 2017] AMT chat interface code used to collect the Visual Dialog dataset☆80Updated 2 years ago
- Memory, Attention and Composition (MAC) Network for CLEVR implemented in PyTorch☆85Updated 5 years ago
- Visual7W visual question answering models☆63Updated 5 years ago
- Use transformer for captioning☆156Updated 5 years ago
- Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"☆335Updated 7 years ago
- This repository provides code for reproducing experiments of the paper Talk The Walk: Navigating New York City Through Grounded Dialogue …☆112Updated 3 years ago