Cloud-CV / VQA
CloudCV Visual Question Answering Demo
☆66Updated 2 years ago
Related projects
Alternatives and complementary repositories for VQA
- Attention-based Visual Question Answering in Torch☆101Updated 7 years ago
- Visual Question Generation as Dual Task of Visual Question Answering (PyTorch Version)☆82Updated 6 years ago
- Visual Question Answering Project with state of the art single Model performance.☆132Updated 6 years ago
- Starter code in PyTorch for the Visual Dialog challenge☆192Updated last year
- ☆73Updated 6 years ago
- An implementation of the NAACL'18 paper "Punny Captions: Witty Wordplay in Image Descriptions".☆33Updated 6 years ago
- Tensorflow Implementation of Deeper LSTM+ normalized CNN for Visual Question Answering☆100Updated 7 years ago
- Image Caption and Text to Image papers.☆68Updated 6 years ago
- Learning to Evaluate Image Captioning. CVPR 2018☆83Updated 6 years ago
- visual dialog model in pytorch☆110Updated 6 years ago
- Pytorch implementation of winner from VQA Challenge Workshop in CVPR'17☆164Updated 5 years ago
- ☆20Updated 5 years ago
- CoDraw dataset☆93Updated 5 years ago
- Structured Attentions for Visual Question Answering☆47Updated 6 years ago
- [COLING 2018] Learning Visually-Grounded Semantics from Contrastive Adversarial Samples.☆57Updated 5 years ago
- Contains approaches introduced in the MovieQA benchmark dataset paper☆80Updated 7 years ago
- [ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering☆205Updated 5 years ago
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆33Updated 5 years ago
- Code for Interpretable Counting for Visual Question Answering for ICLR 2018 reproducibility challenge.☆18Updated 6 years ago
- Visual7W visual question answering models☆62Updated 5 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆54Updated 6 years ago
- A Pytorch tutorial for implementation of Dynamic memory Network Plus☆64Updated 6 years ago
- Co-attending Regions and Detections for VQA.☆41Updated 6 years ago
- Implementation for our paper "Phrase Localization and Visual Relationship Detection with Comprehensive Image-Language Cues."☆39Updated 7 years ago
- Self-supervised learning of visual features through embedding images into text topic spaces☆95Updated 2 years ago
- ☆349Updated 6 years ago
- Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017☆148Updated 5 years ago
- [CVPR 2017] AMT chat interface code used to collect the Visual Dialog dataset☆80Updated 2 years ago
- Visual Storytelling API☆35Updated 7 years ago
- Code for the Grounded Visual Question Answering (GVQA) model from the paper -- Don't Just Assume; Look and Answer: Overcoming Priors for …☆21Updated 2 years ago