tkim-snu / GLACNet
GLAC Net: GLocal Attention Cascading Network for the Visual Storytelling Challenge
☆45Updated 4 years ago
Alternatives and similar repositories for GLACNet:
Users that are interested in GLACNet are comparing it to the libraries listed below
- Official Github repo of the VIST Challenge NAACL 2018☆17Updated 6 years ago
- Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"☆64Updated 2 years ago
- Memory, Attention and Composition (MAC) Network for CLEVR implemented in PyTorch☆85Updated 6 years ago
- vist story telling evaluation tool☆21Updated last year
- Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379☆96Updated 4 years ago
- Pre-trained V+L Data Preparation☆46Updated 4 years ago
- Starter code in PyTorch for the Visual Dialog challenge☆192Updated 2 years ago
- PyTorch code for Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation (AQM+) (ICLR 2019)☆50Updated 6 years ago
- Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space☆58Updated 6 years ago
- Code for the ACL paper "No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling"☆136Updated 4 years ago
- Information Maximizing Visual Question Generation☆66Updated last year
- ✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"☆45Updated 2 years ago
- DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog☆14Updated 3 years ago
- Pytorch implementation of https://arxiv.org/pdf/1909.10470.pdf☆32Updated 3 years ago
- Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"☆19Updated 6 years ago
- Transfer Learning via Unsupervised Task Discovery for Visual Question Answering☆31Updated 5 years ago
- Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)☆99Updated 2 years ago
- Visaul Question Generation as Dual Task of Visual Question Answering (PyTorch Version)☆81Updated 6 years ago
- Code for the CoNLL 2019 paper "Compositional Generalization in Image Captioning" by Mitja Nikolaus, Mostafa Abdou, Matthew Lamm, Rahul Ar…☆26Updated 4 years ago
- ☆54Updated 5 years ago
- Codes of AAAI 2020 paper "What Makes A Good Story? Designing Composite Rewards for Visual Storytelling"☆26Updated 3 years ago
- Visual Coreference Resolution in Visual Dialog using Neural Module Networks☆57Updated 3 years ago
- Use transformer for captioning☆156Updated 5 years ago
- ☆53Updated 5 years ago
- [ACL 2019] Visually Grounded Neural Syntax Acquisition☆89Updated last year
- [EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering☆173Updated 2 years ago
- PororoQA, https://arxiv.org/abs/1707.00836☆27Updated 2 years ago
- ☆30Updated 6 years ago
- Scene Graph Parsing as Dependency Parsing☆41Updated 5 years ago
- ☆44Updated 2 years ago