[CVPR 2019] PyTorch code for Audio Visual Scene-Aware Dialog
☆34Feb 1, 2021Updated 5 years ago
Alternatives and similar repositories for avsd
Users who are interested in avsd are comparing it to the libraries listed below
- PyTorch implementation of https://arxiv.org/pdf/1909.10470.pdf ☆32Aug 23, 2021Updated 4 years ago
- ☆54Nov 18, 2019Updated 6 years ago
- Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"☆64Mar 24, 2023Updated 2 years ago
- ✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"☆45Mar 19, 2023Updated 2 years ago
- Code for NIPS 2018 paper, "Chain of Reasoning for Visual Question Answering"☆28Nov 23, 2018Updated 7 years ago
- Official EvalAI Command Line Tool☆57Mar 29, 2025Updated 11 months ago
- Collection of evaluation code for natural language generation.☆12Jan 6, 2021Updated 5 years ago
- Visual Coreference Resolution in Visual Dialog using Neural Module Networks☆57Oct 12, 2021Updated 4 years ago
- Code for "A Simple Baseline for Audio-Visual Scene-Aware Dialog"☆27May 26, 2020Updated 5 years ago
- cordial-sync is a software package that can be used to reproduce the results from the paper "A Cordial Sync: Going Beyond Marginal Polici…☆41Jan 13, 2021Updated 5 years ago
- DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog☆14Jun 10, 2021Updated 4 years ago
- An implementation of the NAACL'18 paper "Punny Captions: Witty Wordplay in Image Descriptions".☆33Jun 27, 2018Updated 7 years ago
- Train embodied agents that can answer questions in environments☆316Jul 25, 2023Updated 2 years ago
- Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)☆100Oct 17, 2022Updated 3 years ago
- ☆19Dec 5, 2019Updated 6 years ago
- logboard: Monitor and Compare Logs on Browser/Terminal.☆21Sep 19, 2019Updated 6 years ago
- MAC: Mining Activity Concepts for Language-based Temporal Localization☆36Nov 26, 2018Updated 7 years ago
- CVPR'17 Spotlight: What’s in a Question: Using Visual Questions as a Form of Supervision☆44Aug 31, 2018Updated 7 years ago
- Other than papers from big-name labs and universities, most AI research papers get less than 10 readers, even though there might be gems …☆15Jul 20, 2018Updated 7 years ago
- Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"☆19Oct 25, 2018Updated 7 years ago
- Video analysis using python and OpenCV☆22Jun 21, 2017Updated 8 years ago
- ☆19Feb 6, 2019Updated 7 years ago
- Vision and Language Agent Navigation☆85Jan 29, 2021Updated 5 years ago
- Kaggle ultrasound nerve segmentation using Keras☆23Jan 22, 2017Updated 9 years ago
- Implementation of the approach described in "Understanding deep features with computer-generated imagery", M. Aubry and B. Russell, ICCV…☆21Aug 15, 2023Updated 2 years ago
- MetaPix: Few-Shot Video Retargeting☆47Dec 22, 2019Updated 6 years ago
- Starter code in PyTorch for the Visual Dialog challenge☆189Mar 24, 2023Updated 2 years ago
- CLEVR-Robot: a reinforcement learning environment combining vision, language and control.☆138Aug 4, 2024Updated last year
- The code for "An Auto-Encoder Matching Model for Learning Utterance-Level Semantic Dependency in Dialogue Generation" (EMNLP 2018)☆47Aug 27, 2018Updated 7 years ago
- Evaluating Visual Conversational Agents via Cooperative Human-AI Games☆23Nov 22, 2022Updated 3 years ago
- Dataset for Visually Indicated Sound Generation by Perceptually Optimized Classification☆22Apr 6, 2020Updated 5 years ago
- Code for the paper "Representation Learning for Grounded Spatial Reasoning"☆52Jul 2, 2020Updated 5 years ago
- PyTorch code for Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation (AQM+) (ICLR 2019)☆51Feb 12, 2019Updated 7 years ago
- A simple but well-performing "single-hop" visual attention model for the GQA dataset☆20Aug 8, 2019Updated 6 years ago
- Monitors file changes and backs them up automatically at any time, fully preventing accidental deletion☆24Mar 11, 2020Updated 5 years ago
- A PyTorch implementation of some state-of-the-art models for "Temporally Language Grounding in Untrimmed Videos"☆95Sep 21, 2019Updated 6 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- ☆23Jul 20, 2017Updated 8 years ago
- ☆27May 4, 2020Updated 5 years ago