☆54Nov 18, 2019Updated 6 years ago
Alternatives and similar repositories for DSTC7-Audio-Visual-Scene-Aware-Dialog-AVSD-Challenge
Users that are interested in DSTC7-Audio-Visual-Scene-Aware-Dialog-AVSD-Challenge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆27May 4, 2020Updated 5 years ago
- We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD…☆56Jun 12, 2023Updated 2 years ago
- Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)☆11Jun 16, 2025Updated 10 months ago
- Starter code in PyTorch for the Visual Dialog challenge☆189Mar 24, 2023Updated 3 years ago
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Aug 19, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for ''A Simple Baseline for Audio-Visual Scene-Aware Dialog``☆27May 26, 2020Updated 5 years ago
- [CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog☆34Feb 1, 2021Updated 5 years ago
- Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)☆100Oct 17, 2022Updated 3 years ago
- Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue☆14Oct 12, 2021Updated 4 years ago
- A pytorch implementation of "Latent Variable Dialogue Models and their Diversity"☆18Nov 30, 2017Updated 8 years ago
- PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning☆169Oct 10, 2018Updated 7 years ago
- A Layered Memory Network for MovieQA☆16Apr 27, 2018Updated 7 years ago
- Repository for MarioQA: Answering Questions by Watching Gameplay Videos in ICCV 2017☆10Oct 28, 2025Updated 5 months ago
- Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing☆24Dec 29, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations☆107Nov 12, 2022Updated 3 years ago
- This repository contains the Pytorch implementation for our SCAI (EMNLP-2018) submission "A Knowledge-Grounded Multimodal Search-Based Co…☆30Jun 4, 2020Updated 5 years ago
- DSTC6: End-to-End Conversation Modeling Track☆57Jan 19, 2018Updated 8 years ago
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Apr 6, 2023Updated 3 years ago
- ✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"☆45Mar 19, 2023Updated 3 years ago
- PyTorch Implementation of Multi-View Attention Networks for Visual Dialog☆43Mar 24, 2023Updated 3 years ago
- Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379☆97Mar 31, 2020Updated 6 years ago
- Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"☆64Mar 24, 2023Updated 3 years ago
- The dataset consists of public social media url pairs and the corresponding entailment label for an external conference (ACL 2021). Each …☆14Aug 16, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This code extracts context embedding from sentence☆27Jul 4, 2018Updated 7 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆48Jun 22, 2024Updated last year
- Official repository for "MMConv: An Environment for Multimodal Conversational Search across Multiple Domains"☆34Jul 15, 2021Updated 4 years ago
- Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018☆208Apr 3, 2021Updated 5 years ago
- Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog"☆31Feb 19, 2023Updated 3 years ago
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"☆18Jan 22, 2024Updated 2 years ago
- CMU Document Grounded Conversation Dataset☆112Sep 21, 2018Updated 7 years ago
- [EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering☆182Oct 25, 2022Updated 3 years ago
- The implementation of the paper "Evaluating Coherence in Dialogue Systems using Entailment"☆74Sep 21, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- BossNet: Disentangling Language and Knowledge in Task Oriented Dialogs☆17Dec 8, 2022Updated 3 years ago
- [ACL 2019]: Interconnected Question Generation with Coreference Alignment and Conversation Flow Modeling☆88Apr 5, 2020Updated 6 years ago
- Boiler plate code for Torch based ML projects☆10Jul 14, 2021Updated 4 years ago
- Multi-turn dialogue baselines written in PyTorch☆163Mar 10, 2020Updated 6 years ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆24Jan 6, 2026Updated 3 months ago
- ☆10Jun 9, 2017Updated 8 years ago
- This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality☆13Jul 2, 2019Updated 6 years ago