hudaAlamri/DSTC7-Audio-Visual-Scene-Aware-Dialog-AVSD-Challenge

☆54

Alternatives and similar repositories for DSTC7-Audio-Visual-Scene-Aware-Dialog-AVSD-Challenge

Users who are interested in DSTC7-Audio-Visual-Scene-Aware-Dialog-AVSD-Challenge are comparing it to the repositories listed below.

dialogtekgeek / AudioVisualSceneAwareDialog
☆27 · May 4, 2020 · Updated 6 years ago

ictnlp / DSTC8-AVSD
We ranked 1st in the DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD…
☆56 · Jun 12, 2023 · Updated 3 years ago

salesforce / BiST
Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)
☆11 · Jun 16, 2025 · Updated last year

batra-mlp-lab / visdial-challenge-starter-pytorch
Starter code in PyTorch for the Visual Dialog challenge
☆188 · Mar 24, 2023 · Updated 3 years ago

idansc / simple-avsd
Code for "A Simple Baseline for Audio-Visual Scene-Aware Dialog"
☆27 · May 26, 2020 · Updated 6 years ago

batra-mlp-lab / avsd
[CVPR 2019] PyTorch code for Audio Visual Scene-Aware Dialog
☆34 · Feb 1, 2021 · Updated 5 years ago

henryhungle / MTN
Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)
☆100 · Oct 17, 2022 · Updated 3 years ago

afperezm / acoustic-images-distillation
Code for the paper: Audio-Visual Model Distillation Using Acoustic Images
☆21 · Mar 24, 2023 · Updated 3 years ago

facebookresearch / DVDialogues
Code for DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue
☆14 · Oct 12, 2021 · Updated 4 years ago

guxd / VariationalSeq2Seq
A PyTorch implementation of "Latent Variable Dialogue Models and their Diversity"
☆18 · Nov 30, 2017 · Updated 8 years ago

bowong / Layered-Memory-Network
A Layered Memory Network for MovieQA
☆16 · Apr 27, 2018 · Updated 8 years ago

JonghwanMun / MarioQA
Repository for MarioQA: Answering Questions by Watching Gameplay Videos (ICCV 2017)
☆10 · Oct 28, 2025 · Updated 9 months ago

SALT-NLP / Persuasion_Strategy_WVAE
☆13 · Jan 8, 2021 · Updated 5 years ago

facebookresearch / simmc2
Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
☆109 · Nov 12, 2022 · Updated 3 years ago

taesunwhang / MVAN-VisDial
PyTorch Implementation of Multi-View Attention Networks for Visual Dialog
☆43 · Mar 24, 2023 · Updated 3 years ago

dialogtekgeek / DSTC6-End-to-End-Conversation-Modeling
DSTC6: End-to-End Conversation Modeling Track
☆57 · Jan 19, 2018 · Updated 8 years ago

gicheonkang / dan-visdial
✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"
☆44 · Mar 19, 2023 · Updated 3 years ago

flynnmd / deep-features-video
Scripts to extract CNN features from video frames with Keras.
☆24 · Nov 26, 2016 · Updated 9 years ago

Chenrj233 / LMEDR
Code for AAAI 2023 paper 'Learning to Memorize Entailment and Discourse Relations for Persona-Consistent Dialogues'
☆33 · May 27, 2023 · Updated 3 years ago

vmurahari3 / visdial-bert
Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379
☆95 · Mar 31, 2020 · Updated 6 years ago

google-research-datasets / recognizing-multimodal-entailment
The dataset consists of public social media url pairs and the corresponding entailment label for an external conference (ACL 2021). Each …
☆14 · Aug 16, 2021 · Updated 4 years ago

ldynx / SAVE
☆25 · Nov 22, 2024 · Updated last year

YapengTian / AVE-ECCV18
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
☆210 · Apr 3, 2021 · Updated 5 years ago

liziliao / MMConv
Official repository for "MMConv: An Environment for Multimodal Conversational Search across Multiple Domains"
☆34 · Jul 15, 2021 · Updated 5 years ago

simpleshinobu / visdial-principles
Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog"
☆31 · Feb 19, 2023 · Updated 3 years ago

xiaobai1217 / Unseen-Modality-Interaction
This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"
☆18 · Jan 22, 2024 · Updated 2 years ago

LYX0501 / SPRING
☆13 · Mar 25, 2023 · Updated 3 years ago

festvox / datasets-CMU_DoG
CMU Document Grounded Conversation Dataset
☆112 · Sep 21, 2018 · Updated 7 years ago

jayleicn / TVQA
[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering
☆181 · Oct 25, 2022 · Updated 3 years ago

nouhadziri / DialogEntailment
The implementation of the paper "Evaluating Coherence in Dialogue Systems using Entailment"
☆74 · Sep 21, 2024 · Updated last year

V-Sense / 360AudioVisual
This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality
☆13 · Jul 2, 2019 · Updated 7 years ago

wjko2 / INQUISITIVE
☆17 · Mar 15, 2023 · Updated 3 years ago

lancopku / SACT
Code for the article "Automatic Temperature Control for Neural Machine Translation" (EMNLP 2018)
☆14 · Apr 16, 2019 · Updated 7 years ago

majumderb / pabst
Code for "Unsupervised Enrichment of Persona-grounded Dialog with Background Stories", ACL 2021
☆10 · Jul 8, 2021 · Updated 5 years ago

michimoeller / liftingLayers
☆12 · Dec 8, 2022 · Updated 3 years ago

TonyNemo / UBAR-MultiWOZ
AAAI 2021: "UBAR: Towards Fully End-to-End Task-Oriented Dialog System with GPT-2"
☆97 · Mar 10, 2021 · Updated 5 years ago

Seth-Park / MultimodalExplanations
Code release for Park et al., Multimodal Explanations: Justifying Decisions and Pointing to the Evidence, in CVPR 2018
☆49 · Jul 27, 2018 · Updated 8 years ago

lizekang / ITDD
The source code of our ACL2019 paper "Incremental Transformer with Deliberation Decoder for Document Grounded Conversations"
☆86 · Aug 30, 2019 · Updated 6 years ago

iseesaw / SMP-MCC2020
Dataset and Baseline for SMP-MCC2020
☆23 · Jul 6, 2023 · Updated 3 years ago