idansc/simple-avsd

Code for "A Simple Baseline for Audio-Visual Scene-Aware Dialog"

☆27

Alternatives and similar repositories for simple-avsd

Users who are interested in simple-avsd are comparing it to the repositories listed below.

idansc / fga
☆30 · Oct 20, 2021 · Updated 4 years ago
dialogtekgeek / AudioVisualSceneAwareDialog
☆27 · May 4, 2020 · Updated 6 years ago
dialogtekgeek / DSTC8-AVSD_official
DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog
☆14 · Jun 10, 2021 · Updated 5 years ago
simpleshinobu / visdial-principles
Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog"
☆31 · Feb 19, 2023 · Updated 3 years ago
zilongzheng / visdial-gnn
PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations
☆42 · Jun 30, 2021 · Updated 5 years ago
ramakanth-pasunuru / video-dialogue
Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"
☆19 · Oct 25, 2018 · Updated 7 years ago
salesforce / BiST
Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)
☆11 · Jun 16, 2025 · Updated last year
satwikkottur / clevr-dialog
Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog
☆50 · Feb 18, 2020 · Updated 6 years ago
yuleiniu / rva
Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"
☆64 · Mar 24, 2023 · Updated 3 years ago
hudaAlamri / DSTC7-Audio-Visual-Scene-Aware-Dialog-AVSD-Challenge
☆54 · Nov 18, 2019 · Updated 6 years ago
gicheonkang / dan-visdial
✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"
☆44 · Mar 19, 2023 · Updated 3 years ago
henryhungle / NADST
Code for the paper Non-Autoregressive Dialog State Tracking (ICLR20)
☆44 · Feb 25, 2020 · Updated 6 years ago
facebookresearch / corefnmn
Visual Coreference Resolution in Visual Dialog using Neural Module Networks
☆58 · Oct 12, 2021 · Updated 4 years ago
quangvnai / visdial
Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)
☆29 · Aug 5, 2021 · Updated 4 years ago
wh0330 / CAG_VisDial
☆15 · Aug 13, 2020 · Updated 5 years ago
vmurahari3 / visdial-bert
Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379
☆95 · Mar 31, 2020 · Updated 6 years ago
youjiangxu / seqvlad-pytorch
The implementation of Sequential VLAD in Pytorch
☆20 · Jun 20, 2019 · Updated 7 years ago
jiasenlu / visDial.pytorch
visual dialog model in pytorch
☆110 · May 16, 2018 · Updated 8 years ago
batra-mlp-lab / avsd
[CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog
☆34 · Feb 1, 2021 · Updated 5 years ago
ZJULearning / TreeAttention
A Better Way to Attend: Attention with Trees for Video Question Answering
☆25 · Mar 25, 2019 · Updated 7 years ago
henryhungle / MTN
Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)
☆100 · Oct 17, 2022 · Updated 3 years ago
V-Sense / 360AudioVisual
This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality
☆13 · Jul 2, 2019 · Updated 7 years ago
tzuhsial / pytorch-vqa-dan
A PyTorch implementation of Dual Attention Network
☆30 · Mar 27, 2022 · Updated 4 years ago
fidler-lab / Caption-Lifetime-by-Asking-Questions
PyTorch code for Learning to Caption Images through a Lifetime by Asking Questions (ICCV 2019)
☆16 · Sep 17, 2019 · Updated 6 years ago
batra-mlp-lab / visdial-rl
PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning
☆169 · Oct 10, 2018 · Updated 7 years ago
shubhamagarwal92 / visdial_conv
This repository contains code used in our ACL'20 paper History for Visual Dialog: Do we really need it?
☆33 · Mar 24, 2023 · Updated 3 years ago
panthap2 / AssociatingNLCommentCodeEntities
Dataset and code corresponding to Associating Natural Language Comment and Source Code Entities (AAAI 2020)
☆20 · Oct 24, 2020 · Updated 5 years ago
lil-lab / atis
☆44 · May 22, 2019 · Updated 7 years ago
tehmaze / natural
Convert data to their natural (human-readable) format
☆30 · Nov 4, 2021 · Updated 4 years ago
OSUPCVLab / VideoToTextDNN
MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.
☆24 · Jul 12, 2019 · Updated 7 years ago
ZihaoW123 / UniMM
Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"
☆13 · May 12, 2023 · Updated 3 years ago
ronghanghu / lcgn
Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019
☆92 · Aug 9, 2019 · Updated 6 years ago
mwcvitkovic / Open-Vocabulary-Learning-on-Source-Code-with-a-Graph-Structured-Cache--Code-Preprocessor
Library for preprocessing java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str…
☆21 · Oct 22, 2018 · Updated 7 years ago
StanfordVL / STGraph
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23 · Mar 4, 2020 · Updated 6 years ago
daqingliu / CAVP
Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Networ…
☆46 · Jul 27, 2019 · Updated 6 years ago
marianoguerra / band-blocks
A blockly frontend for band.js
☆14 · Feb 27, 2017 · Updated 9 years ago
hchasestevens / fault-localization
A fault localization tool for Python's pytest testing framework
☆25 · Aug 22, 2019 · Updated 6 years ago
facebookresearch / DVDialogues
Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue
☆14 · Oct 12, 2021 · Updated 4 years ago
debadeepta / vnla
Code accompanying the CVPR 2019 paper: https://arxiv.org/abs/1812.04155
☆61 · Mar 30, 2022 · Updated 4 years ago