dialogtekgeek/AudioVisualSceneAwareDialog

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dialogtekgeek/AudioVisualSceneAwareDialog)

dialogtekgeek / AudioVisualSceneAwareDialog

☆27

Alternatives and similar repositories for AudioVisualSceneAwareDialog

Users that are interested in AudioVisualSceneAwareDialog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

idansc / simple-avsd
View on GitHub
Code for ''A Simple Baseline for Audio-Visual Scene-Aware Dialog``
☆27May 26, 2020Updated 6 years ago
hudaAlamri / DSTC7-Audio-Visual-Scene-Aware-Dialog-AVSD-Challenge
View on GitHub
☆54Nov 18, 2019Updated 6 years ago
dialogtekgeek / DSTC8-AVSD_official
View on GitHub
DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog
☆14Jun 10, 2021Updated 5 years ago
facebookresearch / DVDialogues
View on GitHub
Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue
☆14Oct 12, 2021Updated 4 years ago
dialogtekgeek / AVSD-DSTC10_Official
View on GitHub
Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)
☆27Aug 19, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
fidler-lab / Caption-Lifetime-by-Asking-Questions
View on GitHub
PyTorch code for Learning to Caption Images through a Lifetime by Asking Questions (ICCV 2019)
☆16Sep 17, 2019Updated 6 years ago
satwikkottur / clevr-dialog
View on GitHub
Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog
☆50Feb 18, 2020Updated 6 years ago
sanket0211 / WK-VQA
View on GitHub
World Knowledge Based Visual Question Answering
☆22Nov 26, 2020Updated 5 years ago
zilongzheng / visdial-gnn
View on GitHub
PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations
☆42Jun 30, 2021Updated 5 years ago
henryhungle / NADST
View on GitHub
Code for the paper Non-Autoregressive Dialog State Tracking (ICLR20)
☆44Feb 25, 2020Updated 6 years ago
jiasenlu / visDial.pytorch
View on GitHub
visual dialog model in pytorch
☆110May 16, 2018Updated 8 years ago
idansc / mrr-ndcg
View on GitHub
☆18Jun 10, 2024Updated 2 years ago
agakshat / visualdialog-pytorch
View on GitHub
Community Regularization of Visually Grounded Dialog https://arxiv.org/abs/1808.04359
☆15May 16, 2019Updated 7 years ago
wh0330 / CAG_VisDial
View on GitHub
☆15Aug 13, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
henryhungle / MTN
View on GitHub
Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)
☆100Oct 17, 2022Updated 3 years ago
ramakanth-pasunuru / video-dialogue
View on GitHub
Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"
☆19Oct 25, 2018Updated 7 years ago
tachi-hi / tts_samples
View on GitHub
Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…
☆15May 30, 2021Updated 5 years ago
V-Sense / 360AudioVisual
View on GitHub
This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality
☆13Jul 2, 2019Updated 7 years ago
YunseokJANG / tgif-qa
View on GitHub
Repository for our CVPR 2017 and IJCV: TGIF-QA
☆180Sep 6, 2021Updated 4 years ago
wjko2 / INQUISITIVE
View on GitHub
☆17Mar 15, 2023Updated 3 years ago
tzuhsial / pytorch-vqa-dan
View on GitHub
A PyTorch implementation of Dual Attention Network
☆30Mar 27, 2022Updated 4 years ago
ictnlp / DSTC8-AVSD
View on GitHub
We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD…
☆56Jun 12, 2023Updated 3 years ago
shubhamagarwal92 / visdial_conv
View on GitHub
This repository contains code used in our ACL'20 paper History for Visual Dialog: Do we really need it?
☆33Mar 24, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
batra-mlp-lab / avsd
View on GitHub
[CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog
☆34Feb 1, 2021Updated 5 years ago
lil-lab / atis
View on GitHub
☆45May 22, 2019Updated 7 years ago
ZihaoW123 / UniMM
View on GitHub
Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"
☆13May 12, 2023Updated 3 years ago
ZhuFengdaaa / SOON
View on GitHub
Dataset and baseline for Scenario Oriented Object Navigation (SOON)
☆25Nov 23, 2021Updated 4 years ago
Maluuba / GeNeVA_datasets
View on GitHub
Scripts to generate the CoDraw and i-CLEVR datasets used for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Gen…
☆41May 16, 2023Updated 3 years ago
afperezm / acoustic-images-distillation
View on GitHub
Code for the paper: Audio-Visual Model Distillation Using Acoustic Images
☆21Mar 24, 2023Updated 3 years ago
facebookresearch / codraw-models
View on GitHub
Models for the Collaborative Drawing (CoDraw) task
☆14Jan 15, 2019Updated 7 years ago
mwcvitkovic / Open-Vocabulary-Learning-on-Source-Code-with-a-Graph-Structured-Cache--Code-Preprocessor
View on GitHub
Library for preprocessing java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str…
☆21Oct 22, 2018Updated 7 years ago
StanfordVL / STGraph
View on GitHub
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23Mar 4, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xingyizhao / PURE
View on GitHub
Code associated with ICML (2024). "Defense against Backdoor Attack on Pre-trained Language Models via Head Pruning and Attention Normaliz…
☆11Feb 22, 2026Updated 4 months ago
yuleiniu / rva
View on GitHub
Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"
☆64Mar 24, 2023Updated 3 years ago
hchasestevens / fault-localization
View on GitHub
A fault localization tool for Python's pytest testing framework
☆25Aug 22, 2019Updated 6 years ago
enewe101 / iterable_queue
View on GitHub
A python queue that acts like an iterator and knows when producers are finished
☆14Aug 2, 2017Updated 8 years ago
taesunwhang / MVAN-VisDial
View on GitHub
PyTorch Implementation of Multi-View Attention Networks for Visual Dialog
☆43Mar 24, 2023Updated 3 years ago
yujiakimoto / mnemonic-reader
View on GitHub
PyTorch implementation of the Reinforced Mnemonic Reader + Answer Verifier model (https://arxiv.org/abs/1808.05759)
☆10Nov 23, 2018Updated 7 years ago
AlpacaLLaMa643 / All-day-CityScapes-segmentation
View on GitHub
All-day Semantic Segmentation & All-day CityScapes dataset
☆13Jul 26, 2025Updated 11 months ago