batra-mlp-lab/avsd

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/batra-mlp-lab/avsd)

batra-mlp-lab / avsd

[CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog

☆34

Alternatives and similar repositories for avsd

Users that are interested in avsd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yekeren / Story-Video_ads_understanding
View on GitHub
LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".
☆15Oct 30, 2020Updated 5 years ago
hudaAlamri / DSTC7-Audio-Visual-Scene-Aware-Dialog-AVSD-Challenge
View on GitHub
☆54Nov 18, 2019Updated 6 years ago
vmurahari3 / visdial-diversity
View on GitHub
Pytorch implementation of https://arxiv.org/pdf/1909.10470.pdf
☆32Aug 23, 2021Updated 4 years ago
dialogtekgeek / AudioVisualSceneAwareDialog
View on GitHub
☆27May 4, 2020Updated 6 years ago
Cloud-CV / origami-lib
View on GitHub
Python package for origami
☆16Jan 10, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
facebookresearch / corefnmn
View on GitHub
Visual Coreference Resolution in Visual Dialog using Neural Module Networks
☆58Oct 12, 2021Updated 4 years ago
facebookresearch / DVDialogues
View on GitHub
Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue
☆14Oct 12, 2021Updated 4 years ago
JaywongWang / CBP
View on GitHub
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…
☆59Mar 24, 2023Updated 3 years ago
purvaten / punny_captions
View on GitHub
An implementation of the NAACL'18 paper "Punny Captions: Witty Wordplay in Image Descriptions".
☆33Jun 27, 2018Updated 8 years ago
Cloud-CV / evalai-cli
View on GitHub
Official EvalAI Command Line Tool
☆57Jun 14, 2026Updated last month
shivamsaboo17 / GLC
View on GitHub
Gold Loss Correction for training neural networks with labels corrupted with severe noise
☆13Aug 17, 2019Updated 6 years ago
ramakanth-pasunuru / video-dialogue
View on GitHub
Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"
☆19Oct 25, 2018Updated 7 years ago
yuleiniu / rva
View on GitHub
Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"
☆64Mar 24, 2023Updated 3 years ago
facebookresearch / EmbodiedQA
View on GitHub
Train embodied agents that can answer questions in environments
☆315Jul 25, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
raghakot / ultrasound-nerve-segmentation
View on GitHub
Kaggle ultrasound nerve segmentation using Keras
☆23Jan 22, 2017Updated 9 years ago
vmurahari3 / visdial-bert
View on GitHub
Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379
☆95Mar 31, 2020Updated 6 years ago
XiangChenchao / DDPN
View on GitHub
Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding
☆23Jun 27, 2018Updated 8 years ago
bupt-cist / vqa-playground-pytorch
View on GitHub
Code for NIPS 2018 paper, "Chain of Reasoning for Visual Question Answering"
☆28Nov 23, 2018Updated 7 years ago
dialogtekgeek / DSTC8-AVSD_official
View on GitHub
DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog
☆14Jun 10, 2021Updated 5 years ago
google-research / valan
View on GitHub
Vision and Language Agent Navigation
☆85Jan 29, 2021Updated 5 years ago
gicheonkang / dan-visdial
View on GitHub
✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"
☆44Mar 19, 2023Updated 3 years ago
runzhouge / MAC
View on GitHub
MAC: Mining Activity Concepts for Language-based Temporal Localization
☆36Nov 26, 2018Updated 7 years ago
castorini / TrecQA-NegEx
View on GitHub
Code and dataset for SIGIR 2017 short paper "Automatically Extracting High-Quality Negative Examples for Answer Selection in Question Ans…
☆10Aug 1, 2017Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
taoshen58 / ReSAN
View on GitHub
☆26May 2, 2018Updated 8 years ago
lancopku / AMM
View on GitHub
The code for "An Auto-Encoder Matching Model for Learning Utterance-Level Semantic Dependency in Dialogue Generation" （EMNLP 2018)
☆47Aug 27, 2018Updated 7 years ago
j-min / language-evaluation
View on GitHub
Collection of evaluation code for natural language generation.
☆12Jan 6, 2021Updated 5 years ago
chenhongshen / HVMN
View on GitHub
☆21Dec 5, 2019Updated 6 years ago
batra-mlp-lab / visdial-challenge-starter-pytorch
View on GitHub
Starter code in PyTorch for the Visual Dialog challenge
☆188Mar 24, 2023Updated 3 years ago
shugert / DRAW
View on GitHub
DRAW: A Recurrent Neural Network For Image Generation
☆29Jul 17, 2017Updated 9 years ago
ekazakos / temporal-binding-network
View on GitHub
Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch
☆112Jan 25, 2021Updated 5 years ago
qipeng / stay-hungry-stay-focused
View on GitHub
This repository hosts the authors' implementation of the paper "Stay Hungry, Stay Focused: Generating Informative and Specific Questions …
☆26Nov 10, 2020Updated 5 years ago
allenai / cordial-sync
View on GitHub
cordial-sync is a software package than can be used to reproduce the results from the paper "A Cordial Sync: Going Beyond Marginal Polici…
☆41Jan 13, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
kdexd / lang-emerge-parlai
View on GitHub
Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI
☆104Apr 2, 2019Updated 7 years ago
wkentaro / logboard
View on GitHub
logboard: Monitor and Compare Logs on Browser/Terminal.
☆21Sep 19, 2019Updated 6 years ago
zhaoxlpku / DynaAct
View on GitHub
☆15Nov 12, 2025Updated 8 months ago
zhuowangsylu / ColluEagle
View on GitHub
Group review spammer detection
☆10Sep 9, 2019Updated 6 years ago
google-research / clevr_robot_env
View on GitHub
CLEVR-Robot: a reinforcement learning environment combining vision, language and control.
☆139Aug 4, 2024Updated last year
Abhishaike / HyperProtoNetReproduce
View on GitHub
NeurIPS 2019 Paper Implementation
☆12Nov 22, 2022Updated 3 years ago
GeWu-Lab / MUSIC-AVQA
View on GitHub
MUSIC-AVQA, CVPR2022 (ORAL)
☆100Dec 30, 2022Updated 3 years ago