satwikkottur/clevr-dialog

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/satwikkottur/clevr-dialog)

satwikkottur / clevr-dialog

Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog

☆50

Alternatives and similar repositories for clevr-dialog

Users that are interested in clevr-dialog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zilongzheng / visdial-gnn
View on GitHub
PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations
☆42Jun 30, 2021Updated 5 years ago
yuleiniu / rva
View on GitHub
Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"
☆64Mar 24, 2023Updated 3 years ago
idansc / mrr-ndcg
View on GitHub
☆18Jun 10, 2024Updated 2 years ago
batra-mlp-lab / visdial-rl
View on GitHub
PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning
☆169Oct 10, 2018Updated 7 years ago
dialogtekgeek / AudioVisualSceneAwareDialog
View on GitHub
☆27May 4, 2020Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
naver / aqm-plus
View on GitHub
PyTorch code for Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation (AQM+) (ICLR 2019)
☆51Feb 12, 2019Updated 7 years ago
GuessWhatGame / guesswhat
View on GitHub
GuessWhat?! Baselines
☆74Jul 12, 2022Updated 4 years ago
jiasenlu / visDial.pytorch
View on GitHub
visual dialog model in pytorch
☆110May 16, 2018Updated 8 years ago
salesforce / BiST
View on GitHub
Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)
☆11Jun 16, 2025Updated last year
agakshat / visualdialog-pytorch
View on GitHub
Community Regularization of Visually Grounded Dialog https://arxiv.org/abs/1808.04359
☆15May 16, 2019Updated 7 years ago
gicheonkang / dan-visdial
View on GitHub
✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"
☆44Mar 19, 2023Updated 3 years ago
batra-mlp-lab / visdial-challenge-starter-pytorch
View on GitHub
Starter code in PyTorch for the Visual Dialog challenge
☆188Mar 24, 2023Updated 3 years ago
vmurahari3 / visdial-bert
View on GitHub
Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379
☆95Mar 31, 2020Updated 6 years ago
henryhungle / MTN
View on GitHub
Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)
☆100Oct 17, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
dialogtekgeek / DSTC8-AVSD_official
View on GitHub
DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog
☆14Jun 10, 2021Updated 5 years ago
kexinyi / ns-vqa
View on GitHub
Neural-symbolic visual question answering
☆283Mar 27, 2023Updated 3 years ago
ronghanghu / lcgn
View on GitHub
Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019
☆92Aug 9, 2019Updated 6 years ago
daqingliu / NMTree
View on GitHub
Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)
☆38Nov 23, 2019Updated 6 years ago
shubhamagarwal92 / visdial_conv
View on GitHub
This repository contains code used in our ACL'20 paper History for Visual Dialog: Do we really need it?
☆33Mar 24, 2023Updated 3 years ago
idansc / fga
View on GitHub
☆30Oct 20, 2021Updated 4 years ago
facebookresearch / clevr-dataset-gen
View on GitHub
A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
☆653Aug 30, 2021Updated 4 years ago
HKUST-KnowComp / Visual_PCR
View on GitHub
Dataset and Source code for EMNLP 2019 paper "What You See is What You Get: Visual Pronoun Coreference Resolution in Dialogues"
☆26Sep 10, 2021Updated 4 years ago
linjieli222 / VQA_ReGAT
View on GitHub
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"
☆187Apr 15, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Cyanogenoid / vqa-counting
View on GitHub
[ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering
☆208Mar 5, 2019Updated 7 years ago
ramakanth-pasunuru / video-dialogue
View on GitHub
Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"
☆19Oct 25, 2018Updated 7 years ago
simpleshinobu / visdial-principles
View on GitHub
Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog"
☆31Feb 19, 2023Updated 3 years ago
shijx12 / XNM-Net
View on GitHub
Pytorch implementation of "Explainable and Explicit Visual Reasoning over Scene Graphs "
☆94Mar 17, 2019Updated 7 years ago
wh0330 / CAG_VisDial
View on GitHub
☆15Aug 13, 2020Updated 5 years ago
facebookresearch / grid-feats-vqa
View on GitHub
Grid features pre-training code for visual question answering
☆269Sep 17, 2021Updated 4 years ago
Kelym / FAST
View on GitHub
Code for "Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation"
☆62Sep 24, 2019Updated 6 years ago
kdexd / probnmn-clevr
View on GitHub
Code for ICML 2019 paper "Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering" [long-oral]
☆68Aug 3, 2023Updated 2 years ago
facebookresearch / EmbodiedQA
View on GitHub
Train embodied agents that can answer questions in environments
☆315Jul 25, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
XiaoxiaoGuo / fashion-retrieval
View on GitHub
This repository contains an implementation of the models introduced in the paper Dialog-based Interactive Image Retrieval. The network is…
☆70Oct 4, 2020Updated 5 years ago
expectorlin / DR-Attacker
View on GitHub
code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021)
☆10Jul 15, 2022Updated 4 years ago
vmurahari3 / visdial-diversity
View on GitHub
Pytorch implementation of https://arxiv.org/pdf/1909.10470.pdf
☆32Aug 23, 2021Updated 4 years ago
ExplorerFreda / VGNSL
View on GitHub
[ACL 2019] Visually Grounded Neural Syntax Acquisition
☆90Feb 24, 2024Updated 2 years ago
itaigat / removing-bias-in-multi-modal-classifiers
View on GitHub
☆34Jan 5, 2021Updated 5 years ago
jayleicn / TVQAplus
View on GitHub
[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
☆132Oct 25, 2022Updated 3 years ago
zhaoyanpeng / vpcfg
View on GitHub
Visually Grounded PCFG Induction
☆38May 18, 2022Updated 4 years ago