phellonchen/awesome-visual-dialog

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/phellonchen/awesome-visual-dialog)

phellonchen / awesome-visual-dialog

Recent Advances in Visual Dialog

☆28

Alternatives and similar repositories for awesome-visual-dialog

Users that are interested in awesome-visual-dialog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZihaoW123 / UniMM
View on GitHub
Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"
☆13May 12, 2023Updated 3 years ago
HKUST-KnowComp / VD-PCR
View on GitHub
Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"
☆10Nov 1, 2022Updated 3 years ago
wh0330 / CAG_VisDial
View on GitHub
☆15Aug 13, 2020Updated 5 years ago
HKUST-KnowComp / Visual_PCR
View on GitHub
Dataset and Source code for EMNLP 2019 paper "What You See is What You Get: Visual Pronoun Coreference Resolution in Dialogues"
☆26Sep 10, 2021Updated 4 years ago
MengyuanChen21 / Awesome-Visual-Dialog
View on GitHub
A curated publication list on visual dialog
☆14May 8, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
zilongzheng / visdial-gnn
View on GitHub
PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations
☆42Jun 30, 2021Updated 5 years ago
gicheonkang / dan-visdial
View on GitHub
✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"
☆44Mar 19, 2023Updated 3 years ago
gicheonkang / gst-visdial
View on GitHub
Official PyTorch Implementation for CVPR'23 Paper, "The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training"
☆20Dec 11, 2023Updated 2 years ago
salesforce / VD-BERT
View on GitHub
☆45Jun 16, 2025Updated last year
sairin1202 / Commonsense-Knowledge-Aware-Concept-Selection-For-Diverse-and-Informative-Visual-Storytelling
View on GitHub
The implement of Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling
☆12Aug 19, 2021Updated 4 years ago
idansc / fga
View on GitHub
☆30Oct 20, 2021Updated 4 years ago
CrossmodalGroup / SSL-VQA
View on GitHub
Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
☆52Aug 21, 2020Updated 5 years ago
YuJungHeo / kbvqa-public
View on GitHub
☆40Nov 29, 2022Updated 3 years ago
kevalnagda / StoryGeneration
View on GitHub
Hierarchical Story Generation based on (https://arxiv.org/abs/1805.04833)
☆11May 6, 2020Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
NeverMoreLCH / Awesome-Video-Grounding
View on GitHub
A reading list of papers about Visual Grounding.
☆31Aug 24, 2022Updated 3 years ago
MengyuanChen21 / CVPR2022-FTCL
View on GitHub
[CVPR 2022] Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization
☆45Jul 18, 2023Updated 3 years ago
baoqianyue / DFC2021-Track-MSD
View on GitHub
Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD
☆10Mar 31, 2021Updated 5 years ago
TIMMY-CHAN / MISS
View on GitHub
[ICANN 2024 (Oral)] MISS: A Generative Pre-training and Fine-tuning Approach for Med-VQA
☆12Aug 8, 2024Updated last year
Impression2805 / OpenMix
View on GitHub
PyTorch implementation of our CVPR2023 paper "OpenMix: Exploring Out-of-Distribution samples for Misclassification Detection"
☆28Oct 16, 2023Updated 2 years ago
najamnazar / designpatterndetection
View on GitHub
☆13Feb 18, 2022Updated 4 years ago
facebookresearch / corefnmn
View on GitHub
Visual Coreference Resolution in Visual Dialog using Neural Module Networks
☆58Oct 12, 2021Updated 4 years ago
yuleiniu / rva
View on GitHub
Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"
☆64Mar 24, 2023Updated 3 years ago
facebookresearch / DVDialogues
View on GitHub
Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue
☆14Oct 12, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
SijieSong / CVPR21-Cogrounding_semantic_attention
View on GitHub
☆14Jul 13, 2021Updated 5 years ago
maximek3 / e-ViL
View on GitHub
☆41Nov 23, 2022Updated 3 years ago
ImKeTT / ReSee
View on GitHub
[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation
☆12Dec 4, 2023Updated 2 years ago
THUNLP-MT / ActiView
View on GitHub
☆11Dec 20, 2024Updated last year
Aman-4-Real / See-or-Guess
View on GitHub
[ACM MM 2024] See or Guess: Counterfactually Regularized Image Captioning
☆16Feb 17, 2025Updated last year
taesunwhang / MVAN-VisDial
View on GitHub
PyTorch Implementation of Multi-View Attention Networks for Visual Dialog
☆43Mar 24, 2023Updated 3 years ago
claws-lab / multimodal-robustness
View on GitHub
Code and resources for EMNLP 2022 paper on 'Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions'
☆10Mar 11, 2024Updated 2 years ago
MCG-NJU / TRACE
View on GitHub
[ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation
☆60Aug 27, 2022Updated 3 years ago
amirhakh / persian-beamer
View on GitHub
Persian template for slide with beamer and xepersian (in LaTeX)
☆14May 31, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jonmun / MM-SADA_Domain_Adaptation_Splits
View on GitHub
This repository contains the annotations used for evaluating Unsupervised Domain Adaptation on EPIC Kitchens, with individual kitchens us…
☆13Jun 2, 2020Updated 6 years ago
phellonchen / awesome-Vision-and-Language-Pre-training
View on GitHub
Recent Advances in Vision and Language Pre-training (VLP)
☆297Jun 6, 2023Updated 3 years ago
cleve / lmdb-viewer
View on GitHub
GUI to navigate over LMDB data
☆20Mar 28, 2026Updated 3 months ago
CGCL-codes / TreeCen
View on GitHub
☆13Oct 30, 2022Updated 3 years ago
Hugo101 / HyperEvidentialNN
View on GitHub
☆13Feb 12, 2024Updated 2 years ago
pseudoPixels / CloneCognition
View on GitHub
Machine Learning based Source Code Clone validation tool.
☆14May 8, 2019Updated 7 years ago
batra-mlp-lab / visdial-challenge-starter-pytorch
View on GitHub
Starter code in PyTorch for the Visual Dialog challenge
☆188Mar 24, 2023Updated 3 years ago