MengyuanChen21/Awesome-Visual-Dialog

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MengyuanChen21/Awesome-Visual-Dialog)

MengyuanChen21 / Awesome-Visual-Dialog

A curated publication list on visual dialog

☆14

Alternatives and similar repositories for Awesome-Visual-Dialog

Users that are interested in Awesome-Visual-Dialog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MengyuanChen21 / CVPR2023-OWTAL
View on GitHub
[CVPR 2023] Cascade Evidential Learning for Open-world Weakly-supervised Temporal Action Localization
☆12Jul 9, 2024Updated 2 years ago
MengyuanChen21 / ECCV2022-DELU
View on GitHub
[ECCV 2022] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization
☆49Apr 19, 2024Updated 2 years ago
MengyuanChen21 / CVPR2022-FTCL
View on GitHub
[CVPR 2022] Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization
☆45Jul 18, 2023Updated 3 years ago
MengyuanChen21 / ICLR2024-REDL
View on GitHub
[ICLR 2024 Spotlight] R-EDL: Relaxing Nonessential Settings of Evidential Deep Learning
☆139Nov 18, 2024Updated last year
MengyuanChen21 / CVPR2023-CMPAE
View on GitHub
[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
☆37Jun 17, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gicheonkang / gst-visdial
View on GitHub
Official PyTorch Implementation for CVPR'23 Paper, "The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training"
☆20Dec 11, 2023Updated 2 years ago
phellonchen / awesome-visual-dialog
View on GitHub
Recent Advances in Visual Dialog
☆28Aug 19, 2022Updated 3 years ago
koalazf99 / tacube
View on GitHub
[EMNLP 2022] TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data
☆17May 17, 2023Updated 3 years ago
liy1shu / FlowBotHD
View on GitHub
FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation
☆13Dec 13, 2024Updated last year
MrZihan / Sim2Real-VLN-3DFF
View on GitHub
Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).
☆80Dec 26, 2025Updated 7 months ago
ZihaoW123 / UniMM
View on GitHub
Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"
☆13May 12, 2023Updated 3 years ago
HKUST-KnowComp / VD-PCR
View on GitHub
Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"
☆10Nov 1, 2022Updated 3 years ago
MengyuanChen21 / NeurIPS2024-CSP
View on GitHub
[NeurIPS 2024] Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models
☆40Oct 17, 2024Updated last year
HITsz-TMG / Cognitive-Visual-Language-Mapper
View on GitHub
The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens…
☆17Jan 24, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Suhan-Ling / Coarse-to-fine_Affordance
View on GitHub
☆16Oct 10, 2024Updated last year
ItemZheng / KDDAug
View on GitHub
[ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering
☆13Nov 23, 2022Updated 3 years ago
ImKeTT / ReSee
View on GitHub
[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation
☆12Dec 4, 2023Updated 2 years ago
Aman-4-Real / See-or-Guess
View on GitHub
[ACM MM 2024] See or Guess: Counterfactually Regularized Image Captioning
☆16Feb 17, 2025Updated last year
gicheonkang / sglkt-visdial
View on GitHub
🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"
☆13Feb 1, 2023Updated 3 years ago
jonmun / MM-SADA_Domain_Adaptation_Splits
View on GitHub
This repository contains the annotations used for evaluating Unsupervised Domain Adaptation on EPIC Kitchens, with individual kitchens us…
☆13Jun 2, 2020Updated 6 years ago
Hugo101 / HyperEvidentialNN
View on GitHub
☆13Feb 12, 2024Updated 2 years ago
megvii-research / US3L-CVPR2023
View on GitHub
PyTorch implementation of US3L (Accepted to CVPR2023)
☆33Mar 15, 2023Updated 3 years ago
lisa-wm / entropybaseduq
View on GitHub
☆12Apr 4, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Delicate2000 / MMpedia
View on GitHub
☆15Jul 20, 2023Updated 3 years ago
Peiyannn / MM-PDE
View on GitHub
[ICLR24] Better Neural PDE Solvers Through Data-Free Mesh Movers
☆17Mar 20, 2024Updated 2 years ago
alexsax / robust-policies-via-midlevel-vision
View on GitHub
☆17Nov 16, 2020Updated 5 years ago
fyyCS / LSLD
View on GitHub
☆14Nov 13, 2023Updated 2 years ago
CrawlScript / RpHGNN
View on GitHub
Source code and dataset of the paper "Efficient Heterogeneous Graph Learning via Random Projection"
☆97Aug 26, 2024Updated last year
hrugved06 / Olympics-Medal-Prediction
View on GitHub
The goal of this project is to make a prediction model which will predict whether an athlete will win a medal or not.
☆10Sep 17, 2021Updated 4 years ago
WYuan1001 / AdaVD
View on GitHub
[CVPR2025] Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters
☆44Mar 11, 2025Updated last year
Panda-Shawn / VLA-OS-Language-Planning-Labeling
View on GitHub
Language planning labeling for VLA-OS
☆16Jun 25, 2025Updated last year
TianZhuAI4S / DiffAffinity
View on GitHub
Predicting mutational effects on protein-protein binding via a side-chain diffusion probabilistic model (NeurIPS 2023 Poster)
☆37Dec 11, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SqrtiZhang / openreview_ICRL2024_analysis
View on GitHub
☆10Nov 28, 2023Updated 2 years ago
chenyaofo / image-classification-codebase
View on GitHub
Image Classification Codebase with PyTorch
☆15Sep 10, 2025Updated 10 months ago
mingyuliutw / JointGeodesicUpsampling
View on GitHub
Joint geodesic upsampling
☆12Jan 16, 2018Updated 8 years ago
StanfordVL / HMS
View on GitHub
The repository of the code base of "Multi-Layer Semantic and Geometric Modeling with Neural Message Passing in 3D Scene Graphs for Hierar…
☆25Mar 13, 2021Updated 5 years ago
mvrl / ConText-CIR
View on GitHub
[CVPR'25] ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval
☆16Jun 17, 2026Updated last month
Kai-46 / DepthSensing
View on GitHub
project website for "depth sensing beyond LiDAR range"
☆11Jul 28, 2020Updated 5 years ago
YasminZhang / EBAMA
View on GitHub
[ECCV 2024] Official repository of ECCV 2024 paper: Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion M…
☆16May 24, 2025Updated last year