Aman-4-Real/awesome-multimodal-dialogue

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Aman-4-Real/awesome-multimodal-dialogue)

Aman-4-Real / awesome-multimodal-dialogue

Paper, dataset and code list for multimodal dialogue.

☆22

Alternatives and similar repositories for awesome-multimodal-dialogue

Users that are interested in awesome-multimodal-dialogue are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Aman-4-Real / MMTG
View on GitHub
[ACM MM 2022] (Oral): Multi-Modal Experience Inspired AI Creation
☆21Nov 27, 2024Updated last year
Aman-4-Real / OpenDomainDialogCorpus
View on GitHub
Open domain Chinese dialogue corpus and datasets.
☆17Jan 8, 2022Updated 4 years ago
Aman-4-Real / CrEval
View on GitHub
[ICLR 2026] Evaluating Text Creativity across Diverse Domains: A Dataset and a Large Language Model Evaluator
☆18Feb 28, 2026Updated 5 months ago
jokieleung / Maria
View on GitHub
PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".
☆23Sep 19, 2021Updated 4 years ago
Yuco-Z / Awesome-Multi-Modal-Dialog
View on GitHub
[Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics
☆36Jan 22, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
jaromirsalamon / Awesome-Dialogue-System-Papers
View on GitHub
💬A curated list of incredible amount of publications related to Dialogue Systems especially Chatbots and Chit-chat Systems
☆10Dec 5, 2019Updated 6 years ago
Georgelingzj / up-to-date-Vision-Language-Models
View on GitHub
Up-to-date Vision Language Models collection. Mainly focus on computer vision
☆20Feb 9, 2023Updated 3 years ago
AIM3-RUC / VideoIC
View on GitHub
Danmuku dataset
☆12Jul 7, 2023Updated 3 years ago
silverriver / MMChat
View on GitHub
[LREC] MMChat: Multi-Modal Chat Dataset on Social Media
☆110Sep 25, 2022Updated 3 years ago
Aman-4-Real / PL0_Compiler
View on GitHub
PL0 Compiler 编译原理 C 语言实现的 PL/0 编译器 flex & bison
☆50Dec 26, 2019Updated 6 years ago
ahnjaewoo / MPCHAT
View on GitHub
📸 Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"
☆22Sep 5, 2023Updated 2 years ago
guxu313 / TeViS
View on GitHub
☆21Aug 26, 2025Updated 11 months ago
HAWLYQ / Qc-TextCap
View on GitHub
☆16Dec 25, 2021Updated 4 years ago
hanjanghoon / BERT_FP
View on GitHub
Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021
☆94Jul 8, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
andy-yangz / Awesome-RLHF
View on GitHub
Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD
☆23Dec 13, 2022Updated 3 years ago
AlbertTan404 / RoLD
View on GitHub
[MMM 2025 Best Paper] RoLD: Robot Latent Diffusion for Multi-Task Policy Modeling
☆24Aug 4, 2024Updated last year
zepingyu0512 / arithmetic-mechanism
View on GitHub
code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis
☆12Nov 17, 2024Updated last year
salesforce / MPT
View on GitHub
☆16Jun 12, 2023Updated 3 years ago
Haoqiu-Yan / PerceptiveAgent
View on GitHub
Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))
☆51Aug 6, 2024Updated last year
RUC-AIMind / TikTalk
View on GitHub
☆70Jun 1, 2025Updated last year
uds-lsv / MCSE
View on GitHub
NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings
☆58Jun 10, 2024Updated 2 years ago
Haskely / gsm8k-rft-llama7b-u13b_evaluation
View on GitHub
测试 https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b 的 GSM8K 分数
☆15Aug 10, 2023Updated 2 years ago
pkchat-focus / FoCus
View on GitHub
Source codes and dataset of Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge
☆61Aug 4, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
JiwanChung / esper
View on GitHub
ESPER
☆24Mar 29, 2024Updated 2 years ago
daveredrum / 3d-captioning
View on GitHub
Generate descriptions automatically for 3D shapes in ShapeNet via cross-modal joint embedding
☆15Jan 4, 2019Updated 7 years ago
kondoumh / sb2md
View on GitHub
CLI to convert Scrapbox page to Markdown
☆12Jun 27, 2026Updated last month
liziliao / MMConv
View on GitHub
Official repository for "MMConv: An Environment for Multimodal Conversational Search across Multiple Domains"
☆34Jul 15, 2021Updated 5 years ago
DaoD / KPN
View on GitHub
SIGIR 2021: Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals
☆11Jul 30, 2021Updated 4 years ago
zshyang / amg
View on GitHub
☆12Sep 15, 2024Updated last year
NUSTM / CCAC-ABSA
View on GitHub
☆10Jul 5, 2023Updated 3 years ago
IVY-LVLM / Counterfactual-Inception
View on GitHub
Official PyTorch Implementation for the "What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-mod…
☆20Sep 26, 2024Updated last year
gmftbyGMFTBY / Rep-Dropout
View on GitHub
[NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
☆41Oct 17, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
DaoD / ResearchFigure
View on GitHub
Some example codes for drawing figures in research paper
☆36Mar 3, 2022Updated 4 years ago
huayong / 3d-scene-reps-works
View on GitHub
最近几年三维场景表示相关工作的收集列表，重点关注深度学习相关的工作，包括Neural Radiance Field(NeRF)，Signed Distance Funciton(SDF)，Occupancy Field以及3D Gaussian Splatting等。不仅包…
☆13Dec 16, 2023Updated 2 years ago
datamllab / Mitigating_Gender_Bias_In_Captioning_System
View on GitHub
under review
☆14Mar 1, 2021Updated 5 years ago
nuochenpku / COMEDY
View on GitHub
This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…
☆25Nov 18, 2024Updated last year
x66ccff / liveideabench
View on GitHub
[𝐍𝐚𝐭𝐮𝐫𝐞 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal C…
☆30Apr 21, 2026Updated 3 months ago
ImKeTT / ReSee
View on GitHub
[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation
☆12Dec 4, 2023Updated 2 years ago
terenceylchow124 / Meme-MultiModal
View on GitHub
Multimodal Model for Memotion Dataset
☆12May 17, 2021Updated 5 years ago