Paper, dataset and code list for multimodal dialogue.
☆22Jan 2, 2025Updated last year
Alternatives and similar repositories for awesome-multimodal-dialogue
Users that are interested in awesome-multimodal-dialogue are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACM MM 2022]: Multi-Modal Experience Inspired AI Creation☆21Nov 27, 2024Updated last year
- Open domain Chinese dialogue corpus and datasets.☆17Jan 8, 2022Updated 4 years ago
- PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".☆23Sep 19, 2021Updated 4 years ago
- [Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics☆37Jan 22, 2025Updated last year
- TL;DR: We propose a large-scale cross-domain persuasion dataset covers 13,000 scenarios in 35 domains, with the developed PersuGPT model …☆17Feb 12, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Danmuku dataset☆12Jul 7, 2023Updated 2 years ago
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆108Sep 25, 2022Updated 3 years ago
- PL0 Compiler 编译原理 C 语言 实现的 PL/0 编译器 flex & bison☆50Dec 26, 2019Updated 6 years ago
- 📸 Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"☆22Sep 5, 2023Updated 2 years ago
- ☆21Aug 26, 2025Updated 8 months ago
- Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021☆95Jul 8, 2021Updated 4 years ago
- Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD☆23Dec 13, 2022Updated 3 years ago
- [MMM 2025 Best Paper] RoLD: Robot Latent Diffusion for Multi-Task Policy Modeling☆24Aug 4, 2024Updated last year
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis☆12Nov 17, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))☆50Aug 6, 2024Updated last year
- ESPER☆24Mar 29, 2024Updated 2 years ago
- Generate descriptions automatically for 3D shapes in ShapeNet via cross-modal joint embedding☆15Jan 4, 2019Updated 7 years ago
- ☆11Oct 12, 2023Updated 2 years ago
- ☆10Jul 5, 2023Updated 2 years ago
- Official repository for "MMConv: An Environment for Multimodal Conversational Search across Multiple Domains"☆34Jul 15, 2021Updated 4 years ago
- The official site of paper MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation☆204Sep 3, 2023Updated 2 years ago
- Official PyTorch Implementation for the "What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-mod…☆20Sep 26, 2024Updated last year
- Some example codes for drawing figures in research paper☆35Mar 3, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A curated list of Large Language Model with RAG☆83Nov 3, 2023Updated 2 years ago
- ☆11Oct 21, 2022Updated 3 years ago
- 最近几年三维场景表示相关工作的收集列表,重点关注深度学习相关的工作,包括Neural Radiance Field(NeRF),Signed Distance Funciton(SDF),Occupancy Field以及3D Gaussian Splatting等。不仅包…☆13Dec 16, 2023Updated 2 years ago
- ☆10Apr 5, 2025Updated last year
- https://openreview.net/forum?id=OC1o4_OI6Jw☆13May 27, 2022Updated 3 years ago
- Official implementation of “Watch Your Step: A Fine-Grained Evaluation Framework for Multi-hop Knowledge Editing in Large Language Models…☆45Nov 25, 2025Updated 5 months ago
- ☆12Sep 6, 2023Updated 2 years ago
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Jul 16, 2022Updated 3 years ago
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Aug 16, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind".☆15Apr 27, 2023Updated 3 years ago
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆38Dec 19, 2024Updated last year
- ☆10Oct 7, 2024Updated last year
- Multimodal Model for Memotion Dataset☆12May 17, 2021Updated 4 years ago
- SIGIR 2021: Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals☆11Jul 30, 2021Updated 4 years ago
- A curated list of resources dedicated to face recognition.☆16Aug 4, 2018Updated 7 years ago
- Tools to count the number of public domain and free to distribute movies registered in IMDB☆24Feb 17, 2020Updated 6 years ago