Yuco-Z / Awesome-Multi-Modal-Dialog
[Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics
☆39Updated 5 months ago
Alternatives and similar repositories for Awesome-Multi-Modal-Dialog
Users who are interested in Awesome-Multi-Modal-Dialog compare it to the repositories listed below
- Paper, dataset and code list for multimodal dialogue.☆21Updated 5 months ago
- This repo contains code and instructions for baselines in the VLUE benchmark.☆41Updated 2 years ago
- [EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue (PyTorch implementation)☆12Updated last year
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”☆30Updated 2 years ago
- This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimoda…☆28Updated 3 months ago
- ☆104Updated 3 years ago
- Recent Advances in Visual Dialog☆30Updated 2 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Updated 2 years ago
- [Paperlist] Awesome paper list of controllable text generation via latent auto-encoders. Contributions of any kind are welcome.☆51Updated 2 years ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings☆55Updated last year
- ☆73Updated 6 years ago
- Code for "Small Models are Valuable Plug-ins for Large Language Models"☆129Updated 2 years ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆64Updated 2 years ago
- ☆22Updated 3 years ago
- [NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings☆22Updated 2 years ago
- Code for our ACL-2023 paper: "Combo of Thinking and Observing for Outside-Knowledge VQA"☆12Updated last year
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Updated 2 years ago
- The SVO-Probes Dataset for Verb Understanding☆31Updated 3 years ago
- Code for our EMNLP-2022 paper: "Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA"☆39Updated 2 years ago
- ☆54Updated 5 months ago
- This is the official repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆102Updated 2 years ago
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆135Updated 2 years ago
- ☆132Updated 2 years ago
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild☆47Updated last year
- ☆12Updated 9 months ago
- ☆38Updated last year
- An automatic MLLM hallucination detection framework☆19Updated last year
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆34Updated 2 years ago
- DSTC10 Track1 - MOD: Internet Meme Incorporated Open-domain Dialog☆50Updated 2 years ago
- ☆29Updated 3 months ago