Yuco-Z / Awesome-Multi-Modal-Dialog
[Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics
☆39Updated 3 weeks ago
Alternatives and similar repositories for Awesome-Multi-Modal-Dialog:
Users that are interested in Awesome-Multi-Modal-Dialog are comparing it to the libraries listed below
- Paper, dataset and code list for multimodal dialogue.☆20Updated last month
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Updated 2 years ago
- [EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation☆12Updated last year
- [Paperlist] Awesome paper list of controllable text generation via latent auto-encoders. Contributions of any kind are welcome.☆50Updated 2 years ago
- ☆101Updated 2 years ago
- This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimoda…☆26Updated 9 months ago
- Official code for our COLING 2022 paper: In-Context Learning for Empathetic Dialogue Generation☆19Updated last year
- A curated list of vision-and-language pre-training (VLP). :-)☆57Updated 2 years ago
- Code for our EMNLP-2022 paper: "Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA"☆38Updated 2 years ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings☆55Updated 8 months ago
- [KBS] PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable Text Generation PyTorch Implementation☆25Updated last year
- Released code for our ICLR23 paper.☆63Updated last year
- Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022☆59Updated 2 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Updated last year
- ☆73Updated 2 years ago
- Official repository for the A-OKVQA dataset☆74Updated 9 months ago
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation☆47Updated last year
- DSTC10 Track1 - MOD: Internet Meme Incorporated Open-domain Dialog☆50Updated 2 years ago
- Source code for the paper "Prefix Language Models are Unified Modal Learners"☆43Updated last year
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”☆29Updated last year
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Updated 2 years ago
- ☆22Updated 6 months ago
- Recent Advances in Visual Dialog☆30Updated 2 years ago
- ☆15Updated 2 years ago
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions☆18Updated 8 months ago
- Data for evaluating GPT-4V☆11Updated last year
- code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification☆27Updated 2 years ago
- ☆54Updated 10 months ago
- This is the GPT2 baseline for ProtoQA☆12Updated 3 years ago
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆33Updated 2 years ago