ImKeTT / Awesome-Multi-Modal-Dialog
[Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics
☆37Updated last year
Related projects: ⓘ
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Updated 2 years ago
- Paper, dataset and code list for multimodal dialogue.☆18Updated last month
- ☆97Updated 2 years ago
- This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimoda…☆24Updated 4 months ago
- Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022☆58Updated 2 years ago
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”☆29Updated last year
- CHAIR metric is a rule-based metric for evaluating object hallucination in caption generation.☆19Updated 10 months ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆55Updated last year
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆55Updated 2 years ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Updated last year
- [Paperlist] Awesome paper list of controllable text generation via latent auto-encoders. Contributions of any kind are welcome.☆50Updated last year
- A curated list of vision-and-language pre-training (VLP). :-)☆56Updated 2 years ago
- A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)☆40Updated 2 years ago
- Code for our EMNLP-2022 paper: "Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA"☆35Updated last year
- Source code and data for Things not Written in Text: Exploring Spatial Commonsense from Visual Signals (ACL2022 main conference paper).☆20Updated last year
- The code for lifelong few-shot language learning☆53Updated 2 years ago
- ☆49Updated last year
- Code for our ACL-2023 paper: "Combo of Thinking and Observing for Outside-Knowledge VQA"☆11Updated last year
- [NeurIPS 2022 Workshop] A Case Study with Negated Prompts using T0 (3B, 11B), InstructGPT (350M-175B), GPT-3 (350M - 175B) & OPT (125M - …☆23Updated last year
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias☆115Updated 2 years ago
- ☆26Updated last year
- ☆57Updated 5 years ago
- ☆25Updated 10 months ago
- my commonly-used tools☆46Updated last month
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings☆52Updated 3 months ago
- Data for evaluating GPT-4V☆11Updated 10 months ago
- [ACM MM 2022]: Multi-Modal Experience Inspired AI Creation☆18Updated 3 months ago
- DSTC10 Track1 - MOD: Internet Meme Incorporated Open-domain Dialog☆48Updated last year
- Dataset and Code for Multimodal Fact Checking and Explanation Generation (Mocheg)☆35Updated 9 months ago
- Official code for our paper "Model Composition for Multimodal Large Language Models"☆15Updated 4 months ago