ImKeTT / Awesome-Multi-Modal-Dialog
[Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics
☆38 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for Awesome-Multi-Modal-Dialog
- Paper, dataset and code list for multimodal dialogue. ☆19 · Updated 3 months ago
- This repo contains codes and instructions for baselines in the VLUE benchmark. ☆41 · Updated 2 years ago
- [Paperlist] Awesome paper list of controllable text generation via latent auto-encoders. Contributions of any kind are welcome. ☆49 · Updated last year
- [EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation ☆12 · Updated 11 months ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages" ☆49 · Updated last year
- code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification ☆27 · Updated 2 years ago
- Code for our EMNLP-2022 paper: "Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA" ☆35 · Updated 2 years ago
- This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimoda… ☆25 · Updated 6 months ago
- A curated list of vision-and-language pre-training (VLP). :-) ☆56 · Updated 2 years ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language" ☆61 · Updated 2 years ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings ☆53 · Updated 5 months ago
- Source code and data for Things not Written in Text: Exploring Spatial Commonsense from Visual Signals (ACL2022 main conference paper). ☆20 · Updated 2 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023. ☆56 · Updated last year
- Implementation of "Visualize Before You Write: Imagination-Guided Open-Ended Text Generation". ☆17 · Updated last year
- This is the official repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022) ☆97 · Updated last year
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding” ☆29 · Updated last year
- Code for ACL 2022 main conference paper "Neural Machine Translation with Phrase-Level Universal Visual Representations". ☆21 · Updated last year
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias ☆116 · Updated 2 years ago
- Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022 ☆58 · Updated 2 years ago
- Data for evaluating GPT-4V ☆11 · Updated last year
- ☆63 · Updated 5 years ago
- On the Effectiveness of Parameter-Efficient Fine-Tuning ☆38 · Updated last year
- ☆101 · Updated 2 years ago
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions ☆16 · Updated 5 months ago
- ☆32 · Updated last year
- ☆22 · Updated 3 months ago
- [NAACL 2022] Contrastive Learning for Prompt-based Few-shot Language Learners ☆22 · Updated last year
- my commonly-used tools ☆47 · Updated 3 months ago
- DSTC10 Track1 - MOD: Internet Meme Incorporated Open-domain Dialog ☆49 · Updated last year
- This repository includes the official implementation of our paper "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness … ☆19 · Updated last year