ImKeTT / Awesome-Multi-Modal-Dialog

[Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics

☆38

Related projects ⓘ

Alternatives and complementary repositories for Awesome-Multi-Modal-Dialog

Aman-4-Real / awesome-multimodal-dialogue
Paper, dataset and code list for multimodal dialogue.
☆19Updated 3 months ago
MichaelZhouwang / VLUE
This repo contains codes and instructions for baselines in the VLUE benchmark.
☆41Updated 2 years ago
ImKeTT / CTG-latentAEs
[Paperlist] Awesome paper list of controllable text generation via latent auto-encoders. Contributions of any kind are welcome.
☆49Updated last year
ImKeTT / ReSee
[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation
☆12Updated 11 months ago
e-bug / iglue
[ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"
☆49Updated last year
minicheshire / Robust-Prefix-Tuning
code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification
☆27Updated 2 years ago
PhoebusSi / VQA-VS
Code for our EMNLP-2022 paper: "Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA"
☆35Updated 2 years ago
declare-lab / MM-InstructEval
This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimoda…
☆25Updated 6 months ago
zhjohnchan / awesome-vision-and-language-pretraining
A curated list of vision-and-language pre-training (VLP). :-)
☆56Updated 2 years ago
guilk / KAT
Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"
☆61Updated 2 years ago
uds-lsv / MCSE
NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings
☆53Updated 5 months ago
xxxiaol / spatial-commonsense
Source code and data for Things not Written in Text: Exploring Spatial Commonsense from Visual Signals (ACL2022 main conference paper).
☆20Updated 2 years ago
Victorwz / VaLM
VaLM: Visually-augmented Language Modeling. ICLR 2023.
☆56Updated last year
VegB / iNLG
Implementation of "Visualize Before You Write: Imagination-Guided Open-Ended Text Generation".
☆17Updated last year
AkariAsai / ATTEMPT
This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)
☆97Updated last year
kugwzk / DiDE
Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”
☆29Updated last year
ictnlp / PLUVR
Code for ACL 2022 main conference paper "Neural Machine Translation with Phrase-Level Universal Visual Representations".
☆21Updated last year
yuleiniu / cfvqa
[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias
☆116Updated 2 years ago
morningmoni / UniPELT
Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022
☆58Updated 2 years ago
albertwy / GPT-4V-Evaluation
Data for evaluating GPT-4V
☆11Updated last year
LisaAnne / Hallucination
☆63Updated 5 years ago
fuzihaofzh / AnalyzeParameterEfficientFinetune
On the Effectiveness of Parameter-Efficient Fine-Tuning
☆38Updated last year
limanling / clip-event
☆101Updated 2 years ago
edchengg / infoseek_eval
EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions
☆16Updated 5 months ago
open-vision-language / infoseek
☆32Updated last year
Wusiwei0410 / SciMMIR
☆22Updated 3 months ago
yiren-jian / LM-SupCon
[NAACL 2022] Contrastive Learning for Prompt-based Few-shot Language Learners
☆22Updated last year
RenShuhuai-Andy / my-tools
my commonly-used tools
☆47Updated 3 months ago
lizekang / DSTC10-MOD
DSTC10 Track1 - MOD: Internet Meme Incorporated Open-domain Dialog
☆49Updated last year
UCSC-VLAA / Sight-Beyond-Text
This repository includes the official implementation of our paper "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness …
☆19Updated last year