victorsungo / MMDialogLinks
The official site of paper MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation
☆202Updated 2 years ago
Alternatives and similar repositories for MMDialog
Users that are interested in MMDialog are comparing it to the libraries listed below
Sorting:
- Paper, dataset and code list for multimodal dialogue.☆22Updated 10 months ago
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆108Updated 3 years ago
- Language Models Can See: Plugging Visual Controls in Text Generation☆259Updated 3 years ago
- Code for "Small Models are Valuable Plug-ins for Large Language Models"☆131Updated 2 years ago
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆133Updated 2 years ago
- Code, Models and Datasets for OpenViDial Dataset☆132Updated 3 years ago
- Code for ACL2023 paper: Pre-Training to Learn in Context☆106Updated last year
- ☆70Updated 5 months ago
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation☆47Updated 2 years ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆473Updated last year
- Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training (ACL 2023))☆92Updated 2 years ago
- Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)☆167Updated last year
- [ACM MM 2022]: Multi-Modal Experience Inspired AI Creation☆21Updated 11 months ago
- Released code for our ICLR23 paper.☆66Updated 2 years ago
- [COLING22] An End-to-End Library for Evaluating Natural Language Generation☆93Updated last year
- PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)☆374Updated 2 years ago
- DSTC10 Track1 - MOD: Internet Meme Incorporated Open-domain Dialog☆51Updated 2 years ago
- A paper list about diffusion models for natural language processing.☆182Updated 2 years ago
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Updated 3 years ago
- contrastive decoding☆205Updated 3 years ago
- ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Model…☆272Updated 3 years ago
- Official repository of the AAAI'2022 paper "GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning…☆108Updated 3 years ago
- ☆146Updated 3 years ago
- This respository contains the code for extracting the test samples we used in our paper: "A Multitask, Multilingual, Multimodal Evaluatio…☆80Updated last year
- [Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics☆37Updated 9 months ago
- This code repository is for the accepted ACL2022 paper "On Vision Features in Multimodal Machine Translation". We provide the details and…☆43Updated 3 years ago
- MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU☆357Updated last year
- Code for ACL 2022 main conference paper "Neural Machine Translation with Phrase-Level Universal Visual Representations".☆21Updated 2 years ago
- Official Implementation for the ICML2022 paper "Directed Acyclic Transformer for Non-Autoregressive Machine Translation"☆132Updated 2 years ago
- Attaching human-like eyes to the large language model. The codes of IEEE TMM paper "LMEye: An Interactive Perception Network for Large La…☆48Updated last year