scofield7419 / UMMT-VSHLinks
Code for the ACL 2023 paper Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination
☆13Updated 2 years ago
Alternatives and similar repositories for UMMT-VSH
Users that are interested in UMMT-VSH are comparing it to the libraries listed below
Sorting:
- ☆11Updated 7 months ago
- ☆16Updated 2 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆10Updated 2 years ago
- ☆24Updated 3 years ago
- Paper, dataset and code list for multimodal dialogue.☆20Updated 5 months ago
- Code for ACL 2022 main conference paper "Neural Machine Translation with Phrase-Level Universal Visual Representations".☆21Updated last year
- ☆34Updated last year
- This is the GPT2 baseline for ProtoQA☆12Updated 3 years ago
- ☆21Updated last year
- Code for ACL 2023 paper: Exploring Better Text Image Translation with Multimodal Codebook☆20Updated 3 weeks ago
- This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimoda…☆27Updated 2 months ago
- ☆27Updated 3 years ago
- This code repository is for the accepted ACL2022 paper "On Vision Features in Multimodal Machine Translation". We provide the details and…☆43Updated 2 years ago
- ☆22Updated last year
- ☆10Updated 5 months ago
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning☆36Updated last year
- The code for the paper "Neutral Utterances are Also Causes: Enhancing Conversational Causal Emotion Entailment with Social Commonsense Kn…☆27Updated 3 years ago
- Dataset and Code for Multimodal Fact Checking and Explanation Generation (Mocheg)☆53Updated last year
- Implementation of our ACL2023 paper: Unifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Langua…☆19Updated last year
- ☆38Updated last year
- Code for our EMNLP-2022 paper: "Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA"☆39Updated 2 years ago
- NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media, EMNLP 2021☆48Updated 8 months ago
- ☆27Updated 3 years ago
- ☆13Updated last year
- ☆26Updated 3 years ago
- KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation☆31Updated 3 years ago
- ☆42Updated last year
- Code and Data for the ACL22 main conference paper "MSCTD: A Multimodal Sentiment Chat Translation Dataset"☆41Updated 5 months ago
- A dataset and CLIP baseline for unrepresentative news thumbnail detection (ACL 2022 workshop)☆12Updated 3 years ago
- Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"☆31Updated 3 years ago