scofield7419 / UMMT-VSH
Code for the ACL 2023 paper "Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination"
☆12Updated 2 years ago
Alternatives and similar repositories for UMMT-VSH
Users who are interested in UMMT-VSH are comparing it to the repositories listed below:
- ☆17Updated 2 years ago
- ☆11Updated 11 months ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆10Updated 2 years ago
- ☆22Updated last year
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning☆40Updated last year
- ☆21Updated last year
- ☆36Updated last year
- Resource and Code for ICME 2021 paper "MNRE: A Challenge Multimodal Dataset for Neural Relation Extraction with Visual Evidence in Social…☆62Updated 3 years ago
- NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media, EMNLP 2021☆49Updated 2 months ago
- This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimoda…☆29Updated 6 months ago
- ☆24Updated 4 years ago
- Dataset and Code for Multimodal Fact Checking and Explanation Generation (Mocheg)☆58Updated last year
- ☆27Updated 3 years ago
- ☆36Updated last year
- [COLING2022] A Multi-turn Machine Reading Comprehension Framework with Rethink Mechanism for Emotion-Cause Pair Extraction☆18Updated 2 years ago
- ☆21Updated 4 years ago
- Official implementation of Towards Multi-Modal Sarcasm Detection via Hierarchical Congruity Modeling with Knowledge Enhancement.☆40Updated last year
- ☆36Updated 2 years ago
- Code for ACL 2022 main conference paper "Neural Machine Translation with Phrase-Level Universal Visual Representations".☆21Updated last year
- [NAACL 2022 Findings] Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extrac…☆113Updated 6 months ago
- ☆21Updated last year
- Official implementation of Dynamic Routing Transformer Network for Multimodal Sarcasm Detection (ACL'23)☆34Updated 2 years ago
- This code repository is for the accepted ACL2022 paper "On Vision Features in Multimodal Machine Translation". We provide the details and…☆43Updated 3 years ago
- Code for our EMNLP-2022 paper: "Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA"☆40Updated 2 years ago
- Code for ACM MM 2021 Paper "Multimodal Relation Extraction with Efficient Graph Alignment".☆101Updated 3 years ago
- This is the GPT2 baseline for ProtoQA☆12Updated 3 years ago
- ACL'2023: Multi-Task Pre-Training of Modular Prompt for Few-Shot Learning☆40Updated 2 years ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆98Updated 2 years ago
- ☆106Updated 3 years ago
- Danmuku dataset☆11Updated 2 years ago