scofield7419 / UMMT-VSHLinks
Code for the ACL 2023 paper Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination
☆12Updated 2 years ago
Alternatives and similar repositories for UMMT-VSH
Users that are interested in UMMT-VSH are comparing it to the libraries listed below
Sorting:
- ☆12Updated last year
- ☆17Updated 2 years ago
- ☆22Updated last year
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11Updated 2 years ago
- ☆22Updated last year
- ☆27Updated 3 years ago
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning☆45Updated last year
- ☆25Updated 4 years ago
- Code for our EMNLP-2022 paper: "Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA"☆40Updated 3 years ago
- ☆37Updated last year
- Code for ACL 2022 main conference paper "Neural Machine Translation with Phrase-Level Universal Visual Representations".☆21Updated 2 years ago
- ☆37Updated 2 years ago
- Paper, dataset and code list for multimodal dialogue.☆22Updated last year
- About Codes for ACL 2023 paper: Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling.☆20Updated last year
- Resource and Code for ICME 2021 paper "MNRE: A Challenge Multimodal Dataset for Neural Relation Extraction with Visual Evidence in Social…☆70Updated 4 years ago
- This is the GPT2 baseline for ProtoQA☆12Updated 4 years ago
- ☆25Updated 2 years ago
- Dataset and Code for Multimodal Fact Checking and Explanation Generation (Mocheg)☆61Updated 2 years ago
- Code for ACM MM 2021 Paper "Multimodal Relation Extraction with Efficient Graph Alignment".☆108Updated 3 years ago
- ☆109Updated 3 years ago
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".☆56Updated 3 years ago
- Official implementation of Dynamic Routing Transformer Network for Multimodal Sarcasm Detection (ACL'23)☆35Updated 2 years ago
- KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation☆31Updated 4 years ago
- Text-Image Relationships (ACL 2019)☆21Updated 2 years ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆100Updated 2 years ago
- ☆38Updated 2 years ago
- ☆10Updated last year
- Recent Advances in Visual Dialog☆30Updated 3 years ago
- ☆42Updated 2 years ago
- MSTI☆16Updated last year