danielpreotiuc / text-image-relationshipLinks
Text-Image Relationships (ACL 2019)
☆21Updated 2 years ago
Alternatives and similar repositories for text-image-relationship
Users that are interested in text-image-relationship are comparing it to the libraries listed below
Sorting:
- KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation☆31Updated 4 years ago
- ☆37Updated 2 years ago
- ☆15Updated 4 years ago
- Resource and Code for ICME 2021 paper "MNRE: A Challenge Multimodal Dataset for Neural Relation Extraction with Visual Evidence in Social…☆65Updated 3 years ago
- ☆17Updated 2 years ago
- Cross-media Structured Common Space for Multimedia Event Extraction (ACL2020)☆76Updated 2 years ago
- RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER☆76Updated 2 years ago
- Preprocessed Datasets for our Multimodal NER paper☆122Updated 2 years ago
- A Few-Shot Learning based Approach to Multimodal Social Relation Extraction☆14Updated 2 years ago
- "Can images help recognize entities? A study of the role of images for Multimodal NER" (W-NUT at EMNLP 2021)☆21Updated 4 years ago
- Multimodal entity linking for Tweets☆29Updated 4 years ago
- This repository contains code used in our ACL'20 paper History for Visual Dialog: Do we really need it?☆34Updated 2 years ago
- [TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-La…☆114Updated 3 years ago
- Dataset and codes for our IJCAI 2019 paper "Adapting BERT for Target-Oriented Multimodal Sentiment Classification"☆86Updated 5 years ago
- ☆23Updated last year
- ☆146Updated 3 years ago
- Multi-modal Graph Fusion for Named Entity Recognition with Targeted Visual Guidance☆69Updated last year
- [NAACL 2022 Findings] Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extrac…☆117Updated 8 months ago
- Codes and data for EMNLP 2021 paper "Self- and Pseudo-self-supervised Prediction of Speaker and Key-utterance for Multi-party Dialogue Re…☆16Updated 3 years ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome☆23Updated 6 years ago
- ☆27Updated 3 years ago
- ☆162Updated 4 months ago
- Dataset and code for EMNLP 2022 "Visual Named Entity Linking: A New Dataset and A Baseline"☆26Updated 2 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆10Updated 2 years ago
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias☆126Updated 3 years ago
- ☆10Updated 5 years ago
- ☆25Updated 4 years ago
- Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations☆106Updated 3 years ago
- ☆22Updated last year
- Repository for VisualSem: a high-quality knowledge graph to support research in vision and language.☆89Updated 3 years ago