claws-lab / multimodal-robustness
Code and resources for the EMNLP 2022 paper 'Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions'
☆10Updated last year
Alternatives and similar repositories for multimodal-robustness
Users who are interested in multimodal-robustness are comparing it to the repositories listed below
- ☆16Updated 4 years ago
- ☆40Updated 3 years ago
- ☆10Updated 4 years ago
- ☆22Updated 4 years ago
- The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021…☆17Updated 3 years ago
- ☆12Updated 4 years ago
- Code for the ICCV'21 paper "Context-aware Scene Graph Generation with Seq2Seq Transformers"☆43Updated 3 years ago
- GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering☆65Updated 4 years ago
- ☆30Updated 2 years ago
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆34Updated 2 years ago
- DeVLBert: Learning Deconfounded Visio-Linguistic Representations☆27Updated 3 years ago
- [Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph☆72Updated last year
- ☆107Updated 3 years ago
- ☆25Updated 3 years ago
- ☆26Updated 4 years ago
- This is the repository for the paper "One-Shot Scene Graph Generation"☆16Updated 4 years ago
- This is the code repository for Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering☆19Updated 4 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13Updated 2 years ago
- ☆14Updated 4 years ago
- ☆79Updated 3 years ago
- Video Graph Transformer for Video Question Answering (ECCV'22)☆49Updated 2 years ago
- A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)☆43Updated 3 years ago
- Learning Situation Hyper-Graphs for Video Question Answering☆22Updated last year
- ☆16Updated 2 years ago
- CVPR 2022 (Oral) PyTorch code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment☆22Updated 3 years ago
- Bridging Knowledge Graphs to Generate Scene Graphs, ECCV 2020☆70Updated last year
- Implementation for the paper "Dynamic Language Binding in Relational Visual Reasoning" (Le et al., IJCAI 2020)☆13Updated last year
- [ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval☆33Updated 2 years ago
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Updated 2 years ago
- ☆29Updated 2 years ago