shivangi-aneja / COSMOS
[AAAI 2023] COSMOS: Catching Out-of-Context Misinformation using Self Supervised Learning
☆41Updated last year
Related projects ⓘ
Alternatives and complementary repositories for COSMOS
- NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media, EMNLP 2021☆34Updated 2 months ago
- Code for our CVPR'22 paper: Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources☆32Updated 2 years ago
- ☆18Updated 3 months ago
- [EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning☆87Updated 4 months ago
- MixGen: A New Multi-Modal Data Augmentation☆116Updated last year
- A curated list of works related to Misinformation Video Detection, as a companion material for an ACM Multimedia 2023 survey☆81Updated 3 months ago
- ☆19Updated 7 months ago
- This repository contains the dataset and source files to reproduce the results in the publication Müller-Budack et al. 2021: "Multimodal …☆24Updated last year
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21Updated last year
- An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral)☆84Updated 2 years ago
- SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection☆32Updated 3 months ago
- [ACM MM 2021 Oral] Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation"☆39Updated 3 years ago
- Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”☆47Updated 2 years ago
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆32Updated last year
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)☆44Updated 9 months ago
- Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models☆25Updated 11 months ago
- ☆40Updated last year
- MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs☆14Updated 3 months ago
- Official repository for the "VERITE: A Robust Benchmark for Multimodal Misinformation Detection Accounting for Unimodal Bias" paper.☆12Updated 10 months ago
- A reading list of papers about Visual Question Answering.☆32Updated 2 years ago
- ☆24Updated 3 years ago
- [ICML 2022] This is the pytorch implementation of "Rethinking Attention-Model Explainability through Faithfulness Violation Test" (https:…☆18Updated 2 years ago
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias☆116Updated 2 years ago
- [ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos☆117Updated last year
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Updated last year
- SimVLM ---SIMPLE VISUAL LANGUAGE MODEL PRETRAINING WITH WEAK SUPERVISION☆35Updated 2 years ago
- [ICCV 2023] Prompt-aligned Gradient for Prompt Tuning☆151Updated last year
- A collections of papers about VQA-CP datasets and their results☆38Updated 2 years ago
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆202Updated last year
- Official implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval". CVPR 2022☆95Updated 2 years ago