shivangi-aneja / COSMOS
[AAAI 2023] COSMOS: Catching Out-of-Context Misinformation using Self Supervised Learning
☆40Updated last year
Related projects: ⓘ
- NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media, EMNLP 2021☆30Updated last week
- Code for our CVPR'22 paper: Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources☆31Updated last year
- This repository contains the dataset and source files to reproduce the results in the publication Müller-Budack et al. 2021: "Multimodal …☆24Updated last year
- ☆17Updated last month
- Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models☆24Updated 9 months ago
- A curated list of works related to Misinformation Video Detection, as a companion material for an MM 2023 survey☆75Updated last month
- SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection☆17Updated last month
- [EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning☆82Updated 2 months ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21Updated last year
- ☆29Updated 3 years ago
- ☆30Updated 11 months ago
- ☆40Updated last year
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)☆44Updated 7 months ago
- Code and results accompanying our paper titled CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets☆53Updated last year
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆32Updated last year
- [CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning☆89Updated 5 months ago
- Code and data for "Learning Program Representations for Food Images and Cooking Recipes" (oral at CVPR 2022)☆15Updated 2 years ago
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)☆57Updated 3 years ago
- ☆97Updated 2 years ago
- ☆22Updated 2 years ago
- Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”☆46Updated last year
- Official repository for the "VERITE: A Robust Benchmark for Multimodal Misinformation Detection Accounting for Unimodal Bias" paper.☆11Updated 8 months ago
- ☆24Updated 2 years ago
- Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners☆112Updated 2 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13Updated last year
- Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering☆24Updated 2 months ago
- ☆24Updated 2 years ago
- Learning to compose soft prompts for compositional zero-shot learning.☆80Updated last year
- Dataset and Code for Multimodal Fact Checking and Explanation Generation (Mocheg)☆35Updated 9 months ago
- Video captioning on MSR-VTT Dataset☆12Updated 3 years ago