shivangi-aneja / COSMOSLinks
[AAAI 2023] COSMOS: Catching Out-of-Context Misinformation using Self Supervised Learning
☆43Updated 2 years ago
Alternatives and similar repositories for COSMOS
Users that are interested in COSMOS are comparing it to the libraries listed below
Sorting:
- NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media, EMNLP 2021☆52Updated 4 months ago
- [EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning☆99Updated last year
- [ECCV 2022] FashionViL: Fashion-Focused V+L Representation Learning☆61Updated 2 years ago
- Code for our CVPR'22 paper: Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources☆39Updated 2 years ago
- Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021.☆66Updated 3 years ago
- ☆20Updated last year
- Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval☆57Updated 4 years ago
- ☆34Updated 4 years ago
- Official Pytorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021)☆133Updated last year
- ☆106Updated 3 years ago
- ☆160Updated 3 years ago
- code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022☆266Updated last year
- [CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning☆92Updated last year
- ☆31Updated 4 years ago
- Official code release for ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity (published at ICLR 2022)☆52Updated 2 years ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts☆187Updated 5 months ago
- ☆40Updated 2 years ago
- [CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval☆38Updated 2 years ago
- ☆27Updated 4 years ago
- SIGIR paper Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback☆14Updated 3 years ago
- A dataset of debunked and verified user-generated videos.☆35Updated 6 years ago
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Updated 2 years ago
- Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144☆58Updated last year
- This repository contains the dataset and source files to reproduce the results in the publication Müller-Budack et al. 2021: "Multimodal …☆24Updated 2 years ago
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆207Updated 2 years ago
- Official repository for the "VERITE: A Robust Benchmark for Multimodal Misinformation Detection Accounting for Unimodal Bias" paper.☆19Updated last year
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…☆40Updated last year
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias☆125Updated 3 years ago
- ☆120Updated 2 years ago
- CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification - 4th Workshop on Computer Vision for Fashion, Art, and Design☆28Updated 3 years ago