SALT-NLP / Impressions
Dataset for the investigation of visual semiotics, and how specific visual features and design choices can elicit specific emotions, thoughts and beliefs.
☆10Updated last year
Alternatives and similar repositories for Impressions:
Users that are interested in Impressions are comparing it to the libraries listed below
- [ICCVW 2023] - Mapping Memes to Words for Multimodal Hateful Meme Classification☆25Updated last year
- Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023☆59Updated 3 months ago
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning☆144Updated 2 years ago
- NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media, EMNLP 2021☆38Updated 5 months ago
- [ICLR 2023] This is the code repo for our ICLR‘23 paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Spa…☆50Updated 7 months ago
- Official repository of "Chatting Makes Perfect: Chat-based Image Retrieval"☆28Updated 2 weeks ago
- ☆58Updated last year
- ☆19Updated 6 months ago
- ☆34Updated last year
- The official implementation of the paper "DIP: Dual Incongruity Perceiving Network for Sarcasm Detection"☆30Updated 2 months ago
- LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation☆127Updated last year
- [CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval☆38Updated last year
- SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation☆102Updated last year
- ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities☆37Updated 5 months ago
- Hate-CLIPper: Multimodal Hateful Meme Classification with Explicit Cross-modal Interaction of CLIP features - Accepted at EMNLP 2022 Work…☆47Updated 2 years ago
- CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)☆190Updated last year
- ☆62Updated last year
- Official repository for the A-OKVQA dataset☆75Updated 9 months ago
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)☆47Updated last year
- Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models☆26Updated last year
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆71Updated last year
- ☆12Updated last year
- implementation of paper https://arxiv.org/abs/2210.04559☆53Updated 2 years ago
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆172Updated last year
- Code and data for "Learning Program Representations for Food Images and Cooking Recipes" (oral at CVPR 2022)☆15Updated 2 years ago
- NegCLIP.☆30Updated 2 years ago
- An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.☆120Updated last month
- CLIPScore EMNLP code☆210Updated 2 years ago
- 🎁 A Large-scale Multi-modal E-Commerce Products Dataset (LTDL@IJCAI-21 Best Dataset & Pattern Recognition 2023)☆26Updated last year
- [NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering☆185Updated last year