vladsandulescu / hatefulmemes
☆11Updated 3 years ago
Related projects: ⓘ
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 3 years ago
- Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and i…☆35Updated 2 years ago
- Repository containing code from team Kingsterdam for the Hateful Memes Challenge☆19Updated last year
- ☆57Updated last year
- Hate-CLIPper: Multimodal Hateful Meme Classification with Explicit Cross-modal Interaction of CLIP features - Accepted at EMNLP 2022 Work…☆40Updated last year
- ☆44Updated 3 years ago
- ☆90Updated last year
- ☆16Updated 2 years ago
- 🥶Vilio: State-of-the-art VL models in PyTorch & PaddlePaddle☆88Updated last year
- Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge. https://arxi…☆53Updated 7 months ago
- Code and data for ImageCoDe, a contextual vison-and-language benchmark☆39Updated 6 months ago
- Facebook hateful memes challenge using multi-modal learning. More info about it here: https://ai.facebook.com/blog/hateful-memes-challeng…☆15Updated last year
- An unofficial implementation of the CVPR 2020 paper Multimodal Categorization of Crisis Events in Social Media☆12Updated 2 years ago
- ☆19Updated 5 months ago
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆65Updated 11 months ago
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)☆44Updated 7 months ago
- This is the official implementation of the paper "MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision…☆19Updated 6 months ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆73Updated 11 months ago
- ☆25Updated 2 years ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings☆52Updated 3 months ago
- ☆56Updated 3 years ago
- [EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning☆82Updated 2 months ago
- [CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning☆89Updated 5 months ago
- ☆129Updated last year
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21Updated last year
- Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval☆55Updated 2 years ago
- ☆33Updated last year
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆130Updated last year
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts☆185Updated 2 years ago
- Download Web-10K data by querying Bing Image Search☆10Updated 2 years ago