vladsandulescu / hatefulmemes
β12Updated 3 years ago
Alternatives and similar repositories for hatefulmemes:
Users that are interested in hatefulmemes are comparing it to the libraries listed below
- Repository containing code from team Kingsterdam for the Hateful Memes Challengeβ20Updated 2 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.β34Updated 3 years ago
- π₯ΆVilio: State-of-the-art VL models in PyTorch & PaddlePaddleβ88Updated last year
- β44Updated 3 years ago
- Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and iβ¦β40Updated 2 years ago
- β92Updated 2 years ago
- Hate-CLIPper: Multimodal Hateful Meme Classification with Explicit Cross-modal Interaction of CLIP features - Accepted at EMNLP 2022 Workβ¦β52Updated last week
- β62Updated last year
- An unofficial implementation of the CVPR 2020 paper Multimodal Categorization of Crisis Events in Social Mediaβ16Updated 3 years ago
- Resources (conference/journal publications, references to dataset) for harmful memes detection.β47Updated 2 years ago
- β39Updated last year
- Code and data for ImageCoDe, a contextual vison-and-language benchmarkβ39Updated last year
- β16Updated 3 years ago
- Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge. https://arxiβ¦β60Updated last year
- CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanationsβ27Updated last year
- β21Updated last year
- β26Updated 3 years ago
- This is the official implementation of the paper "MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Visionβ¦β28Updated last year
- Python 3 support for the MS COCO caption evaluation toolsβ14Updated 10 months ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"β21Updated last year
- β64Updated 3 years ago
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)β48Updated last year
- Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backboneβ129Updated last year
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)β88Updated 3 weeks ago
- Official Github Repo for the Findings of EMNLP 2021 paper "An animated picture says at least a thousand words: Selecting Gif-based Replieβ¦β32Updated 3 years ago
- Open source code for AAAI 2023 Paper "BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning"β162Updated last year
- CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)β195Updated last year
- π A Large-scale Multi-modal E-Commerce Products Dataset (LTDL@IJCAI-21 Best Dataset & Pattern Recognition 2023)β29Updated last year
- An Image/Text Retrieval Test Collection to Support Multimedia Content Creationβ20Updated last year
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.β84Updated last year