vladsandulescu / hatefulmemesLinks
☆13Updated 4 years ago
Alternatives and similar repositories for hatefulmemes
Users that are interested in hatefulmemes are comparing it to the libraries listed below
Sorting:
- ☆66Updated 2 years ago
- Hate-CLIPper: Multimodal Hateful Meme Classification with Explicit Cross-modal Interaction of CLIP features - Accepted at EMNLP 2022 Work…☆55Updated 6 months ago
- Repository containing code from team Kingsterdam for the Hateful Memes Challenge☆22Updated 2 years ago
- An unofficial implementation of the CVPR 2020 paper Multimodal Categorization of Crisis Events in Social Media☆17Updated 3 years ago
- Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and i…☆41Updated 3 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 4 years ago
- ☆23Updated last year
- ☆16Updated 3 years ago
- ☆93Updated 2 years ago
- Facebook hateful memes challenge using multi-modal learning. More info about it here: https://ai.facebook.com/blog/hateful-memes-challeng…☆15Updated 2 years ago
- ☆44Updated 4 years ago
- 🥶Vilio: State-of-the-art VL models in PyTorch & PaddlePaddle☆90Updated 2 years ago
- Code and data for ImageCoDe, a contextual vison-and-language benchmark☆41Updated last year
- [NeurIPS'20-Competition] Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Meme…☆62Updated last year
- CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations☆29Updated last year
- A dataset of crowdsourced ratings for machine-generated image captions☆37Updated 6 years ago
- Python 3 support for the MS COCO caption evaluation tools☆14Updated last year
- CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)☆198Updated last year
- Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…☆54Updated 11 months ago
- Multimodal Meme Classification: Identifying Offensive Content in Image and Text☆71Updated 2 years ago
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆56Updated 6 months ago
- ☆63Updated 4 years ago
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆81Updated 4 months ago
- opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Updated 4 years ago
- ☆25Updated 3 years ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆97Updated 6 months ago
- Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.☆16Updated 5 months ago
- [EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning☆99Updated last year
- This is the official implementation of the paper "MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision…☆31Updated last year
- ☆16Updated 3 years ago