HimariO / HatefulMemesChallenge

☆93

Alternatives and similar repositories for HatefulMemesChallenge

Users who are interested in HatefulMemesChallenge are comparing it to the repositories listed below


Muennighoff / vilio
🥶Vilio: State-of-the-art VL models in PyTorch & PaddlePaddle
☆90 Updated 2 years ago
yikuan8 / Transformers-VQA
An implementation that adapts pre-trained V+L models to downstream VQA tasks. Now supports VisualBERT, LXMERT, and UNITER
☆165 Updated 2 years ago
e-bug / volta
[TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-La…
☆114 Updated 3 years ago
airsplay / vokenization
PyTorch code for EMNLP 2020 Paper "Vokenization: Improving Language Understanding with Visual Supervision"
☆192 Updated 4 years ago
uclanlp / visualbert
Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"
☆539 Updated 2 years ago
ChenRocks / BUTD-UNITER-NLVR2
Support extracting BUTD features for NLVR2 images.
☆18 Updated 5 years ago
salesforce / VD-BERT
☆44 Updated 5 months ago
limanling / clip-event
☆106 Updated 3 years ago
j-min / VL-T5
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)
☆374 Updated 2 years ago
zhegan27 / VILLA
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER…
☆119 Updated 4 years ago
UKPLab / MMT-Retrieval
☆131 Updated 2 years ago
Nithin-Holla / meme_challenge
Repository containing code from team Kingsterdam for the Hateful Memes Challenge
☆22 Updated 3 years ago
microsoft / M3P
Multitask Multilingual Multimodal Pre-training
☆71 Updated 3 years ago
alasdairtran / transform-and-tell
[CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning
☆92 Updated last year
zhegan27 / LXMERT-AdvTrain
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…
☆21 Updated 5 years ago
LCS2-IIITD / MOMENTA
☆23 Updated last year
drivendataorg / hateful-memes
☆66 Updated 2 years ago
airsplay / py-bottom-up-attention
PyTorch bottom-up attention with Detectron2
☆238 Updated 3 years ago
berniebear / Multi-HT100M
☆53 Updated 3 years ago
rowanz / merlot
MERLOT: Multimodal Neural Script Knowledge Models
☆225 Updated 3 years ago
facebookresearch / mmbt
Supervised Multimodal Bitransformers for Classifying Images and Text
☆256 Updated 4 years ago
pzzhang / VinVL
project page for VinVL
☆359 Updated 2 years ago
multimodal / multimodal
A collection of multimodal datasets and visual features for VQA and captioning in PyTorch. Just run "pip install multimodal"
☆83 Updated 3 years ago
HLR / Cross_Modality_Relevance
The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"
☆27 Updated 4 years ago
NeverMoreLCH / Awesome-VQA
A reading list of papers about Visual Question Answering.
☆35 Updated 3 years ago
HAWLYQ / Qc-TextCap
☆16 Updated 3 years ago
mesnico / TERN
Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144
☆58 Updated last year
siwooyong / Codalab-Microsoft-COCO-Image-Captioning-Challenge
🥉 Codalab-Microsoft-COCO-Image-Captioning-Challenge 3rd place solution (06.30.21)
☆23 Updated 3 years ago
necla-ml / SNLI-VE
Dataset and starting code for visual entailment dataset
☆118 Updated 3 years ago
ShannonAI / OpenViDial
Code, Models and Datasets for OpenViDial Dataset
☆132 Updated 3 years ago