IsaacRodgz / ConcatBERTLinks
Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and image representation obtained from VGG16 pretrained model.
☆42Updated 3 years ago
Alternatives and similar repositories for ConcatBERT
Users that are interested in ConcatBERT are comparing it to the libraries listed below
Sorting:
- ☆64Updated 4 years ago
- [NeurIPS'20-Competition] Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Meme…☆61Updated last year
- An unofficial implementation of the CVPR 2020 paper Multimodal Categorization of Crisis Events in Social Media☆17Updated 4 years ago
- Multimodal Meme Classification: Identifying Offensive Content in Image and Text☆71Updated 3 years ago
- ☆22Updated last year
- Repository containing code from team Kingsterdam for the Hateful Memes Challenge☆22Updated 3 years ago
- ☆67Updated 2 years ago
- ☆212Updated 4 years ago
- ☆93Updated 3 years ago
- Hate-CLIPper: Multimodal Hateful Meme Classification with Explicit Cross-modal Interaction of CLIP features - Accepted at EMNLP 2022 Work…☆56Updated 8 months ago
- 🥶Vilio: State-of-the-art VL models in PyTorch & PaddlePaddle☆90Updated 2 years ago
- [ACM MM 2021 Oral] Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation"☆40Updated 4 years ago
- ☆13Updated 4 years ago
- It is the implementation of paper "Multi-Modal Sarcasm Detection in Twitter with Hierarchical Fusion Model"☆16Updated 2 years ago
- multimodal social media content (text, image) classification☆50Updated 3 years ago
- An implementation that downstreams pre-trained V+L models to VQA tasks. Now support: VisualBERT, LXMERT, and UNITER☆165Updated 3 years ago
- Source code for training Gated Multimodal Units on MM-IMDb dataset☆100Updated 2 years ago
- Reading list for multimodal sequence learning☆14Updated 2 years ago
- Dataset and codes for our IJCAI 2019 paper "Adapting BERT for Target-Oriented Multimodal Sentiment Classification"☆85Updated 5 years ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21Updated 2 years ago
- Supervised Multimodal Bitransformers for Classifying Images and Text☆256Updated 4 years ago
- Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144☆58Updated 2 years ago
- ☆11Updated 4 years ago
- 🥉 Codalab-Microsoft-COCO-Image-Captioning-Challenge 3rd place solution(06.30.21)☆23Updated 3 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 4 years ago
- SimVLM ---SIMPLE VISUAL LANGUAGE MODEL PRETRAINING WITH WEAK SUPERVISION☆36Updated 3 years ago
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆82Updated 6 months ago
- Multi-modal Multi-label Emotion Recognition with Heterogeneous Hierarchical Message Passing☆18Updated 3 years ago
- Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.☆73Updated 4 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Updated 5 years ago