IsaacRodgz / ConcatBERTLinks
Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and image representation obtained from VGG16 pretrained model.
☆41Updated 2 years ago
Alternatives and similar repositories for ConcatBERT
Users that are interested in ConcatBERT are comparing it to the libraries listed below
Sorting:
- ☆61Updated 4 years ago
- Multimodal Meme Classification: Identifying Offensive Content in Image and Text☆71Updated 2 years ago
- An unofficial implementation of the CVPR 2020 paper Multimodal Categorization of Crisis Events in Social Media☆16Updated 3 years ago
- Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge. https://arxi…☆61Updated last year
- ☆21Updated last year
- ☆66Updated last year
- ☆93Updated 2 years ago
- It is the implementation of paper "Multi-Modal Sarcasm Detection in Twitter with Hierarchical Fusion Model"☆15Updated 2 years ago
- ☆205Updated 3 years ago
- multimodal social media content (text, image) classification☆50Updated 3 years ago
- 🥶Vilio: State-of-the-art VL models in PyTorch & PaddlePaddle☆90Updated 2 years ago
- [ACM MM 2021 Oral] Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation"☆40Updated 3 years ago
- Source code for training Gated Multimodal Units on MM-IMDb dataset☆95Updated 2 years ago
- Hate-CLIPper: Multimodal Hateful Meme Classification with Explicit Cross-modal Interaction of CLIP features - Accepted at EMNLP 2022 Work…☆52Updated 3 months ago
- Repository containing code from team Kingsterdam for the Hateful Memes Challenge☆21Updated 2 years ago
- Dataset and codes for our IJCAI 2019 paper "Adapting BERT for Target-Oriented Multimodal Sentiment Classification"☆81Updated 5 years ago
- MultiSentiNet-CIKM2017☆21Updated 7 years ago
- Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering☆30Updated last year
- Supervised Multimodal Bitransformers for Classifying Images and Text☆256Updated 4 years ago
- Reading list for multimodal sequence learning☆13Updated last year
- An implementation that downstreams pre-trained V+L models to VQA tasks. Now support: VisualBERT, LXMERT, and UNITER☆164Updated 2 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 3 years ago
- 🥉 Codalab-Microsoft-COCO-Image-Captioning-Challenge 3rd place solution(06.30.21)☆23Updated 3 years ago
- [ACL 2022] The source code of Multi-Modal Sarcasm Detection via Cross-Modal Graph Convolutional Network☆37Updated 2 years ago
- Hyperparameter analysis for Image Captioning using LSTMs and Transformers☆26Updated last year
- Multi-modal Multi-label Emotion Recognition with Heterogeneous Hierarchical Message Passing☆17Updated 2 years ago
- Text-Image Relationships (ACL 2019)☆21Updated last year
- SimVLM ---SIMPLE VISUAL LANGUAGE MODEL PRETRAINING WITH WEAK SUPERVISION☆36Updated 2 years ago
- The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"☆27Updated 4 years ago
- ☆16Updated 3 years ago