IsaacRodgz / ConcatBERT
Baseline model for multimodal classification of images and text. The text representation comes from a pretrained BERT base model, and the image representation comes from a pretrained VGG16 model.
☆41 Updated 3 years ago
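The description above suggests a concatenation-style fusion of the two modalities. The following is a minimal sketch of that idea only, not the repository's actual code: random vectors stand in for the BERT and VGG16 outputs, and the helper names (`concat_fusion`, `classify`), the number of classes, and the use of VGG16's 4096-dimensional penultimate layer are all assumptions made for illustration.

```python
import math
import random

TEXT_DIM = 768     # hidden size of BERT base
IMAGE_DIM = 4096   # assumed: VGG16 penultimate fully connected layer
NUM_CLASSES = 3    # hypothetical number of target classes


def concat_fusion(text_vec, image_vec):
    """Fuse two modality vectors by plain concatenation."""
    return list(text_vec) + list(image_vec)


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(fused, weights, biases):
    """Apply one linear layer to the fused vector, then softmax."""
    logits = [
        sum(w * x for w, x in zip(row, fused)) + b
        for row, b in zip(weights, biases)
    ]
    return softmax(logits)


rng = random.Random(0)

# Stand-ins for the encoder outputs; a real pipeline would take these
# from the pretrained BERT and VGG16 models instead.
text_vec = [rng.gauss(0.0, 1.0) for _ in range(TEXT_DIM)]
image_vec = [rng.gauss(0.0, 1.0) for _ in range(IMAGE_DIM)]

fused = concat_fusion(text_vec, image_vec)

# Untrained classifier parameters (small random weights, zero biases).
weights = [
    [rng.gauss(0.0, 0.01) for _ in range(len(fused))]
    for _ in range(NUM_CLASSES)
]
biases = [0.0] * NUM_CLASSES

probs = classify(fused, weights, biases)
```

Under these assumptions the fused vector has 768 + 4096 = 4864 dimensions, and `probs` holds one probability per class. The real model may project, normalize, or gate the features before classification.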
Alternatives and similar repositories for ConcatBERT
Users who are interested in ConcatBERT are comparing it to the repositories listed below.
- ☆63 Updated 4 years ago
- Multimodal Meme Classification: Identifying Offensive Content in Image and Text ☆71 Updated 2 years ago
- ☆23 Updated last year
- An unofficial implementation of the CVPR 2020 paper Multimodal Categorization of Crisis Events in Social Media ☆17 Updated 3 years ago
- Reading list for multimodal sequence learning ☆14 Updated 2 years ago
- Repository containing code from team Kingsterdam for the Hateful Memes Challenge ☆22 Updated 3 years ago
- [NeurIPS'20-Competition] Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Meme… ☆62 Updated last year
- [ACM MM 2021 Oral] Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation ☆40 Updated 4 years ago
- It is the implementation of paper "Multi-Modal Sarcasm Detection in Twitter with Hierarchical Fusion Model" ☆15 Updated 2 years ago
- ☆93 Updated 2 years ago
- ☆212 Updated 3 years ago
- ☆13 Updated 4 years ago
- ☆66 Updated 2 years ago
- Hate-CLIPper: Multimodal Hateful Meme Classification with Explicit Cross-modal Interaction of CLIP features - Accepted at EMNLP 2022 Work… ☆55 Updated 6 months ago
- 🥶Vilio: State-of-the-art VL models in PyTorch & PaddlePaddle ☆90 Updated 2 years ago
- multimodal social media content (text, image) classification ☆50 Updated 3 years ago
- Multi-modal Multi-label Emotion Recognition with Heterogeneous Hierarchical Message Passing ☆18 Updated 3 years ago
- Dataset and codes for our IJCAI 2019 paper "Adapting BERT for Target-Oriented Multimodal Sentiment Classification" ☆84 Updated 5 years ago
- Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering ☆31 Updated last year
- An implementation that downstreams pre-trained V+L models to VQA tasks. Now support: VisualBERT, LXMERT, and UNITER ☆165 Updated 2 years ago
- ☆11 Updated 3 years ago
- ☆22 Updated 3 years ago
- Multi-model analysis of sentiment and emotion in multi-speaker conversations. ☆27 Updated 2 years ago
- Source code for training Gated Multimodal Units on MM-IMDb dataset ☆98 Updated 2 years ago
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization". ☆55 Updated 3 years ago
- ☆16 Updated 3 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week. ☆34 Updated 4 years ago
- SimVLM ---SIMPLE VISUAL LANGUAGE MODEL PRETRAINING WITH WEAK SUPERVISION ☆36 Updated 2 years ago
- Multimodal Sentiment Detection Based on Multi-channel Graph Neural Networks ☆39 Updated 3 years ago
- 🥉 Codalab-Microsoft-COCO-Image-Captioning-Challenge 3rd place solution(06.30.21) ☆23 Updated 3 years ago