Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and image representation obtained from VGG16 pretrained model.
☆43Aug 26, 2022Updated 3 years ago
Alternatives and similar repositories for ConcatBERT
Users that are interested in ConcatBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multimodal Model for Memotion Dataset☆12May 17, 2021Updated 5 years ago
- ☆14Jul 12, 2021Updated 4 years ago
- ☆64Jun 25, 2021Updated 4 years ago
- Repository containing code from team Kingsterdam for the Hateful Memes Challenge☆23Oct 24, 2022Updated 3 years ago
- [NeurIPS'20-Competition] Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Meme…☆61Feb 12, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code showing how to use a model based on the ML model base class.☆10Sep 30, 2022Updated 3 years ago
- ☆68Sep 7, 2023Updated 2 years ago
- ☆16Dec 25, 2021Updated 4 years ago
- ☆10Nov 15, 2021Updated 4 years ago
- RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER☆76Mar 31, 2023Updated 3 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform☆17Jul 2, 2020Updated 5 years ago
- Official repository for ACM Multimedia'24 paper "MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube a…☆21Aug 11, 2024Updated last year
- ☆11May 18, 2022Updated 4 years ago
- [NeurIPS 2022] Source code for our paper "Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data"☆24Oct 16, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- multimodal social media content (text, image) classification☆51Jun 22, 2022Updated 3 years ago
- Prompting For Named Entity Recognition☆19Sep 6, 2023Updated 2 years ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21May 8, 2023Updated 3 years ago
- Python 3 support for the MS COCO caption evaluation tools☆14Jun 14, 2024Updated last year
- Code recipe for "Multimodal One-Shot Learning of Speech and Images"☆11Nov 21, 2018Updated 7 years ago
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'☆18Jul 21, 2024Updated last year
- ☆44Aug 2, 2021Updated 4 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- Multimodal Meme Classification: Identifying Offensive Content in Image and Text☆72Dec 8, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Order-agnostic Identifier for Large Language Model-based Generative Recommendation (SIGIR'25)☆30Oct 21, 2025Updated 7 months ago
- CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations☆30Oct 27, 2023Updated 2 years ago
- Fusion Modality Approaches for sentiment analysis and emotion recognition task.☆12Feb 5, 2021Updated 5 years ago
- [WSDM 2025] Source code for "Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Cali…☆14Oct 14, 2025Updated 7 months ago
- This is a multi-modal fusion method based on VGG16 and FastText for identifying useful information collected from social media platforms.…☆15Mar 4, 2022Updated 4 years ago
- PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers"☆50Aug 27, 2021Updated 4 years ago
- a py3 lib for NLP & image-caption metrics : BLEU METEOR CIDEr ROUGE SPICE WMD☆14Sep 13, 2022Updated 3 years ago
- The implementation of paper "Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction", WSDM'24.☆17Oct 30, 2025Updated 6 months ago
- XCon: Learning with Experts for Fine-grained Category Discovery☆19Dec 19, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Apr 5, 2022Updated 4 years ago
- CNN based on images from Kaggle's FER2013 competition, achieving 67.59% accuracy on the final test set - equivalent of the 5th place on t…☆12Jul 26, 2018Updated 7 years ago
- Classification of tamil news headlines - experimental☆13Feb 21, 2019Updated 7 years ago
- Code for paper "Aiding Intra-Text Representations with Visual Context for Multimodal Named Entity Recognition"☆16Aug 19, 2019Updated 6 years ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Apr 21, 2022Updated 4 years ago
- The implementation of the paper: Clifford Group Equivariant Simplicial Message Passing Networks @ ICLR2024☆17May 29, 2024Updated last year
- 12-in-1: Multi-Task Vision and Language Representation Learning Web Demo☆35Dec 8, 2022Updated 3 years ago