Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and image representation obtained from VGG16 pretrained model.
☆43Aug 26, 2022Updated 3 years ago
Alternatives and similar repositories for ConcatBERT
Users that are interested in ConcatBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Facebook Hatebook Memes Challenge☆12Jan 28, 2021Updated 5 years ago
- Multimodal Model for Memotion Dataset☆12May 17, 2021Updated 4 years ago
- ☆15Jul 12, 2021Updated 4 years ago
- ☆64Jun 25, 2021Updated 4 years ago
- Stable-Diffusion fine-tuned on mechas from the Gundam anime☆13Oct 21, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Repository containing code from team Kingsterdam for the Hateful Memes Challenge☆23Oct 24, 2022Updated 3 years ago
- Fine Tuning Stable Diffusion on Chinese Landscape Painting Generation(基于扩散模型的中国山水画生成)☆10Apr 10, 2023Updated 3 years ago
- Some notebooks for fine-tuning openai diffusion models on images from CLIP retrieval based on a prompt.☆12Aug 16, 2022Updated 3 years ago
- Classify image and text with ResNet and BERT models using Pytorch☆13Jul 7, 2020Updated 5 years ago
- [NeurIPS'20-Competition] Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Meme…☆61Feb 12, 2024Updated 2 years ago
- ☆68Sep 7, 2023Updated 2 years ago
- ☆16Dec 25, 2021Updated 4 years ago
- RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER☆76Mar 31, 2023Updated 3 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform☆17Jul 2, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official repository for ACM Multimedia'24 paper "MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube a…☆21Aug 11, 2024Updated last year
- ☆11May 18, 2022Updated 3 years ago
- The source code and manually annotated datasets for our paper "Joint Multimodal Sentiment Analysis Based on Information Relevance"☆11Dec 17, 2022Updated 3 years ago
- [NeurIPS 2022] Source code for our paper "Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data"☆24Oct 16, 2023Updated 2 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Nov 29, 2021Updated 4 years ago
- multimodal social media content (text, image) classification☆51Jun 22, 2022Updated 3 years ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21May 8, 2023Updated 2 years ago
- Code recipe for "Multimodal One-Shot Learning of Speech and Images"☆11Nov 21, 2018Updated 7 years ago
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'☆19Jul 21, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆44Aug 2, 2021Updated 4 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- Generate 256x256, 512x512 resolution images with simple Convolutional GAN by adding Gaussian noise to discriminator layers.☆10Jul 11, 2021Updated 4 years ago
- CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations☆29Oct 27, 2023Updated 2 years ago
- The offical repo of "Teaching Time Series to See and Speak: Forecasting with Aligned Visual and Textual Perspectives"☆48Aug 7, 2025Updated 8 months ago
- Scripts for KGIRNet model for ESWC☆10Jul 6, 2023Updated 2 years ago
- A Hindi Image Captioning system made completely with Transformers🤗☆10Apr 16, 2024Updated last year
- [WSDM 2025] Source code for "Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Cali…☆14Oct 14, 2025Updated 6 months ago
- ☆15Dec 20, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This is a multi-modal fusion method based on VGG16 and FastText for identifying useful information collected from social media platforms.…☆15Mar 4, 2022Updated 4 years ago
- Sentence embedding using Smooth Inverse Frequency weighting scheme☆15Feb 21, 2020Updated 6 years ago
- PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers"☆50Aug 27, 2021Updated 4 years ago
- The implementation of paper "Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction", WSDM'24.☆17Oct 30, 2025Updated 5 months ago
- ☆27Feb 26, 2023Updated 3 years ago
- XCon: Learning with Experts for Fine-grained Category Discovery☆19Dec 19, 2022Updated 3 years ago
- Classification of tamil news headlines - experimental☆13Feb 21, 2019Updated 7 years ago