Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and image representation obtained from VGG16 pretrained model.
☆42Aug 26, 2022Updated 3 years ago
Alternatives and similar repositories for ConcatBERT
Users that are interested in ConcatBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Facebook Hatebook Memes Challenge☆12Jan 28, 2021Updated 5 years ago
- Multimodal Model for Memotion Dataset☆12May 17, 2021Updated 4 years ago
- ☆15Jul 12, 2021Updated 4 years ago
- Repository containing code from team Kingsterdam for the Hateful Memes Challenge☆23Oct 24, 2022Updated 3 years ago
- Classify image and text with ResNet and BERT models using Pytorch☆13Jul 7, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code showing how to use a model based on the ML model base class.☆10Sep 30, 2022Updated 3 years ago
- ☆67Sep 7, 2023Updated 2 years ago
- ☆16Dec 25, 2021Updated 4 years ago
- RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER☆76Mar 31, 2023Updated 2 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform☆17Jul 2, 2020Updated 5 years ago
- Official repository for ACM Multimedia'24 paper "MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube a…☆21Aug 11, 2024Updated last year
- ☆11May 18, 2022Updated 3 years ago
- The source code and manually annotated datasets for our paper "Joint Multimodal Sentiment Analysis Based on Information Relevance"☆11Dec 17, 2022Updated 3 years ago
- [NeurIPS 2022] Source code for our paper "Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data"☆24Oct 16, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Nov 29, 2021Updated 4 years ago
- Mouse-side-button voice input for VibeCoding on Linux.☆53Mar 8, 2026Updated 2 weeks ago
- Image inpainting based on LAMA☆13Jul 4, 2022Updated 3 years ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21May 8, 2023Updated 2 years ago
- Python 3 support for the MS COCO caption evaluation tools☆14Jun 14, 2024Updated last year
- ☆44Aug 2, 2021Updated 4 years ago
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'☆19Jul 21, 2024Updated last year
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- Generate 256x256, 512x512 resolution images with simple Convolutional GAN by adding Gaussian noise to discriminator layers.☆10Jul 11, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Order-agnostic Identifier for Large Language Model-based Generative Recommendation (SIGIR'25)☆26Oct 21, 2025Updated 5 months ago
- CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations☆29Oct 27, 2023Updated 2 years ago
- Fusion Modality Approaches for sentiment analysis and emotion recognition task.☆12Feb 5, 2021Updated 5 years ago
- 根据维基百科历史编辑数据提取纠错语料。☆12Apr 6, 2022Updated 3 years ago
- Scripts for KGIRNet model for ESWC☆10Jul 6, 2023Updated 2 years ago
- A Hindi Image Captioning system made completely with Transformers🤗☆10Apr 16, 2024Updated last year
- ☆15Dec 20, 2020Updated 5 years ago
- This repositary hosts my experiments for the project, I did with OffNote Labs.☆10Apr 12, 2021Updated 4 years ago
- This is a multi-modal fusion method based on VGG16 and FastText for identifying useful information collected from social media platforms.…☆15Mar 4, 2022Updated 4 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Sentence embedding using Smooth Inverse Frequency weighting scheme☆15Feb 21, 2020Updated 6 years ago
- PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers"☆50Aug 27, 2021Updated 4 years ago
- a py3 lib for NLP & image-caption metrics : BLEU METEOR CIDEr ROUGE SPICE WMD☆14Sep 13, 2022Updated 3 years ago
- The implementation of paper "Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction", WSDM'24.☆17Oct 30, 2025Updated 4 months ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Apr 5, 2022Updated 3 years ago
- ☆27Feb 26, 2023Updated 3 years ago
- CNN based on images from Kaggle's FER2013 competition, achieving 67.59% accuracy on the final test set - equivalent of the 5th place on t…☆12Jul 26, 2018Updated 7 years ago