Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and image representation obtained from VGG16 pretrained model.
☆43Aug 26, 2022Updated 3 years ago
Alternatives and similar repositories for ConcatBERT
Users that are interested in ConcatBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Jul 12, 2021Updated 4 years ago
- ☆64Jun 25, 2021Updated 4 years ago
- Repository containing code from team Kingsterdam for the Hateful Memes Challenge☆23Oct 24, 2022Updated 3 years ago
- Classify image and text with ResNet and BERT models using Pytorch☆13Jul 7, 2020Updated 5 years ago
- ☆68Sep 7, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆16Dec 25, 2021Updated 4 years ago
- ☆10Nov 15, 2021Updated 4 years ago
- RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER☆76Mar 31, 2023Updated 3 years ago
- ☆11May 18, 2022Updated 3 years ago
- [NeurIPS 2022] Source code for our paper "Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data"☆24Oct 16, 2023Updated 2 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Nov 29, 2021Updated 4 years ago
- Mouse-side-button voice input for VibeCoding on Linux.☆59Apr 2, 2026Updated last month
- multimodal social media content (text, image) classification☆51Jun 22, 2022Updated 3 years ago
- Prompting For Named Entity Recognition☆19Sep 6, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21May 8, 2023Updated 2 years ago
- Python 3 support for the MS COCO caption evaluation tools☆14Jun 14, 2024Updated last year
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'☆18Jul 21, 2024Updated last year
- ☆44Aug 2, 2021Updated 4 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- Order-agnostic Identifier for Large Language Model-based Generative Recommendation (SIGIR'25)☆30Oct 21, 2025Updated 6 months ago
- CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations☆30Oct 27, 2023Updated 2 years ago
- Fusion Modality Approaches for sentiment analysis and emotion recognition task.☆12Feb 5, 2021Updated 5 years ago
- Курс по машинному обучению для магистров компьютерной лингвистики 1-го курса в Высшей Школе Экономики☆16May 13, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Hindi Image Captioning system made completely with Transformers🤗☆10Apr 16, 2024Updated 2 years ago
- [WSDM 2025] Source code for "Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Cali…☆14Oct 14, 2025Updated 6 months ago
- ☆15Dec 20, 2020Updated 5 years ago
- This repositary hosts my experiments for the project, I did with OffNote Labs.☆10Apr 12, 2021Updated 5 years ago
- PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers"☆50Aug 27, 2021Updated 4 years ago
- The implementation of paper "Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction", WSDM'24.☆17Oct 30, 2025Updated 6 months ago
- This is the repo for the work "Where and What: Driver Attention-based Object Detection".☆10May 10, 2022Updated 3 years ago
- ☆27Feb 26, 2023Updated 3 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Apr 5, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- XCon: Learning with Experts for Fine-grained Category Discovery☆19Dec 19, 2022Updated 3 years ago
- Code for paper "Aiding Intra-Text Representations with Visual Context for Multimodal Named Entity Recognition"☆16Aug 19, 2019Updated 6 years ago
- The implementation of the paper: Clifford Group Equivariant Simplicial Message Passing Networks @ ICLR2024☆17May 29, 2024Updated last year
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Apr 21, 2022Updated 4 years ago
- Convert LaBSE model from TF Hub to PyTorch.☆15Jan 15, 2026Updated 3 months ago
- MMRA: Predicting Micro-video Popularity via Multi-modal Retrieval Augmentation, ACM SIGIR Conference on Research and Development in Infor…☆23Feb 7, 2026Updated 2 months ago
- ☆13Feb 7, 2019Updated 7 years ago