IsaacRodgz/ConcatBERT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/IsaacRodgz/ConcatBERT)

IsaacRodgz / ConcatBERT

Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and image representation obtained from VGG16 pretrained model.

☆43

Alternatives and similar repositories for ConcatBERT

Users that are interested in ConcatBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

terenceylchow124 / Meme-MultiModal
View on GitHub
Multimodal Model for Memotion Dataset
☆12May 17, 2021Updated 5 years ago
midas-research / hyperbolic-tlstm-sigir
View on GitHub
☆14Jul 12, 2021Updated 5 years ago
artelab / Image-and-Text-fusion-for-UPMC-Food-101-using-BERT-and-CNNs
View on GitHub
☆64Jun 25, 2021Updated 5 years ago
Nithin-Holla / meme_challenge
View on GitHub
Repository containing code from team Kingsterdam for the Hateful Memes Challenge
☆23Oct 24, 2022Updated 3 years ago
Tim-101 / Text-and-Image-Classification
View on GitHub
Classify image and text with ResNet and BERT models using Pytorch
☆13Jul 7, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
schmidtbri / using-ml-model-abc
View on GitHub
Code showing how to use a model based on the ML model base class.
☆10Sep 30, 2022Updated 3 years ago
drivendataorg / hateful-memes
View on GitHub
☆69Sep 7, 2023Updated 2 years ago
HAWLYQ / Qc-TextCap
View on GitHub
☆16Dec 25, 2021Updated 4 years ago
Multimodal-NER / RpBERT
View on GitHub
RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER
☆76Mar 31, 2023Updated 3 years ago
webYFDT / hateful
View on GitHub
☆11May 18, 2022Updated 4 years ago
code-chendl / HFIR
View on GitHub
The source code and manually annotated datasets for our paper "Joint Multimodal Sentiment Analysis Based on Information Relevance"
☆11Dec 17, 2022Updated 3 years ago
val-iisc / Saddle-LongTail
View on GitHub
[NeurIPS 2022] Source code for our paper "Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data"
☆25Oct 16, 2023Updated 2 years ago
kaustubhdhole / natural-dont-know
View on GitHub
Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries
☆19Nov 29, 2021Updated 4 years ago
firojalam / multimodal_social_media
View on GitHub
multimodal social media content (text, image) classification
☆53Jun 22, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
aditya10 / VLC-BERT
View on GitHub
Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"
☆21May 8, 2023Updated 3 years ago
claws-lab / projection-in-MLLMs
View on GitHub
Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'
☆18Jul 21, 2024Updated 2 years ago
rpeloff / multimodal_one_shot_learning
View on GitHub
Code recipe for "Multimodal One-Shot Learning of Speech and Images"
☆11Nov 21, 2018Updated 7 years ago
gchhablani / multilingual-image-captioning
View on GitHub
☆43Aug 2, 2021Updated 4 years ago
simonepri / fever-transformers
View on GitHub
📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks
☆12Feb 21, 2020Updated 6 years ago
ShivamShrirao / facegan_pytorch
View on GitHub
Generate 256x256, 512x512 resolution images with simple Convolutional GAN by adding Gaussian noise to discriminator layers.
☆10Jul 11, 2021Updated 5 years ago
bharathichezhiyan / Multimodal-Meme-Classification-Identifying-Offensive-Content-in-Image-and-Text
View on GitHub
Multimodal Meme Classification: Identifying Offensive Content in Image and Text
☆72Dec 8, 2022Updated 3 years ago
IdeasLabUT / EDA-Artifact-Detection
View on GitHub
Python implementations of machine learning algorithms for motion artifact detection in electrodermal activity (EDA) data
☆17Jul 27, 2017Updated 8 years ago
ExplainableML / CLEVR-X
View on GitHub
CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations
☆30Oct 27, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
gkoumasd / MSAF
View on GitHub
Fusion Modality Approaches for sentiment analysis and emotion recognition task.
☆12Feb 5, 2021Updated 5 years ago
seanbenhur / hindi_image_captioning
View on GitHub
A Hindi Image Captioning system made completely with Transformers🤗
☆10Apr 16, 2024Updated 2 years ago
SmartDataAnalytics / kgirnet
View on GitHub
Scripts for KGIRNet model for ESWC
☆10Jul 6, 2023Updated 3 years ago
xueyouluo / wiki-error-extract
View on GitHub
根据维基百科历史编辑数据提取纠错语料。
☆12Apr 6, 2022Updated 4 years ago
EricWWWW / image-caption-metrics
View on GitHub
a py3 lib for NLP & image-caption metrics : BLEU METEOR CIDEr ROUGE SPICE WMD
☆14Sep 13, 2022Updated 3 years ago
PrithivirajDamodaran / Alt-ZSC
View on GitHub
Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…
☆37Apr 5, 2022Updated 4 years ago
allenai / x-lxmert
View on GitHub
PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers"
☆50Aug 27, 2021Updated 4 years ago
Neon-Jing / Guider
View on GitHub
[WSDM 2025] Source code for "Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Cali…
☆14Oct 14, 2025Updated 9 months ago
dh1105 / Multi-modal-movie-genre-prediction
View on GitHub
A multi-modal deep learning model trained to predict a movie's genre given the movie poster and overview as an input.
☆13May 18, 2020Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Cloud-CV / vilbert-multi-task
View on GitHub
12-in-1: Multi-Task Vision and Language Representation Learning Web Demo
☆35Dec 8, 2022Updated 3 years ago
setu4993 / convert-labse-tf-pt
View on GitHub
Convert LaBSE model from TF Hub to PyTorch.
☆15Jan 15, 2026Updated 6 months ago
sajastu / reddit_collector
View on GitHub
Reddit Collector and Text Processor
☆24Sep 7, 2022Updated 3 years ago
wavewangyue / mae
View on GitHub
基于多模态的属性抽取
☆46Aug 6, 2020Updated 5 years ago
Uchman21 / MLGW
View on GitHub
Code for the paper "Collaborative Graph Walk for Semi-supervised Multi-Label Node Classification" - ICDM 2019
☆13Mar 25, 2023Updated 3 years ago
Ifiokcharles / COVID-19-Anti-viral-cure-using-deep-reinforcement-learning
View on GitHub
☆20Mar 1, 2020Updated 6 years ago
cuilimeng / DETERRENT
View on GitHub
☆30Jun 25, 2020Updated 6 years ago