facebookresearch/mmbt

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/mmbt)

facebookresearch / mmbt

Supervised Multimodal Bitransformers for Classifying Images and Text

☆256

Alternatives and similar repositories for mmbt

Users that are interested in mmbt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

johnarevalo / gmu-mmimdb
View on GitHub
Source code for training Gated Multimodal Units on MM-IMDb dataset
☆103Apr 8, 2023Updated 3 years ago
uclanlp / visualbert
View on GitHub
Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"
☆542May 1, 2023Updated 3 years ago
WasifurRahman / BERT_multimodal_transformer
View on GitHub
☆220Dec 5, 2021Updated 4 years ago
ChenRocks / UNITER
View on GitHub
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
☆799Jun 30, 2021Updated 5 years ago
facebookresearch / fair-sslime
View on GitHub
FAIR Self-Supervised Learning Integrated Multi-modal Environment (SSLIME)
☆69Feb 3, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
vkeswani / IITK_Memotion_Analysis
View on GitHub
Bimodal and Unimodal Sentiment Analysis of Internet Memes (Image+Text)
☆16Oct 3, 2021Updated 4 years ago
jiasenlu / vilbert_beta
View on GitHub
☆478Nov 21, 2022Updated 3 years ago
facebookresearch / mmf
View on GitHub
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
☆5,636Jul 7, 2026Updated 3 weeks ago
airsplay / lxmert
View on GitHub
PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".
☆965Oct 22, 2022Updated 3 years ago
XL2248 / AGDT
View on GitHub
Code for "A Novel Aspect-Guided Deep Transition Model for Aspect Based Sentiment Analysis." on EMNLP 2019.
☆21Dec 22, 2019Updated 6 years ago
Justin1904 / TensorFusionNetworks
View on GitHub
Pytorch Implementation of Tensor Fusion Networks for multimodal sentiment analysis.
☆193Apr 5, 2020Updated 6 years ago
soujanyaporia / multimodal-sentiment-analysis
View on GitHub
Attention-based multimodal fusion for sentiment analysis
☆367Apr 8, 2024Updated 2 years ago
honglizhan / CovidET
View on GitHub
This repo contains the dataset for the EMNLP 2022 paper "Why Do You Feel This Way? Summarizing Triggers of Emotions in Social Media Posts…
☆19Oct 9, 2023Updated 2 years ago
anita-hu / MSAF
View on GitHub
Offical implementation of paper "MSAF: Multimodal Split Attention Fusion"
☆80Jun 16, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Axe-- / Visual-Question-Answering
View on GitHub
PyTorch Implementation of VQA Baseline & Hierarchical Co-Attention model
☆16Oct 3, 2023Updated 2 years ago
pliang279 / awesome-multimodal-ml
View on GitHub
Reading list for research topics in multimodal machine learning
☆6,913Aug 20, 2024Updated last year
yaohungt / Multimodal-Transformer
View on GitHub
[ACL'19] [PyTorch] Multimodal Transformer
☆993Sep 12, 2022Updated 3 years ago
lyeoni / KorQuAD
View on GitHub
KorQuAD (Korean Question Answering Dataset) submission guide using PyTorch pretrained BERT
☆31Jun 18, 2019Updated 7 years ago
MIND-Lab / SemEval2022-Task-5-Multimedia-Automatic-Misogyny-Identification-MAMI-
View on GitHub
SemEval 2022 Task 5: Multimedia Automatic Misogyny Identification - baseline models and dataset
☆15Nov 22, 2022Updated 3 years ago
ModuNLP / hacking_transformers
View on GitHub
☆11Aug 12, 2020Updated 5 years ago
anthonyhu / tumblr-emotions
View on GitHub
Code for the KDD 2018 paper "Multimodal Sentiment Analysis to Explore the Structure of Emotions".
☆51May 28, 2018Updated 8 years ago
skywaLKer518 / MultiplicativeMultimodal
View on GitHub
☆30Mar 2, 2018Updated 8 years ago
BAAI-WuDao / BriVL
View on GitHub
Bridging Vision and Language Model
☆286Mar 27, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
pliang279 / factorized
View on GitHub
[ICLR 2019] Learning Factorized Multimodal Representations
☆69Aug 4, 2020Updated 5 years ago
steven95421 / KDD_WinnieTheBest
View on GitHub
KDD Cup 2020 Challenges for Modern E-Commerce Platform: Multimodalities Recall first place
☆190Jul 22, 2020Updated 6 years ago
firojalam / multimodal_social_media
View on GitHub
multimodal social media content (text, image) classification
☆53Jun 22, 2022Updated 4 years ago
yalesong / pvse
View on GitHub
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
☆135Mar 15, 2024Updated 2 years ago
VarnithChordia / Multimodal_Classification_Co_Attention
View on GitHub
Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models
☆19Aug 17, 2020Updated 5 years ago
kuanghuei / SCAN
View on GitHub
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
☆579May 18, 2023Updated 3 years ago
zuokai / KDDCUP_2020_MultimodalitiesRecall_2nd_Place
View on GitHub
☆133Dec 8, 2022Updated 3 years ago
Wangt-CN / MTFN-RR-PyTorch-Code
View on GitHub
The offical code for paper "Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking", ACM Multimedia 2019 Oral
☆67Sep 28, 2019Updated 6 years ago
Justin1904 / Low-rank-Multimodal-Fusion
View on GitHub
This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et. al. ACL 2018
☆275May 31, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
facebookresearch / vizseq
View on GitHub
An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)
☆453Updated this week
facebookresearch / unlikelihood_training
View on GitHub
Neural Text Generation with Unlikelihood Training
☆311Aug 31, 2021Updated 4 years ago
artelab / Multi-modal-classification
View on GitHub
This project contains the code of the implementation of the approach proposed in I. Gallo, A. Calefati, S. Nawaz and M.K. Janjua, "Image …
☆22Apr 10, 2019Updated 7 years ago
jefferyYu / TomBERT
View on GitHub
Dataset and codes for our IJCAI 2019 paper "Adapting BERT for Target-Oriented Multimodal Sentiment Classification"
☆87Mar 31, 2020Updated 6 years ago
facebookresearch / spreadingvectors
View on GitHub
Open source implementation of "Spreading Vectors for Similarity Search"
☆322Aug 13, 2021Updated 4 years ago
david-gimeno / tailored-avsr
View on GitHub
Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
☆15Feb 24, 2025Updated last year
Multimodal-NER / RpBERT
View on GitHub
RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER
☆76Mar 31, 2023Updated 3 years ago