A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
☆5,628 · Apr 7, 2026 · Updated this week
Alternatives and similar repositories for mmf
Users that are interested in mmf are comparing it to the libraries listed below.
- A natural language modeling framework based on PyTorch ☆6,304 · Oct 17, 2022 · Updated 3 years ago
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge. ☆766 · Mar 10, 2024 · Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆32,201 · Sep 30, 2025 · Updated 6 months ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome ☆1,466 · Feb 3, 2023 · Updated 3 years ago
- An open-source NLP research library, built on PyTorch. ☆11,893 · Nov 22, 2022 · Updated 3 years ago
- PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers". ☆966 · Oct 22, 2022 · Updated 3 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining. ☆2,928 · Feb 14, 2023 · Updated 3 years ago
- Grid features pre-training code for visual question answering ☆269 · Sep 17, 2021 · Updated 4 years ago
- Oscar and VinVL ☆1,051 · Aug 28, 2023 · Updated 2 years ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets. ☆10,631 · Nov 3, 2023 · Updated 2 years ago
- Bilinear attention networks for visual question answering ☆548 · Oct 30, 2023 · Updated 2 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding ☆6,178 · May 28, 2023 · Updated 2 years ago
- An end-to-end PyTorch framework for image and video classification ☆1,614 · Jun 27, 2024 · Updated last year
- A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch ☆8,947 · Updated this week
- Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch. ☆9,383 · Feb 16, 2023 · Updated 3 years ago
- Multi Task Vision and Language ☆825 · Feb 16, 2022 · Updated 4 years ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,686 · Dec 1, 2025 · Updated 4 months ago
- Generate embeddings from large-scale graph-structured data. ☆3,459 · Mar 3, 2024 · Updated 2 years ago
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes. ☆30,990 · Apr 1, 2026 · Updated last week
- ☆478 · Nov 21, 2022 · Updated 3 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities ☆22,077 · Jan 23, 2026 · Updated 2 months ago
- Visual Question Answering in Pytorch ☆735 · Dec 11, 2019 · Updated 6 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images. ☆3,295 · Mar 3, 2024 · Updated 2 years ago
- End-to-End Object Detection with Transformers ☆15,209 · Mar 12, 2024 · Updated 2 years ago
- The Natural Language Decathlon: A Multitask Challenge for NLP ☆2,339 · May 1, 2025 · Updated 11 months ago
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks. ☆34,284 · Mar 21, 2026 · Updated 3 weeks ago
- Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations". ☆746 · May 22, 2023 · Updated 2 years ago
- TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale. ☆1,712 · Mar 30, 2026 · Updated last week
- Recent Advances in Vision and Language PreTrained Models (VL-PTMs) ☆1,156 · Aug 19, 2022 · Updated 3 years ago
- High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently. ☆4,751 · Apr 2, 2026 · Updated last week
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --… ☆36,603 · Apr 3, 2026 · Updated last week
- Debugging, monitoring and visualization for Python Machine Learning and Data Science ☆3,466 · Mar 30, 2026 · Updated last week
- Google Research ☆37,679 · Updated this week
- A very simple framework for state-of-the-art Natural Language Processing (NLP) ☆14,361 · Oct 27, 2025 · Updated 5 months ago
- Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language" ☆542 · May 1, 2023 · Updated 2 years ago
- Reading list for research topics in multimodal machine learning ☆6,855 · Aug 20, 2024 · Updated last year
- PyTorch extensions for high performance and large scale training. ☆3,404 · Apr 26, 2025 · Updated 11 months ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning" ☆800 · Jun 30, 2021 · Updated 4 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo… ☆22,973 · Jul 28, 2024 · Updated last year