A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
☆5,635Jun 17, 2026Updated 2 weeks ago
Alternatives and similar repositories for mmf
Users that are interested in mmf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A natural language modeling framework based on PyTorch☆6,297Oct 17, 2022Updated 3 years ago
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.☆769Mar 10, 2024Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,233Sep 30, 2025Updated 9 months ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome☆1,469Feb 3, 2023Updated 3 years ago
- An open-source NLP research library, built on PyTorch.☆11,890Nov 22, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".☆967Oct 22, 2022Updated 3 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,929Feb 14, 2023Updated 3 years ago
- Grid features pre-training code for visual question answering☆269Sep 17, 2021Updated 4 years ago
- Oscar and VinVL☆1,054Aug 28, 2023Updated 2 years ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.☆10,623Nov 3, 2023Updated 2 years ago
- Bilinear attention networks for visual question answering☆549Oct 30, 2023Updated 2 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,182May 28, 2023Updated 3 years ago
- An end-to-end PyTorch framework for image and video classification☆1,608Jun 27, 2024Updated 2 years ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,972Jun 22, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.☆9,370Feb 16, 2023Updated 3 years ago
- Multi Task Vision and Language☆825Feb 16, 2022Updated 4 years ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,692Jun 20, 2026Updated last week
- Generate embeddings from large-scale graph-structured data.☆3,458Mar 3, 2024Updated 2 years ago
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.☆31,209Jun 10, 2026Updated 3 weeks ago
- ☆478Nov 21, 2022Updated 3 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,152Jan 23, 2026Updated 5 months ago
- Visual Question Answering in Pytorch☆733Dec 11, 2019Updated 6 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,296Mar 3, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- End-to-End Object Detection with Transformers☆15,322Mar 12, 2024Updated 2 years ago
- The Natural Language Decathlon: A Multitask Challenge for NLP☆2,338May 1, 2025Updated last year
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆34,589Jun 7, 2026Updated 3 weeks ago
- Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".☆745May 22, 2023Updated 3 years ago
- TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.☆1,724Jun 22, 2026Updated last week
- Recent Advances in Vision and Language PreTrained Models (VL-PTMs)☆1,157Aug 19, 2022Updated 3 years ago
- High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.☆4,769Updated this week
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,925Updated this week
- Debugging, monitoring and visualization for Python Machine Learning and Data Science☆3,469Mar 30, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Google Research☆38,238Jun 24, 2026Updated last week
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,379Oct 27, 2025Updated 8 months ago
- Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"☆543May 1, 2023Updated 3 years ago
- Reading list for research topics in multimodal machine learning☆6,892Aug 20, 2024Updated last year
- PyTorch extensions for high performance and large scale training.☆3,409Apr 26, 2025Updated last year
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆800Jun 30, 2021Updated 5 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,957Jul 28, 2024Updated last year