A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
☆5,627Apr 7, 2026Updated 3 weeks ago
Alternatives and similar repositories for mmf
Users that are interested in mmf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A natural language modeling framework based on PyTorch☆6,301Oct 17, 2022Updated 3 years ago
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.☆769Mar 10, 2024Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,212Sep 30, 2025Updated 7 months ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome☆1,467Feb 3, 2023Updated 3 years ago
- An open-source NLP research library, built on PyTorch.☆11,891Nov 22, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".☆967Oct 22, 2022Updated 3 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,932Feb 14, 2023Updated 3 years ago
- Grid features pre-training code for visual question answering☆269Sep 17, 2021Updated 4 years ago
- Oscar and VinVL☆1,053Aug 28, 2023Updated 2 years ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.☆10,632Nov 3, 2023Updated 2 years ago
- Bilinear attention networks for visual question answering☆547Oct 30, 2023Updated 2 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,177May 28, 2023Updated 2 years ago
- An end-to-end PyTorch framework for image and video classification☆1,613Jun 27, 2024Updated last year
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,953Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.☆9,378Feb 16, 2023Updated 3 years ago
- Multi Task Vision and Language☆825Feb 16, 2022Updated 4 years ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,692Dec 1, 2025Updated 5 months ago
- Generate embeddings from large-scale graph-structured data.☆3,458Mar 3, 2024Updated 2 years ago
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.☆31,104Updated this week
- ☆478Nov 21, 2022Updated 3 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,107Jan 23, 2026Updated 3 months ago
- Visual Question Answering in Pytorch☆734Dec 11, 2019Updated 6 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,296Mar 3, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- End-to-End Object Detection with Transformers☆15,246Mar 12, 2024Updated 2 years ago
- The Natural Language Decathlon: A Multitask Challenge for NLP☆2,340May 1, 2025Updated last year
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆34,422Apr 7, 2026Updated 3 weeks ago
- Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".☆746May 22, 2023Updated 2 years ago
- TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.☆1,716Updated this week
- Recent Advances in Vision and Language PreTrained Models (VL-PTMs)☆1,157Aug 19, 2022Updated 3 years ago
- High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.☆4,753Updated this week
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,713Apr 24, 2026Updated last week
- Debugging, monitoring and visualization for Python Machine Learning and Data Science☆3,468Mar 30, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Google Research☆37,825Updated this week
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,370Oct 27, 2025Updated 6 months ago
- Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"☆543May 1, 2023Updated 3 years ago
- Reading list for research topics in multimodal machine learning☆6,865Aug 20, 2024Updated last year
- PyTorch extensions for high performance and large scale training.☆3,409Apr 26, 2025Updated last year
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆801Jun 30, 2021Updated 4 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,970Jul 28, 2024Updated last year