A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
☆5,623 Mar 16, 2026 Updated this week
Alternatives and similar repositories for mmf
Users interested in mmf compare it with the libraries listed below
- A natural language modeling framework based on PyTorch ☆6,306 Oct 17, 2022 Updated 3 years ago
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge. ☆766 Mar 10, 2024 Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆32,190 Sep 30, 2025 Updated 5 months ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome ☆1,467 Feb 3, 2023 Updated 3 years ago
- An open-source NLP research library, built on PyTorch. ☆11,893 Nov 22, 2022 Updated 3 years ago
- PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers". ☆966 Oct 22, 2022 Updated 3 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining. ☆2,927 Feb 14, 2023 Updated 3 years ago
- Grid features pre-training code for visual question answering ☆269 Sep 17, 2021 Updated 4 years ago
- Oscar and VinVL ☆1,052 Aug 28, 2023 Updated 2 years ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets. ☆10,627 Nov 3, 2023 Updated 2 years ago
- Bilinear attention networks for visual question answering ☆548 Oct 30, 2023 Updated 2 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding ☆6,176 May 28, 2023 Updated 2 years ago
- An end-to-end PyTorch framework for image and video classification ☆1,613 Jun 27, 2024 Updated last year
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch ☆8,936 Updated this week
- Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch. ☆9,386 Feb 16, 2023 Updated 3 years ago
- Multi Task Vision and Language ☆825 Feb 16, 2022 Updated 4 years ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,679 Dec 1, 2025 Updated 3 months ago
- Generate embeddings from large-scale graph-structured data. ☆3,459 Mar 3, 2024 Updated 2 years ago
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes. ☆30,952 Updated this week
- ☆478 Nov 21, 2022 Updated 3 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities ☆22,046 Jan 23, 2026 Updated 2 months ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images. ☆3,294 Mar 3, 2024 Updated 2 years ago
- Visual Question Answering in Pytorch ☆735 Dec 11, 2019 Updated 6 years ago
- End-to-End Object Detection with Transformers ☆15,166 Mar 12, 2024 Updated 2 years ago
- The Natural Language Decathlon: A Multitask Challenge for NLP ☆2,339 May 1, 2025 Updated 10 months ago
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks. ☆34,223 Updated this week
- Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations". ☆746 May 22, 2023 Updated 2 years ago
- TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale. ☆1,706 Updated this week
- Recent Advances in Vision and Language PreTrained Models (VL-PTMs) ☆1,155 Aug 19, 2022 Updated 3 years ago
- High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently. ☆4,748 Updated this week
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --… ☆36,504 Mar 13, 2026 Updated last week
- Debugging, monitoring and visualization for Python Machine Learning and Data Science ☆3,467 Mar 6, 2026 Updated 2 weeks ago
- Google Research ☆37,494 Updated this week
- Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language" ☆541 May 1, 2023 Updated 2 years ago
- Reading list for research topics in multimodal machine learning ☆6,843 Aug 20, 2024 Updated last year
- A very simple framework for state-of-the-art Natural Language Processing (NLP) ☆14,352 Oct 27, 2025 Updated 4 months ago
- PyTorch extensions for high performance and large scale training. ☆3,404 Apr 26, 2025 Updated 10 months ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning" ☆800 Jun 30, 2021 Updated 4 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo… ☆22,975 Jul 28, 2024 Updated last year