facebookresearch / Mixture-of-TransformersView external linksLinks
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.
☆162Sep 13, 2025Updated 5 months ago
Alternatives and similar repositories for Mixture-of-Transformers
Users that are interested in Mixture-of-Transformers are comparing it to the libraries listed below
Sorting:
- Quantization of LLMs and benchmarking.☆10Apr 3, 2024Updated last year
- MutiModel paper reading (Visual, Audio)☆21Nov 24, 2025Updated 2 months ago
- ☆22Sep 16, 2025Updated 5 months ago
- ☆10Nov 19, 2015Updated 10 years ago
- OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning☆27May 24, 2025Updated 8 months ago
- AgentHub is the only SDK you need to connect to state-of-the-art LLMs (GPT-5.2/Claude 4.5/Gemini 3).☆50Updated this week
- This is a framework for evaluating reasoning in foundational Video Models.☆49Feb 10, 2026Updated last week
- Production-ready Supabase self-hosting with Docker Compose, Swarm & Portainer. Complete wiki documentation, automated setup scripts, and …☆36Oct 5, 2025Updated 4 months ago
- CLI and library for translating OTP (One-time-password) archives between different OTP apps.☆22Sep 29, 2025Updated 4 months ago
- ☆23Jan 24, 2024Updated 2 years ago
- Reference implementation of DecDTW in PyTorch (ICLR 2023)☆24May 29, 2023Updated 2 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Jun 13, 2023Updated 2 years ago
- This repository contains datasets for Bangla Sign Language characters and digits collected from different institutes near Dhaka, Banglade…☆19Feb 14, 2019Updated 7 years ago
- ☆29Jul 25, 2025Updated 6 months ago
- ☆19Apr 16, 2022Updated 3 years ago
- This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language M…☆24Apr 27, 2025Updated 9 months ago
- Multimodal RewardBench☆61Feb 21, 2025Updated 11 months ago
- [NeurIPS 2025] The official repository for our paper, "Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reason…☆152Sep 12, 2025Updated 5 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆30Jun 12, 2025Updated 8 months ago
- A Simple, Modular and Unified Real Robot Control Interface☆45Feb 1, 2026Updated 2 weeks ago
- [CoRL 2025] Robot Learning from Any Images☆34Nov 11, 2025Updated 3 months ago
- Project page of "GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation"☆21Apr 3, 2023Updated 2 years ago
- H.AI cookbook provides code examples and guides to help developers use models developed by H Company.☆65Feb 3, 2026Updated 2 weeks ago
- ☆34Jul 8, 2025Updated 7 months ago
- Bangla Unicode Normalization☆22May 26, 2024Updated last year
- ☆24Nov 27, 2021Updated 4 years ago
- Alignment examples for Interspeech 2024☆27Jul 5, 2024Updated last year
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆26Jan 14, 2025Updated last year
- ☆27Sep 15, 2020Updated 5 years ago
- [SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Di…☆62Nov 7, 2024Updated last year
- ☆53Mar 19, 2021Updated 4 years ago
- Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"☆52Dec 18, 2025Updated 2 months ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Nov 2, 2021Updated 4 years ago
- ☆59Sep 14, 2024Updated last year
- The repository contains code for Adaptive Data Optimization☆32Dec 9, 2024Updated last year
- Esoteric Language Models☆111Feb 8, 2026Updated last week
- ☆189Dec 17, 2024Updated last year
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆46Aug 26, 2025Updated 5 months ago
- 用于微调LLM的中文指令数据集☆28Apr 12, 2023Updated 2 years ago