Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.
☆212Sep 13, 2025Updated 8 months ago
Alternatives and similar repositories for Mixture-of-Transformers
Users that are interested in Mixture-of-Transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 26] DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆41Aug 3, 2025Updated 9 months ago
- Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025☆34Apr 21, 2025Updated last year
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025)☆16Jan 6, 2026Updated 4 months ago
- ☆22Sep 16, 2025Updated 8 months ago
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CoRL 2025] Robot Learning from Any Images☆34Nov 11, 2025Updated 6 months ago
- ☆10Nov 19, 2015Updated 10 years ago
- ☆11May 19, 2025Updated last year
- The official implementation of "Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving"☆113May 9, 2026Updated last week
- CLI and library for translating OTP (One-time-password) archives between different OTP apps.☆22Sep 29, 2025Updated 7 months ago
- OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning☆29May 24, 2025Updated 11 months ago
- Efficient Long-context Language Model Training by Core Attention Disaggregation☆103Apr 7, 2026Updated last month
- Project page of "GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation"☆21Apr 3, 2023Updated 3 years ago
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"☆226May 30, 2025Updated 11 months ago
- An open source implementation of CLIP (With TULIP Support)☆165May 14, 2025Updated last year
- The WorldRWKV project aims to implement training and inference across various modalities using the RWKV7 architecture. By leveraging diff…☆67Mar 18, 2026Updated 2 months ago
- A curated collection of prompts for Grok Imagine by xAI☆29Oct 19, 2025Updated 7 months ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆22Jun 13, 2023Updated 2 years ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆32Jun 12, 2025Updated 11 months ago
- ☆68Feb 4, 2026Updated 3 months ago
- This is a LoRA model finetuned on Wan-I2V-14B-480P. It turns things in the image into fluffy toys.☆19Nov 10, 2025Updated 6 months ago
- The code will come soon.☆16Sep 12, 2025Updated 8 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Welcome to the Background Remover project! This tool allows you to effortlessly replace backgrounds in images and videos, making it perfe…☆11Feb 3, 2024Updated 2 years ago
- MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation☆28Mar 4, 2025Updated last year
- Dion optimizer algorithm☆476May 11, 2026Updated last week
- Multimodal RewardBench☆68Feb 21, 2025Updated last year
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆47Aug 26, 2025Updated 8 months ago
- Production-ready Supabase self-hosting with Docker Compose, Swarm & Portainer. Complete wiki documentation, automated setup scripts, and …☆39Oct 5, 2025Updated 7 months ago
- [NeurIPS 2025] The official repository for our paper, "Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reason…☆155Sep 12, 2025Updated 8 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆167Jan 31, 2025Updated last year
- Standalone repo for our Atropos integration with Thinking Machines Tinker API (https://thinkingmachines.ai/tinker/)☆82Mar 22, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Spirit-v1.5: A Robotic Foundation Model by Spirit AI☆567Apr 23, 2026Updated 3 weeks ago
- ☆34Jun 9, 2025Updated 11 months ago
- SGAP-Net: Semantic-Guided Attentive Prototypes Network for Few-Shot Human-Object Interaction Recognition, AAAI2020.☆14Dec 15, 2020Updated 5 years ago
- This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language M…☆24Apr 27, 2025Updated last year
- This is the official implementation of physics-informed neural networks for functional differential equations (Functional PINN) proposed …☆12Apr 9, 2025Updated last year
- ☆70Jul 8, 2025Updated 10 months ago
- Open-vocabulary Semantic Segmentation☆33Feb 16, 2024Updated 2 years ago