MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
☆3,151Jul 7, 2025Updated 10 months ago
Alternatives and similar repositories for MiniMax-M1
Users that are interested in MiniMax-M1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention☆3,419Jul 7, 2025Updated 10 months ago
- Kimi K2 is the large language model series developed by Moonshot AI team☆10,775Jan 21, 2026Updated 4 months ago
- MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining☆2,117Jun 5, 2025Updated 11 months ago
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆200Jul 7, 2025Updated 10 months ago
- Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video ge…☆1,480May 14, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Muon is Scalable for LLM Training☆1,480Aug 3, 2025Updated 9 months ago
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆27,228Jan 9, 2026Updated 4 months ago
- ☆814Jun 9, 2025Updated 11 months ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆753Jun 6, 2025Updated 11 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆80,418Updated this week
- Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities☆1,186Jul 15, 2025Updated 10 months ago
- open-source coding LLM for software engineering tasks☆1,217Sep 30, 2025Updated 7 months ago
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning☆333May 31, 2025Updated 11 months ago
- GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models☆4,343Feb 1, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.☆64,485Updated this week
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,259Aug 27, 2025Updated 8 months ago
- Scaling RL on advanced reasoning models☆679Oct 20, 2025Updated 7 months ago
- Democratizing Reinforcement Learning for LLMs☆5,548Updated this week
- ☆3,471Mar 7, 2025Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆19,293Nov 19, 2025Updated 6 months ago
- SGLang is a high-performance serving framework for large language models and multimodal models.☆27,836May 15, 2026Updated last week
- Open-source unified multimodal model☆5,925May 4, 2026Updated 2 weeks ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆10,211May 5, 2026Updated 2 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆19,799May 15, 2026Updated last week
- Universal memory layer for AI Agents☆56,013Updated this week
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework☆21,337May 16, 2026Updated last week
- MoBA: Mixture of Block Attention for Long-Context LLMs☆2,120Apr 3, 2025Updated last year
- Fully open reproduction of DeepSeek-R1☆26,020Apr 2, 2026Updated last month
- Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture.☆812Jul 8, 2025Updated 10 months ago
- No fortress, purely open ground. OpenManus is Coming.☆56,345Feb 11, 2026Updated 3 months ago
- The Autonomous Company Operating System☆19,764Updated this week
- Official repository for LTX-Video☆10,292Jan 5, 2026Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving stat…☆1,574Jun 14, 2025Updated 11 months ago
- Wan: Open and Advanced Large-Scale Video Generative Models☆16,088Mar 5, 2026Updated 2 months ago
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit …☆363May 21, 2025Updated last year
- 🙌 OpenHands: AI-Driven Development☆73,913Updated this week
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆19,193Jan 30, 2026Updated 3 months ago
- Everything about the SmolLM and SmolVLM family of models☆3,777Apr 2, 2026Updated last month
- FlashMLA: Efficient Multi-head Latent Attention Kernels☆12,657Apr 30, 2026Updated 3 weeks ago