MiniMax-AI/MiniMax-M1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MiniMax-AI/MiniMax-M1)

MiniMax-AI / MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

☆3,151

Alternatives and similar repositories for MiniMax-M1

Users that are interested in MiniMax-M1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MiniMax-AI / MiniMax-01
View on GitHub
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
☆3,419Jul 7, 2025Updated 10 months ago
MoonshotAI / Kimi-K2
View on GitHub
Kimi K2 is the large language model series developed by Moonshot AI team
☆10,775Jan 21, 2026Updated 4 months ago
XiaomiMiMo / MiMo
View on GitHub
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
☆2,117Jun 5, 2025Updated 11 months ago
MiniMax-AI / SynLogic
View on GitHub
[NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
☆200Jul 7, 2025Updated 10 months ago
MiniMax-AI / MiniMax-MCP
View on GitHub
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video ge…
☆1,480May 14, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
MoonshotAI / Moonlight
View on GitHub
Muon is Scalable for LLM Training
☆1,480Aug 3, 2025Updated 9 months ago
QwenLM / Qwen3
View on GitHub
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
☆27,228Jan 9, 2026Updated 4 months ago
ByteDance-Seed / Seed-Thinking-v1.5
View on GitHub
☆814Jun 9, 2025Updated 11 months ago
ByteDance-Seed / Seed-Coder
View on GitHub
Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.
☆753Jun 6, 2025Updated 11 months ago
vllm-project / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆80,418Updated this week
MoonshotAI / Kimi-VL
View on GitHub
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
☆1,186Jul 15, 2025Updated 10 months ago
MoonshotAI / Kimi-Dev
View on GitHub
open-source coding LLM for software engineering tasks
☆1,217Sep 30, 2025Updated 7 months ago
MiniMax-AI / One-RL-to-See-Them-All
View on GitHub
The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning
☆333May 31, 2025Updated 11 months ago
zai-org / GLM-4.5
View on GitHub
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
☆4,343Feb 1, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
unslothai / unsloth
View on GitHub
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
☆64,485Updated this week
sail-sg / understand-r1-zero
View on GitHub
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,259Aug 27, 2025Updated 8 months ago
ChenxinAn-fdu / POLARIS
View on GitHub
Scaling RL on advanced reasoning models
☆679Oct 20, 2025Updated 7 months ago
rllm-org / rllm
View on GitHub
Democratizing Reinforcement Learning for LLMs
☆5,548Updated this week
MoonshotAI / Kimi-k1.5
View on GitHub
☆3,471Mar 7, 2025Updated last year
nari-labs / dia
View on GitHub
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆19,293Nov 19, 2025Updated 6 months ago
sgl-project / sglang
View on GitHub
SGLang is a high-performance serving framework for large language models and multimodal models.
☆27,836May 15, 2026Updated last week
ByteDance-Seed / Bagel
View on GitHub
Open-source unified multimodal model
☆5,925May 4, 2026Updated 2 weeks ago
kyutai-labs / moshi
View on GitHub
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…
☆10,211May 5, 2026Updated 2 weeks ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
camel-ai / owl
View on GitHub
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
☆19,799May 15, 2026Updated last week
mem0ai / mem0
View on GitHub
Universal memory layer for AI Agents
☆56,013Updated this week
verl-project / verl
View on GitHub
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆21,337May 16, 2026Updated last week
MoonshotAI / MoBA
View on GitHub
MoBA: Mixture of Block Attention for Long-Context LLMs
☆2,120Apr 3, 2025Updated last year
huggingface / open-r1
View on GitHub
Fully open reproduction of DeepSeek-R1
☆26,020Apr 2, 2026Updated last month
Tencent-Hunyuan / Hunyuan-A13B
View on GitHub
Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture.
☆812Jul 8, 2025Updated 10 months ago
FoundationAgents / OpenManus
View on GitHub
No fortress, purely open ground. OpenManus is Coming.
☆56,345Feb 11, 2026Updated 3 months ago
kortix-ai / suna
View on GitHub
The Autonomous Company Operating System
☆19,764Updated this week
Lightricks / LTX-Video
View on GitHub
Official repository for LTX-Video
☆10,292Jan 5, 2026Updated 4 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ByteDance-Seed / Seed1.5-VL
View on GitHub
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving stat…
☆1,574Jun 14, 2025Updated 11 months ago
Wan-Video / Wan2.1
View on GitHub
Wan: Open and Advanced Large-Scale Video Generative Models
☆16,088Mar 5, 2026Updated 2 months ago
dipampaul17 / KVSplit
View on GitHub
Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit …
☆363May 21, 2025Updated last year
OpenHands / OpenHands
View on GitHub
🙌 OpenHands: AI-Driven Development
☆73,913Updated this week
QwenLM / Qwen3-VL
View on GitHub
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
☆19,193Jan 30, 2026Updated 3 months ago
huggingface / smollm
View on GitHub
Everything about the SmolLM and SmolVLM family of models
☆3,777Apr 2, 2026Updated last month
deepseek-ai / FlashMLA
View on GitHub
FlashMLA: Efficient Multi-head Latent Attention Kernels
☆12,657Apr 30, 2026Updated 3 weeks ago