MiniMax-AI / MiniMax-01Links

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

☆3,247

Alternatives and similar repositories for MiniMax-01

Users that are interested in MiniMax-01 are comparing it to the libraries listed below

Sorting:

MoonshotAI / MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
☆1,981Updated 7 months ago
MoonshotAI / Kimi-k1.5
☆3,469Updated 8 months ago
QwenLM / Qwen2.5-Omni
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe…
☆3,787Updated 5 months ago
ByteDance-Seed / Bagel
Open-source unified multimodal model
☆5,282Updated 3 weeks ago
MiniMax-AI / MiniMax-M1
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
☆2,992Updated 4 months ago
stepfun-ai / Step-Video-T2V
☆3,130Updated 8 months ago
Tencent-Hunyuan / Tencent-Hunyuan-Large
☆1,585Updated 11 months ago
SandAI-org / MAGI-1
MAGI-1: Autoregressive Video Generation at Scale
☆3,546Updated 5 months ago
AIDC-AI / Marco-o1
An Open Large Reasoning Model for Real-World Solutions
☆1,527Updated 5 months ago
MoonshotAI / Moonlight
Muon is Scalable for LLM Training
☆1,359Updated 3 months ago
zai-org / GLM-V
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
☆1,744Updated 3 weeks ago
rllm-org / rllm
Democratizing Reinforcement Learning for LLMs
☆4,737Updated this week
open-thoughts / open-thoughts
Fully open data curation for reasoning models
☆2,144Updated 2 months ago
QwenLM / Qwen3-Omni
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im…
☆2,921Updated last month
meituan-longcat / LongCat-Flash-Chat
☆1,221Updated last week
Alibaba-NLP / ZeroSearch
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
☆1,194Updated 3 months ago
ByteDance-Seed / Seed1.5-VL
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving stat…
☆1,492Updated 5 months ago
zai-org / GLM-4.5
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
☆3,181Updated last month
XiaomiMiMo / MiMo
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
☆1,619Updated 5 months ago
NovaSky-AI / SkyThought
Sky-T1: Train your own O1 preview model within $450
☆3,352Updated 4 months ago
ML-GSAI / LLaDA
Official PyTorch implementation for "Large Language Diffusion Models"
☆3,263Updated last week
deepseek-ai / DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
☆1,833Updated last year
deepseek-ai / DualPipe
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
☆2,879Updated 8 months ago
MoonshotAI / Kimi-VL
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
☆1,102Updated 4 months ago
OpenManus / OpenManus-RL
A live stream development of RL tunning for LLM agents
☆3,610Updated last month
stepfun-ai / Step-Audio
☆4,555Updated 5 months ago
simplescaling / s1
s1: Simple test-time scaling
☆6,597Updated 4 months ago
ByteDance-Seed / Seed-Coder
Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.
☆639Updated 5 months ago
ByteDance-Seed / Seed-Thinking-v1.5
☆817Updated 5 months ago
QwenLM / Qwen3-Embedding
☆1,585Updated last month