XiaomiMiMo/MiMo

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

☆2,287

Alternatives and similar repositories for MiMo

Users who are interested in MiMo are comparing it to the repositories listed below.

XiaomiMiMo / MiMo-V2-Flash
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model
☆1,362 · Jan 8, 2026 · Updated 6 months ago
XiaomiMiMo / MiMo-VL
MiMo-VL
☆642 · Aug 21, 2025 · Updated 11 months ago
XiaomiMiMo / MiMo-Audio
MiMo-Audio: Audio Language Models are Few-Shot Learners
☆1,069 · Jun 17, 2026 · Updated last month
ByteDance-Seed / Seed-Thinking-v1.5
☆811 · Jun 9, 2025 · Updated last year
MiniMax-AI / MiniMax-M1
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
☆3,161 · Jul 7, 2025 · Updated last year
ByteDance-Seed / Seed1.5-VL
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving stat…
☆1,583 · Jun 14, 2025 · Updated last year
XiaomiMiMo / MiMo-Audio-Tokenizer
A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.
☆145 · Sep 19, 2025 · Updated 10 months ago
QwenLM / Qwen3
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
☆27,426 · Jan 9, 2026 · Updated 6 months ago
QwenLM / Qwen3-Omni
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im…
☆3,906 · Apr 23, 2026 · Updated 3 months ago
verl-project / verl
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,667 · Updated this week
XiaomiMiMo / MiMo-Audio-Eval
☆88 · Jun 17, 2026 · Updated last month
QwenLM / Qwen3-VL
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
☆19,667 · Jan 30, 2026 · Updated 5 months ago
rllm-org / rllm
Democratizing Reinforcement Learning for LLMs
☆5,732 · Updated this week
XiaomiMiMo / MiMo-Code
MiMo Code: Where Models and Agents Co-Evolve
☆12,465 · Updated this week
MoonshotAI / Kimi-VL
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
☆1,210 · Jul 15, 2025 · Updated last year
deepseek-ai / DeepSeek-Prover-V2
☆1,285 · Jul 18, 2025 · Updated last year
MoonshotAI / Kimi-K2
Kimi K2 is the large language model series developed by Moonshot AI team
☆11,036 · Jan 21, 2026 · Updated 6 months ago
BytedTsinghua-SIA / DAPO
An Open-source RL System from ByteDance Seed and Tsinghua AIR
☆1,848 · May 11, 2025 · Updated last year
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189 · Jul 23, 2025 · Updated last year
QwenLM / Qwen2.5-Omni
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe…
☆4,052 · Jun 12, 2025 · Updated last year
hkust-nlp / simpleRL-reason
Simple RL training for reasoning
☆3,870 · Dec 23, 2025 · Updated 7 months ago
ByteDance-Seed / Bagel
Open-source unified multimodal model
☆6,121 · May 4, 2026 · Updated 2 months ago
sgl-project / sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
☆30,756 · Updated this week
MoonshotAI / Kimi-Linear
☆1,498 · Nov 17, 2025 · Updated 8 months ago
THUDM / slime
slime is an LLM post-training framework for RL Scaling.
☆7,645 · Updated this week
MiniMax-AI / MiniMax-01
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
☆3,446 · Jul 7, 2025 · Updated last year
sail-sg / understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,268 · Aug 27, 2025 · Updated 10 months ago
deepseek-ai / FlashMLA
FlashMLA: Efficient Multi-head Latent Attention Kernels
☆12,779 · Apr 30, 2026 · Updated 2 months ago
deepseek-ai / open-infra-index
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
☆8,032 · May 15, 2025 · Updated last year
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆87,210 · Updated this week
MoonshotAI / Moonlight
Muon is Scalable for LLM Training
☆1,514 · Aug 3, 2025 · Updated 11 months ago
ByteDance-Seed / VeOmni
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
☆2,107 · Updated this week
deepseek-ai / DeepSeek-V3.2-Exp
☆1,621 · Nov 18, 2025 · Updated 8 months ago
ByteDance-Seed / Seed-Coder
Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.
☆754 · Jun 6, 2025 · Updated last year
areal-project / AReaL
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
☆5,604 · Updated this week
deepseek-ai / DeepGEMM
DeepGEMM: clean and efficient BLAS kernel library on GPU
☆7,565 · Updated this week
meituan-longcat / LongCat-Flash-Omni
This is the official repo for the paper "LongCat-Flash-Omni Technical Report"
☆501 · May 9, 2026 · Updated 2 months ago
ByteDance-Seed / Seed-1.8
☆219 · Dec 19, 2025 · Updated 7 months ago
huggingface / open-r1
Fully open reproduction of DeepSeek-R1
☆26,415 · Apr 2, 2026 · Updated 3 months ago