XiaomiMiMo / MiMo
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
☆1,332 · Updated this week
Alternatives and similar repositories for MiMo
Users who are interested in MiMo are comparing it to the libraries listed below.
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners ☆566 · Updated this week
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention ☆2,599 · Updated this week
- Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe… ☆2,913 · Updated this week
- ☆746 · Updated 3 weeks ago
- ☆1,510 · Updated 5 months ago
- Muon is Scalable for LLM Training ☆1,044 · Updated last month
- ☆1,078 · Updated 2 weeks ago
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents ☆1,650 · Updated last week
- Understanding R1-Zero-Like Training: A Critical Perspective ☆925 · Updated last month
- MoBA: Mixture of Block Attention for Long-Context LLMs ☆1,776 · Updated last month
- Dream 7B, a large diffusion language model ☆630 · Updated 2 weeks ago
- Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities ☆847 · Updated 3 weeks ago
- Official Repo for Open-Reasoner-Zero ☆1,916 · Updated last month
- LIMO: Less is More for Reasoning ☆940 · Updated last month
- Analyze computation-communication overlap in V3/R1. ☆1,027 · Updated last month
- A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training. ☆2,763 · Updated 2 months ago
- ☆875 · Updated last month
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆400 · Updated this week
- ☆3,335 · Updated 2 months ago
- An open-sourced end-to-end VLM-based GUI Agent ☆939 · Updated last month
- Expert Parallelism Load Balancer ☆1,180 · Updated last month
- An Open-source RL System from ByteDance Seed and Tsinghua AIR ☆1,234 · Updated last week
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching ☆788 · Updated this week
- Democratizing Reinforcement Learning for LLMs ☆3,236 · Updated this week
- Code release for "LLMs can see and hear without any training" ☆432 · Updated last week
- ☆691 · Updated last month
- Scalable RL solution for advanced reasoning of language models ☆1,552 · Updated 2 months ago
- The Open Cookbook for Top-Tier Code Large Language Model ☆1,693 · Updated 5 months ago
- Distributed RL System for LLM Reasoning ☆1,248 · Updated 3 weeks ago
- Fully open data curation for reasoning models ☆1,772 · Updated last week