deepseek-ai / DeepSeek-Prover-V2
☆1,228 Updated 5 months ago
Alternatives and similar repositories for DeepSeek-Prover-V2
Users who are interested in DeepSeek-Prover-V2 are comparing it to the repositories listed below
- ☆1,515 Updated last month
- ☆551 Updated last year
- ☆1,377 Updated 3 months ago
- ☆482 Updated 5 months ago
- ☆1,405 Updated last month
- OpenAI Frontier Evals ☆974 Updated last month
- Technical report of Kimina-Prover Preview. ☆350 Updated 6 months ago
- Muon is Scalable for LLM Training ☆1,397 Updated 5 months ago
- ☆2,529 Updated this week
- ☆395 Updated 3 weeks ago
- An AI agent system for solving International Mathematical Olympiad (IMO) problems using Google's Gemini, OpenAI, and XAI APIs. ☆899 Updated 3 months ago
- Dream 7B, a large diffusion language model ☆1,139 Updated last month
- Democratizing Reinforcement Learning for LLMs ☆4,942 Updated last week
- Humanity's Last Exam ☆1,294 Updated 3 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective ☆1,181 Updated 4 months ago
- [COLM 2025] LIMO: Less is More for Reasoning ☆1,059 Updated 5 months ago
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,430 Updated 4 months ago
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. ☆3,032 Updated 6 months ago
- Renderer for the harmony response format to be used with gpt-oss ☆4,124 Updated 3 weeks ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards ☆1,296 Updated 3 weeks ago
- Pretraining and inference code for a large-scale depth-recurrent language model ☆857 Updated last week
- MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining ☆1,896 Updated 7 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,729 Updated 8 months ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners ☆737 Updated 7 months ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆722 Updated 7 months ago
- ☆1,257 Updated last month
- ☆478 Updated last year
- Textbook on reinforcement learning from human feedback ☆1,396 Updated this week
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention ☆3,290 Updated 6 months ago
- Async RL Training at Scale ☆985 Updated this week