deepseek-ai / DeepSeek-Prover-V2
☆1,197 Updated 3 months ago
Alternatives and similar repositories for DeepSeek-Prover-V2
Users who are interested in DeepSeek-Prover-V2 are comparing it to the repositories listed below
- ☆1,320 Updated last month
- ☆540 Updated last year
- Technical report of Kimina-Prover Preview. ☆338 Updated 3 months ago
- ☆934 Updated 3 weeks ago
- ☆476 Updated 3 months ago
- An AI agent system for solving International Mathematical Olympiad (IMO) problems using Google's Gemini, OpenAI, and XAI APIs. ☆830 Updated 3 weeks ago
- ☆2,395 Updated last week
- Dream 7B, a large diffusion language model ☆1,034 Updated last month
- Democratizing Reinforcement Learning for LLMs ☆4,600 Updated this week
- OpenAI Frontier Evals ☆924 Updated last week
- [COLM 2025] LIMO: Less is More for Reasoning ☆1,038 Updated 3 months ago
- ☆299 Updated last month
- Humanity's Last Exam ☆1,151 Updated 3 weeks ago
- Understanding R1-Zero-Like Training: A Critical Perspective ☆1,132 Updated 2 months ago
- Muon is Scalable for LLM Training ☆1,342 Updated 2 months ago
- Post-training with Tinker ☆1,096 Updated last week
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. ☆2,932 Updated 3 months ago
- Self-Adapting Language Models ☆1,400 Updated 2 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model ☆838 Updated 2 weeks ago
- Continuous Thought Machines, because thought takes time and reasoning is a process. ☆1,371 Updated 2 weeks ago
- AlphaGo Moment for Model Architecture Discovery. ☆1,101 Updated 2 months ago
- MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining ☆1,602 Updated 4 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,638 Updated 6 months ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners ☆727 Updated 4 months ago
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,313 Updated 2 months ago
- A benchmark for LLMs on complicated tasks in the terminal ☆961 Updated this week
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation. ☆682 Updated last month
- Renderer for the harmony response format to be used with gpt-oss ☆3,926 Updated 2 months ago
- ☆817 Updated 4 months ago
- Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im… ☆2,762 Updated 3 weeks ago