deepseek-ai / DeepSeek-Prover-V2Links
☆1,225Updated 5 months ago
Alternatives and similar repositories for DeepSeek-Prover-V2
Users that are interested in DeepSeek-Prover-V2 are comparing it to the libraries listed below
Sorting:
- ☆1,490Updated 2 weeks ago
- ☆1,357Updated 3 months ago
- ☆547Updated last year
- ☆479Updated 5 months ago
- OpenAI Frontier Evals☆962Updated 2 weeks ago
- ☆1,367Updated last month
- Technical report of Kimina-Prover Preview.☆348Updated 5 months ago
- ☆324Updated 3 months ago
- An AI agent system for solving International Mathematical Olympiad (IMO) problems using Google's Gemini, OpenAI, and XAI APIs.☆884Updated 2 months ago
- Humanity's Last Exam☆1,267Updated 2 months ago
- ☆1,233Updated last month
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆745Updated last week
- Dream 7B, a large diffusion language model☆1,104Updated last month
- ☆477Updated last year
- ☆2,489Updated last month
- Muon is Scalable for LLM Training☆1,380Updated 4 months ago
- [COLM 2025] LIMO: Less is More for Reasoning☆1,056Updated 4 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model☆855Updated 2 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,712Updated 8 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,173Updated 3 months ago
- ☆220Updated 8 months ago
- Democratizing Reinforcement Learning for LLMs☆4,854Updated this week
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.☆3,002Updated 5 months ago
- Textbook on reinforcement learning from human feedback☆1,354Updated last week
- ☆247Updated 6 months ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆734Updated 6 months ago
- Code to automatically prove or verify estimates in analysis☆319Updated 5 months ago
- ☆586Updated 6 months ago
- A benchmark for LLMs on complicated tasks in the terminal☆1,235Updated this week
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆769Updated 2 months ago