deepseek-ai / DeepSeek-Prover-V2Links
☆1,228Updated 5 months ago
Alternatives and similar repositories for DeepSeek-Prover-V2
Users that are interested in DeepSeek-Prover-V2 are comparing it to the libraries listed below
Sorting:
- ☆1,515Updated last month
- ☆551Updated last year
- ☆1,405Updated last month
- ☆482Updated 5 months ago
- Technical report of Kimina-Prover Preview.☆350Updated 6 months ago
- ☆1,377Updated 3 months ago
- Muon is Scalable for LLM Training☆1,397Updated 5 months ago
- ☆1,257Updated last month
- An AI agent system for solving International Mathematical Olympiad (IMO) problems using Google's Gemini, OpenAI, and XAI APIs.☆899Updated 3 months ago
- ☆395Updated 2 weeks ago
- OpenAI Frontier Evals☆971Updated last month
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,181Updated 4 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,729Updated 8 months ago
- Humanity's Last Exam☆1,294Updated 3 months ago
- [COLM 2025] LIMO: Less is More for Reasoning☆1,059Updated 5 months ago
- ☆478Updated last year
- Pretraining and inference code for a large-scale depth-recurrent language model☆857Updated last week
- Dream 7B, a large diffusion language model☆1,134Updated last month
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆737Updated 7 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆934Updated 7 months ago
- ☆600Updated 7 months ago
- MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining☆1,896Updated 7 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆773Updated this week
- A collection of formalized statements of conjectures in Lean.☆752Updated this week
- ☆224Updated 9 months ago
- Post-training with Tinker☆2,699Updated this week
- Training Large Language Model to Reason in a Continuous Latent Space☆1,430Updated 4 months ago
- Self-Adapting Language Models☆1,637Updated 5 months ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆792Updated 2 weeks ago
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention☆3,290Updated 6 months ago