deepseek-ai / DeepSeek-Prover-V2
☆1,120 Updated last month
Alternatives and similar repositories for DeepSeek-Prover-V2
Users interested in DeepSeek-Prover-V2 also compare it to the repositories listed below.
- Democratizing Reinforcement Learning for LLMs ☆3,306 Updated 3 weeks ago
- Official PyTorch implementation for "Large Language Diffusion Models" ☆2,036 Updated last week
- LIMO: Less is More for Reasoning ☆953 Updated 2 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,390 Updated last month
- ☆1,642 Updated last week
- Dream 7B, a large diffusion language model ☆703 Updated last month
- Muon is Scalable for LLM Training ☆1,052 Updated 2 months ago
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents ☆349 Updated last week
- Pretraining code for a large-scale depth-recurrent language model ☆770 Updated last week
- Understanding R1-Zero-Like Training: A Critical Perspective ☆956 Updated last week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆2,453 Updated last week
- Technical report of Kimina-Prover Preview. ☆285 Updated 3 weeks ago
- ☆522 Updated 9 months ago
- A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training. ☆2,792 Updated 2 months ago
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention ☆2,751 Updated 3 weeks ago
- MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining ☆1,398 Updated last week
- Atom of Thoughts for Markov LLM Test-Time Scaling ☆567 Updated last week
- Fully open data curation for reasoning models ☆1,796 Updated 2 weeks ago
- Code for BLT research paper ☆1,675 Updated 2 weeks ago
- ☆773 Updated last month
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners ☆607 Updated this week
- An Open Large Reasoning Model for Real-World Solutions ☆1,496 Updated last week
- Releases from OpenAI Preparedness ☆761 Updated last week
- MoBA: Mixture of Block Attention for Long-Context LLMs ☆1,781 Updated 2 months ago
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,135 Updated 4 months ago
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A… ☆763 Updated this week
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆1,908 Updated last week
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,066 Updated 4 months ago
- Analyze computation-communication overlap in V3/R1. ☆1,046 Updated 2 months ago
- Simple RL training for reasoning ☆3,601 Updated last month