deepseek-ai / DeepSeek-Prover-V2Links
☆1,153Updated last month
Alternatives and similar repositories for DeepSeek-Prover-V2
Users that are interested in DeepSeek-Prover-V2 are comparing it to the libraries listed below
Sorting:
- ☆525Updated 10 months ago
- Dream 7B, a large diffusion language model☆774Updated last week
- MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining☆1,470Updated 3 weeks ago
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.☆2,118Updated last week
- ☆2,023Updated 2 weeks ago
- Technical report of Kimina-Prover Preview.☆291Updated last month
- LIMO: Less is More for Reasoning☆963Updated 2 months ago
- Muon is Scalable for LLM Training☆1,081Updated 2 months ago
- MoBA: Mixture of Block Attention for Long-Context LLMs☆1,803Updated 2 months ago
- Official PyTorch implementation for "Large Language Diffusion Models"☆2,378Updated last week
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,106Updated 4 months ago
- Code for BLT research paper☆1,686Updated last month
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents☆1,725Updated 3 weeks ago
- Continuous Thought Machines, because thought takes time and reasoning is a process.☆1,026Updated 3 weeks ago
- Democratizing Reinforcement Learning for LLMs☆3,396Updated last month
- Understanding R1-Zero-Like Training: A Critical Perspective☆991Updated last month
- ☆792Updated 2 weeks ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆635Updated 2 weeks ago
- Scalable RL solution for advanced reasoning of language models☆1,622Updated 3 months ago
- Open-source implementation of AlphaEvolve☆2,676Updated this week
- Pretraining code for a large-scale depth-recurrent language model☆783Updated 2 weeks ago
- MMaDA - Open-Sourced Multimodal Large Diffusion Language Models☆1,136Updated last week
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,438Updated 2 months ago
- Sky-T1: Train your own O1 preview model within $450☆3,272Updated last month
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,016Updated 3 weeks ago
- Training Large Language Model to Reason in a Continuous Latent Space☆1,162Updated 5 months ago
- ☆3,374Updated 3 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆2,656Updated this week
- Official Repo for Open-Reasoner-Zero☆1,969Updated 3 weeks ago
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention☆2,891Updated last week