deepseek-ai/DeepSeek-Math-V2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/deepseek-ai/DeepSeek-Math-V2)

deepseek-ai / DeepSeek-Math-V2

☆1,591

Alternatives and similar repositories for DeepSeek-Math-V2

Users that are interested in DeepSeek-Math-V2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Mercor-Intelligence / apex-swe
View on GitHub
☆80Jun 25, 2026Updated 2 weeks ago
deepseek-ai / DeepSeek-V3.2-Exp
View on GitHub
☆1,611Nov 18, 2025Updated 7 months ago
ByteDance-Seed / Seed-Prover
View on GitHub
☆435Feb 13, 2026Updated 4 months ago
sjtu-sai-agents / Browse-Master
View on GitHub
Official implementation of Browse-Master, a tool-augmented web-search agent.
☆35Aug 22, 2025Updated 10 months ago
deepseek-ai / DeepSeek-Prover-V1.5
View on GitHub
☆578Aug 16, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
verl-project / verl
View on GitHub
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,312Updated this week
Purewhite2019 / rethinking_autoformalization
View on GitHub
[ICLR'25 Spotlight] Rethinking and improving autoformalization: towards a faithful metric and a Dependency Retrieval-based approach
☆30May 20, 2025Updated last year
RobertLuo1 / CoHD
View on GitHub
The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation
☆27Aug 17, 2025Updated 10 months ago
google-deepmind / superhuman
View on GitHub
☆772Jun 5, 2026Updated last month
sail-sg / understand-r1-zero
View on GitHub
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,265Aug 27, 2025Updated 10 months ago
multimodal-art-projection / CriticLean
View on GitHub
☆49Aug 5, 2025Updated 11 months ago
MoonshotAI / Kimina-Prover-Preview
View on GitHub
Technical report of Kimina-Prover Preview.
☆370Jul 10, 2025Updated 11 months ago
deepseek-ai / DeepSeek-Prover-V2
View on GitHub
☆1,281Jul 18, 2025Updated 11 months ago
Yifei-Zuo / FlashLLA
View on GitHub
Official repository Flash Local Linear Attention
☆37May 28, 2026Updated last month
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
MiniMax-AI / MiniMax-01
View on GitHub
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
☆3,445Jul 7, 2025Updated last year
ai4sci-research / ai4sci-research.github.io
View on GitHub
☆17Jun 3, 2024Updated 2 years ago
deepseek-ai / DeepSeek-Math
View on GitHub
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
☆3,350Apr 15, 2024Updated 2 years ago
ByteDance-Seed / Stable-DiffCoder
View on GitHub
Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …
☆83Mar 9, 2026Updated 4 months ago
Goedel-LM / Goedel-Prover
View on GitHub
☆236Apr 4, 2025Updated last year
Open-Reasoner-Zero / Open-Reasoner-Zero
View on GitHub
Official Repo for Open-Reasoner-Zero
☆2,096Jun 2, 2025Updated last year
ByteDance-Seed / Seed-Thinking-v1.5
View on GitHub
☆810Jun 9, 2025Updated last year
areal-project / AReaL
View on GitHub
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
☆5,469Updated this week
zjowowen / GenerativeRL_Preview
View on GitHub
Python library for solving reinforcement learning (RL) problems using generative models.
☆11Feb 18, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Gen-Verse / Open-AgentRL
View on GitHub
RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios
☆571Jun 12, 2026Updated 3 weeks ago
deepseek-ai / DeepSeek-OCR
View on GitHub
Contexts Optical Compression
☆23,507Jan 27, 2026Updated 5 months ago
microsoft / ToRA
View on GitHub
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…
☆1,118Feb 22, 2024Updated 2 years ago
deepseek-ai / DeepGEMM
View on GitHub
DeepGEMM: clean and efficient BLAS kernel library on GPU
☆7,478Jun 29, 2026Updated last week
deepseek-ai / Engram
View on GitHub
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
☆4,494Jan 14, 2026Updated 5 months ago
trishullab / PutnamBench
View on GitHub
An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language.
☆247Jul 2, 2026Updated last week
huggingface / open-r1
View on GitHub
Fully open reproduction of DeepSeek-R1
☆26,381Apr 2, 2026Updated 3 months ago
zhuyjan / MER2025-MRAC25
View on GitHub
[ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.
☆25Nov 25, 2025Updated 7 months ago
simplescaling / s1
View on GitHub
s1: Simple test-time scaling
☆6,658Jun 25, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
fla-org / native-sparse-attention
View on GitHub
🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
☆1,008Feb 5, 2026Updated 5 months ago
inclusionAI / Ring-V2
View on GitHub
Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.
☆98Oct 23, 2025Updated 8 months ago
facebookresearch / coconut
View on GitHub
Training Large Language Model to Reason in a Continuous Latent Space
☆1,652Jul 2, 2026Updated last week
Leey21 / CipherBank
View on GitHub
☆13Jun 13, 2025Updated last year
deepseek-ai / DualPipe
View on GitHub
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
☆2,977Jan 14, 2026Updated 5 months ago
deepseek-ai / FlashMLA
View on GitHub
FlashMLA: Efficient Multi-head Latent Attention Kernels
☆12,733Apr 30, 2026Updated 2 months ago
Jiahao004 / DeepTheorem
View on GitHub
☆26Jun 10, 2025Updated last year