QwenLM / Qwen2.5-Math
A series of math-specific large language models of our Qwen2 series.
☆960 · Updated 5 months ago
Alternatives and similar repositories for Qwen2.5-Math
Users interested in Qwen2.5-Math are comparing it to the repositories listed below.
- Scalable RL solution for advanced reasoning of language models ☆1,642 · Updated 3 months ago
- An Open Large Reasoning Model for Real-World Solutions ☆1,502 · Updated last month
- ☆529 · Updated 10 months ago
- ☆1,356 · Updated 7 months ago
- Large Reasoning Models ☆805 · Updated 7 months ago
- ☆796 · Updated last month
- ☆808 · Updated last week
- ☆580 · Updated 2 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective ☆1,012 · Updated last week
- Official Repo for Open-Reasoner-Zero ☆1,983 · Updated last month
- O1 Replication Journey ☆1,991 · Updated 5 months ago
- State-of-the-art bilingual open-sourced Math reasoning LLMs. ☆515 · Updated 8 months ago
- DataComp for Language Models ☆1,322 · Updated 3 months ago
- LIMO: Less is More for Reasoning ☆975 · Updated 3 months ago
- ☆725 · Updated last month
- ☆943 · Updated 5 months ago
- ReasonFlux Series - Open-source innovative LLM post-training algorithms focusing on data selection, reinforcement learning, and inference… ☆442 · Updated this week
- [NeurIPS D&B 2024] Generative AI for Math: MathPile ☆414 · Updated 3 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆419 · Updated last month
- Muon is Scalable for LLM Training ☆1,091 · Updated 3 months ago
- ☆1,083 · Updated last year
- Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models". ☆619 · Updated 2 weeks ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024) ☆643 · Updated 5 months ago
- A series of technical reports on Slow Thinking with LLM ☆706 · Updated last month
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling". ☆267 · Updated 4 months ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR ☆1,406 · Updated last month
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward ☆904 · Updated 4 months ago
- Pretraining code for a large-scale depth-recurrent language model ☆788 · Updated 3 weeks ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code" ☆582 · Updated 2 weeks ago
- A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior. ☆244 · Updated 2 months ago