BytedTsinghua-SIA/DAPO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/BytedTsinghua-SIA/DAPO)

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

☆1,750

Alternatives and similar repositories for DAPO

Users that are interested in DAPO are comparing it to the libraries listed below

Sorting:

verl-project / verl
View on GitHub
verl: Volcano Engine Reinforcement Learning for LLMs
☆19,519Updated this week
sail-sg / understand-r1-zero
View on GitHub
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,222Aug 27, 2025Updated 6 months ago
hkust-nlp / simpleRL-reason
View on GitHub
Simple RL training for reasoning
☆3,830Dec 23, 2025Updated 2 months ago
Open-Reasoner-Zero / Open-Reasoner-Zero
View on GitHub
Official Repo for Open-Reasoner-Zero
☆2,084Jun 2, 2025Updated 9 months ago
OpenRLHF / OpenRLHF
View on GitHub
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
☆9,084Updated this week
ByteDance-Seed / Seed-Thinking-v1.5
View on GitHub
☆813Jun 9, 2025Updated 8 months ago
hiyouga / EasyR1
View on GitHub
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
☆4,681Feb 26, 2026Updated last week
rllm-org / rllm
View on GitHub
Democratizing Reinforcement Learning for LLMs
☆5,196Updated this week
PeterGriffinJin / Search-R1
View on GitHub
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆4,135Nov 13, 2025Updated 3 months ago
PRIME-RL / PRIME
View on GitHub
Scalable RL solution for advanced reasoning of language models
☆1,811Mar 18, 2025Updated 11 months ago
StarsfieldAI / R1-V
View on GitHub
Witness the aha moment of VLM with less than $3.
☆4,036May 19, 2025Updated 9 months ago
EvolvingLMMs-Lab / open-r1-multimodal
View on GitHub
A fork to add multimodal model training to open-r1
☆1,496Feb 8, 2025Updated last year
inclusionAI / AReaL
View on GitHub
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
☆4,441Updated this week
huggingface / open-r1
View on GitHub
Fully open reproduction of DeepSeek-R1
☆25,927Nov 24, 2025Updated 3 months ago
mll-lab-nu / RAGEN
View on GitHub
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,527Feb 27, 2026Updated last week
Unakar / Logic-RL
View on GitHub
Reproduce R1 Zero on Logic Puzzle
☆2,440Mar 20, 2025Updated 11 months ago
Qihoo360 / Light-R1
View on GitHub
☆762Dec 23, 2025Updated 2 months ago
Jiayi-Pan / TinyZero
View on GitHub
Minimal reproduction of DeepSeek R1-Zero
☆12,896Feb 27, 2026Updated last week
huggingface / trl
View on GitHub
Train transformer language models with reinforcement learning.
☆17,523Updated this week
ModalMinds / MM-EUREKA
View on GitHub
MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning
☆771Sep 7, 2025Updated 6 months ago
GAIR-NLP / O1-Journey
View on GitHub
O1 Replication Journey
☆2,000Jan 14, 2025Updated last year
Agent-RL / ReCall
View on GitHub
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…
☆1,338May 16, 2025Updated 9 months ago
huggingface / Math-Verify
View on GitHub
☆1,107Jan 10, 2026Updated last month
om-ai-lab / VLM-R1
View on GitHub
Solve Visual Understanding with Reinforced VLMs
☆5,855Oct 21, 2025Updated 4 months ago
PRIME-RL / TTRL
View on GitHub
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
☆1,002Feb 23, 2026Updated last week
GAIR-NLP / ToRL
View on GitHub
☆335May 24, 2025Updated 9 months ago
RUCAIBox / Slow_Thinking_with_LLMs
View on GitHub
A series of technical report on Slow Thinking with LLM
☆761Aug 13, 2025Updated 6 months ago
GAIR-NLP / LIMR
View on GitHub
☆216Feb 20, 2025Updated last year
simplescaling / s1
View on GitHub
s1: Simple test-time scaling
☆6,642Jun 25, 2025Updated 8 months ago
RUCAIBox / R1-Searcher
View on GitHub
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
☆693Aug 5, 2025Updated 7 months ago
ByteDance-Seed / Seed1.5-VL
View on GitHub
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving stat…
☆1,551Jun 14, 2025Updated 8 months ago
hijkzzz / Awesome-LLM-Strawberry
View on GitHub
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
☆6,896Dec 17, 2025Updated 2 months ago
AgentR1 / Agent-R1
View on GitHub
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
☆1,269Updated this week
Liuziyu77 / Visual-RFT
View on GitHub
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’
☆2,308Oct 29, 2025Updated 4 months ago
GAIR-NLP / LIMO
View on GitHub
[COLM 2025] LIMO: Less is More for Reasoning
☆1,064Jul 30, 2025Updated 7 months ago
MoonshotAI / Kimi-VL
View on GitHub
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
☆1,164Jul 15, 2025Updated 7 months ago
openreasoner / openr
View on GitHub
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
☆1,835Jan 17, 2025Updated last year
SkyworkAI / Skywork-OR1
View on GitHub
Unleashing the Power of Reinforcement Learning for Math and Code Reasoners
☆742Jun 6, 2025Updated 9 months ago
turningpoint-ai / VisualThinker-R1-Zero
View on GitHub
Explore the Multimodal “Aha Moment” on 2B Model
☆623Mar 18, 2025Updated 11 months ago