NovaSky-AI / SkyRLLinks
SkyRL: A Modular Full-stack RL Library for LLMs
☆950Updated this week
Alternatives and similar repositories for SkyRL
Users that are interested in SkyRL are comparing it to the libraries listed below
Sorting:
- slime is an LLM post-training framework for RL Scaling.☆2,023Updated this week
- ☆948Updated 3 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆472Updated 2 weeks ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆547Updated 2 months ago
- A version of verl to support diverse tool use☆570Updated this week
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,100Updated last month
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆602Updated 6 months ago
- Scalable toolkit for efficient model reinforcement☆910Updated this week
- A project to improve skills of large language models☆568Updated this week
- Automatic evals for LLMs☆533Updated 3 months ago
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning☆823Updated last week
- ☆1,273Updated 3 weeks ago
- ☆773Updated 3 weeks ago
- Large Reasoning Models☆804Updated 10 months ago
- [COLM 2025] LIMO: Less is More for Reasoning☆1,022Updated 2 months ago
- Recipes to scale inference-time compute of open models☆1,109Updated 4 months ago
- ☆963Updated 8 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆245Updated 5 months ago
- ☆202Updated 2 weeks ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆261Updated 4 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆670Updated 8 months ago
- A series of technical report on Slow Thinking with LLM☆739Updated last month
- ☆209Updated last month
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,161Updated last week
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆319Updated 5 months ago
- A bibliography and survey of the papers surrounding o1☆1,208Updated 10 months ago
- RewardBench: the first evaluation tool for reward models.☆640Updated 3 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆440Updated last week
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,560Updated 4 months ago
- ☆318Updated 4 months ago