NovaSky-AI / SkyRLLinks
SkyRL: A Modular Full-stack RL Library for LLMs
☆818Updated last week
Alternatives and similar repositories for SkyRL
Users that are interested in SkyRL are comparing it to the libraries listed below
Sorting:
- slime is a LLM post-training framework for RL Scaling.☆1,747Updated this week
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆595Updated 5 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆541Updated last month
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆459Updated 3 weeks ago
- ☆921Updated 2 months ago
- A version of verl to support diverse tool use☆474Updated last week
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,076Updated 2 weeks ago
- Scalable toolkit for efficient model reinforcement☆857Updated this week
- A project to improve skills of large language models☆553Updated this week
- Large Reasoning Models☆805Updated 9 months ago
- procedural reasoning datasets☆1,092Updated last week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆243Updated 4 months ago
- Automatic evals for LLMs☆524Updated 2 months ago
- official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”☆357Updated last week
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆257Updated 4 months ago
- ☆1,122Updated last week
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆313Updated 4 months ago
- A series of technical report on Slow Thinking with LLM☆729Updated last month
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆443Updated 3 months ago
- ☆315Updated 3 months ago
- ☆205Updated last month
- ☆958Updated 7 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆649Updated last month
- TTRL: Test-Time Reinforcement Learning☆794Updated 3 weeks ago
- [COLM 2025] LIMO: Less is More for Reasoning☆1,015Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆251Updated 4 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆516Updated this week
- Recipes to scale inference-time compute of open models☆1,111Updated 3 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆663Updated 7 months ago
- Decentralized RL Training at Scale☆569Updated last week