NovaSky-AI / SkyRLLinks
SkyRL: A Modular Full-stack RL Library for LLMs
☆1,060Updated last week
Alternatives and similar repositories for SkyRL
Users that are interested in SkyRL are comparing it to the libraries listed below
Sorting:
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆553Updated 2 months ago
- slime is an LLM post-training framework for RL Scaling.☆2,232Updated this week
- ☆971Updated 3 months ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆607Updated 7 months ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,194Updated 2 weeks ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆539Updated this week
- A version of verl to support diverse tool use☆627Updated this week
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,126Updated last month
- Scalable toolkit for efficient model reinforcement☆956Updated this week
- A project to improve skills of large language models☆587Updated this week
- Post-training with Tinker☆1,096Updated this week
- Automatic evals for LLMs☆547Updated 3 months ago
- ☆843Updated last week
- A bibliography and survey of the papers surrounding o1☆1,209Updated 11 months ago
- ☆1,309Updated last month
- Recipes to scale inference-time compute of open models☆1,111Updated 5 months ago
- [COLM 2025] LIMO: Less is More for Reasoning☆1,037Updated 2 months ago
- Large Reasoning Models☆805Updated 10 months ago
- ☆239Updated last month
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆321Updated last week
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆673Updated 9 months ago
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning☆864Updated last month
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆775Updated this week
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆734Updated 2 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆266Updated last week
- OLMoE: Open Mixture-of-Experts Language Models☆888Updated last month
- Async RL Training at Scale☆722Updated this week
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆538Updated 2 weeks ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆554Updated 5 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆246Updated 5 months ago