NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
★1,547 · Updated this week
Alternatives and similar repositories for SkyRL
Users who are interested in SkyRL are comparing it to the libraries listed below.
- Scalable toolkit for efficient model reinforcement ★1,293 · Updated this week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc. ★623 · Updated last week
- ★1,084 · Updated 3 weeks ago
- slime is an LLM post-training framework for RL Scaling. ★3,668 · Updated this week
- Async RL Training at Scale ★1,044 · Updated this week
- Understanding R1-Zero-Like Training: A Critical Perspective ★1,203 · Updated 5 months ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards ★1,326 · Updated 3 weeks ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ★673 · Updated 10 months ago
- ★1,385 · Updated 4 months ago
- Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime. ★830 · Updated this week
- ★957 · Updated 3 months ago
- A version of verl to support diverse tool use ★860 · Updated last month
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] ★625 · Updated 6 months ago
- Recipes to scale inference-time compute of open models ★1,124 · Updated 8 months ago
- A bibliography and survey of the papers surrounding o1 ★1,213 · Updated last year
- [COLM 2025] LIMO: Less is More for Reasoning ★1,061 · Updated 6 months ago
- A project to improve skills of large language models ★804 · Updated last week
- An Open-source RL System from ByteDance Seed and Tsinghua AIR ★1,715 · Updated 8 months ago
- Automatic evals for LLMs ★578 · Updated last month
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines ★902 · Updated this week
- PyTorch building blocks for the OLMo ecosystem ★763 · Updated last week
- Training Large Language Model to Reason in a Continuous Latent Space ★1,491 · Updated 5 months ago
- A Gym for Agentic LLMs ★439 · Updated 2 weeks ago
- ★970 · Updated last year
- Large Reasoning Models ★807 · Updated last year
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning ★972 · Updated 4 months ago
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat… ★418 · Updated 2 weeks ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs" ★589 · Updated 4 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei… ★1,314 · Updated 8 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ★2,503 · Updated 2 weeks ago