NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆1,170 Updated this week
Alternatives and similar repositories for SkyRL
Users who are interested in SkyRL are comparing it to the libraries listed below.
- slime is an LLM post-training framework for RL Scaling. ☆2,407 Updated last week
- ☆995 Updated 4 months ago
- Scalable toolkit for efficient model reinforcement ☆1,024 Updated this week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc. ☆564 Updated 2 weeks ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆616 Updated 7 months ago
- A version of verl to support diverse tool use ☆668 Updated last week
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] ☆573 Updated 3 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective ☆1,148 Updated 2 months ago
- ☆1,335 Updated 2 months ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards ☆1,214 Updated last month
- ☆894 Updated last week
- [COLM 2025] LIMO: Less is More for Reasoning ☆1,045 Updated 3 months ago
- Async RL Training at Scale ☆749 Updated last week
- Automatic evals for LLMs ☆556 Updated 4 months ago
- A project to improve skills of large language models ☆608 Updated this week
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines ☆820 Updated this week
- Large Reasoning Models ☆806 Updated 11 months ago
- Recipes to scale inference-time compute of open models ☆1,117 Updated 5 months ago
- ☆963 Updated 9 months ago
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat… ☆348 Updated this week
- ☆231 Updated 3 months ago
- A bibliography and survey of the papers surrounding o1 ☆1,209 Updated 11 months ago
- OLMoE: Open Mixture-of-Experts Language Models ☆901 Updated last month
- ☆263 Updated last month
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024) ☆678 Updated 9 months ago
- Scalable toolkit for efficient model alignment ☆844 Updated last month
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks" ☆249 Updated 6 months ago
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆568 Updated 3 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆2,390 Updated this week
- A Gym for Agentic LLMs ☆352 Updated this week