NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆1,394 Updated this week
Alternatives and similar repositories for SkyRL
Users who are interested in SkyRL are comparing it to the libraries listed below.
- slime is an LLM post-training framework for RL Scaling. ☆2,911 Updated last week
- Scalable toolkit for efficient model reinforcement ☆1,141 Updated last week
- ☆941 Updated last month
- ☆1,045 Updated 5 months ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆637 Updated 9 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc. ☆584 Updated this week
- Understanding R1-Zero-Like Training: A Critical Perspective ☆1,177 Updated 3 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] ☆607 Updated 4 months ago
- Async RL Training at Scale ☆950 Updated this week
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards ☆1,283 Updated last week
- A version of verl to support diverse tool use ☆766 Updated 2 weeks ago
- A project to improve skills of large language models ☆715 Updated this week
- ☆1,369 Updated 3 months ago
- A bibliography and survey of the papers surrounding o1 ☆1,214 Updated last year
- Recipes to scale inference-time compute of open models ☆1,120 Updated 7 months ago
- Automatic evals for LLMs ☆569 Updated this week
- [COLM 2025] LIMO: Less is More for Reasoning ☆1,056 Updated 4 months ago
- A Gym for Agentic LLMs ☆409 Updated this week
- Large Reasoning Models ☆806 Updated last year
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat… ☆405 Updated last month
- ☆610 Updated last week
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines ☆871 Updated this week
- ☆299 Updated 3 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆2,448 Updated 3 weeks ago
- OLMoE: Open Mixture-of-Experts Language Models ☆930 Updated 3 months ago
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆581 Updated 4 months ago
- Official Repo for Open-Reasoner-Zero ☆2,084 Updated 6 months ago
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,411 Updated 4 months ago
- ☆969 Updated 11 months ago
- PyTorch building blocks for the OLMo ecosystem ☆612 Updated this week