NovaSky-AI / SkyRLLinks
SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning
☆343Updated last week
Alternatives and similar repositories for SkyRL
Users that are interested in SkyRL are comparing it to the libraries listed below
Sorting:
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆207Updated 3 weeks ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆213Updated 2 weeks ago
- ☆145Updated last week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆367Updated last week
- ☆293Updated this week
- Scalable toolkit for efficient model reinforcement☆361Updated this week
- ☆198Updated last week
- Repo of paper "Free Process Rewards without Process Labels"☆149Updated 2 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆239Updated last month
- Async pipelined version of Verl☆90Updated last month
- official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”☆223Updated this week
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆474Updated 3 weeks ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆184Updated 2 months ago
- Reproducing R1 for Code with Reliable Rewards☆201Updated 3 weeks ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆206Updated 3 weeks ago
- RewardBench: the first evaluation tool for reward models.☆582Updated this week
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆94Updated 2 months ago
- ☆173Updated 2 months ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆299Updated last month
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆231Updated 3 weeks ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆530Updated 2 months ago
- A Comprehensive Survey on Long Context Language Modeling☆147Updated last week
- ☆201Updated 3 months ago
- ☆731Updated last month
- ☆208Updated last week
- Reproducible, flexible LLM evaluations☆203Updated 3 weeks ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆248Updated this week
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆630Updated 4 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆140Updated 2 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆218Updated last year