SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning
☆769May 17, 2026Updated last week
Alternatives and similar repositories for SkillRL
Users that are interested in SkillRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code repository of Shuffle-R1☆98Feb 23, 2026Updated 3 months ago
- Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning☆360May 1, 2026Updated 3 weeks ago
- MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents☆483Mar 31, 2026Updated last month
- Paper: “MEMRL: SELF-EVOLVING AGENTS VIA RUNTIME REINFORCEMENT LEARNING ON EPISODIC MEMORY” Open-Source Code☆119May 2, 2026Updated 3 weeks ago
- [ICML 2026] RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings☆526May 16, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆64Apr 3, 2026Updated last month
- Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)☆19Jul 1, 2025Updated 10 months ago
- ☆122Apr 19, 2026Updated last month
- Official code for paper "GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable R…☆64Mar 29, 2026Updated last month
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,909Feb 27, 2026Updated 3 months ago
- ☆34Sep 19, 2025Updated 8 months ago
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆40Apr 13, 2026Updated last month
- [ICML'26] VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.☆57May 18, 2026Updated last week
- Reinforcement Learning of Vision Language Models with Self Visual Perception Reward☆170Mar 14, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆65Mar 17, 2026Updated 2 months ago
- [ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model☆18Feb 24, 2025Updated last year
- [NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learning☆42Apr 13, 2026Updated last month
- ☆52Feb 12, 2025Updated last year
- Code2World: A GUI World Model via Renderable Code Generation☆319Feb 12, 2026Updated 3 months ago
- The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"☆207Dec 25, 2025Updated 5 months ago
- 🚀 Sliding Window Attention Training for Efficient Large Language Models☆16Dec 8, 2025Updated 5 months ago
- ☆312Jul 6, 2025Updated 10 months ago
- Official Repo for Open-Reasoner-Zero☆2,091Jun 2, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Aligning Agentic World Models via Knowledgeable Experience Learning☆35May 15, 2026Updated last week
- Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.☆13Nov 19, 2024Updated last year
- [ICML'26] MemEvolve & EvolveLab☆225May 5, 2026Updated 3 weeks ago
- ☆81May 14, 2026Updated 2 weeks ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,753Nov 13, 2025Updated 6 months ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆17Mar 2, 2026Updated 2 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,385May 16, 2025Updated last year
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations☆19Oct 18, 2025Updated 7 months ago
- 🚀 Text2Grad: Converting natural language feedback into gradient signals for precise model optimization. Revolutionizing RLHF with span-l…☆35Feb 6, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆14Jul 17, 2025Updated 10 months ago
- A Searching-based Agent Model for Open-Domain Open-Ended Question Answering☆36Jun 20, 2025Updated 11 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆712Aug 5, 2025Updated 9 months ago
- [ICLR'26] EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization☆30Aug 5, 2025Updated 9 months ago
- Official implementation of "PyVision-RL: Forging Open Agentic Vision Models via RL."☆66Feb 25, 2026Updated 3 months ago
- Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"☆296May 20, 2026Updated last week
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,810May 11, 2025Updated last year