SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning
☆702Apr 11, 2026Updated 3 weeks ago
Alternatives and similar repositories for SkillRL
Users that are interested in SkillRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code repository of Shuffle-R1☆105Feb 23, 2026Updated 2 months ago
- Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning☆335Updated this week
- Paper: “MEMRL: SELF-EVOLVING AGENTS VIA RUNTIME REINFORCEMENT LEARNING ON EPISODIC MEMORY” Open-Source Code☆101Updated this week
- [ICML 2026] RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings☆481Feb 27, 2026Updated 2 months ago
- MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents☆456Mar 31, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆62Apr 3, 2026Updated last month
- Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)☆19Jul 1, 2025Updated 10 months ago
- ☆114Apr 19, 2026Updated 2 weeks ago
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆35Apr 13, 2026Updated 3 weeks ago
- Official code for paper "GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable R…☆60Mar 29, 2026Updated last month
- [ICML'26] VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.☆53Updated this week
- Reinforcement Learning of Vision Language Models with Self Visual Perception Reward☆170Mar 14, 2026Updated last month
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆63Mar 17, 2026Updated last month
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,868Feb 27, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆60Aug 5, 2025Updated 9 months ago
- [ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model☆18Feb 24, 2025Updated last year
- ☆52Feb 12, 2025Updated last year
- 🚀 Sliding Window Attention Training for Efficient Large Language Models☆16Dec 8, 2025Updated 4 months ago
- ☆313Jul 6, 2025Updated 10 months ago
- Official Repo for Open-Reasoner-Zero☆2,093Jun 2, 2025Updated 11 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning☆32Jan 25, 2026Updated 3 months ago
- Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.☆13Nov 19, 2024Updated last year
- MemEvolve & EvolveLab☆211Dec 23, 2025Updated 4 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆80Nov 6, 2025Updated 6 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,637Nov 13, 2025Updated 5 months ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆17Mar 2, 2026Updated 2 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,380May 16, 2025Updated 11 months ago
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations☆19Oct 18, 2025Updated 6 months ago
- 🚀 Text2Grad: Converting natural language feedback into gradient signals for precise model optimization. Revolutionizing RLHF with span-l…☆34Feb 6, 2026Updated 3 months ago
- ☆14Jul 17, 2025Updated 9 months ago
- [ICLR'26] EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization☆30Aug 5, 2025Updated 9 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆711Aug 5, 2025Updated 9 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official implementation of "PyVision-RL: Forging Open Agentic Vision Models via RL."☆72Feb 25, 2026Updated 2 months ago
- Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe☆192Apr 29, 2026Updated last week
- Reinforcement Learning via Self-Distillation (SDPO)☆840Feb 18, 2026Updated 2 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆162Oct 30, 2024Updated last year
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆41Jan 5, 2026Updated 4 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆53Dec 13, 2025Updated 4 months ago
- [CVPR'26] VisPlay: Self-Evolving Vision-Language Models☆56Feb 25, 2026Updated 2 months ago