Necolizer / awesome-rl-for-agentsLinks
A curated list of reinforcement learning (RL) for agents.
☆40Updated last week
Alternatives and similar repositories for awesome-rl-for-agents
Users that are interested in awesome-rl-for-agents are comparing it to the libraries listed below
Sorting:
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆196Updated 5 months ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆360Updated 3 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆193Updated 2 weeks ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆197Updated 4 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆309Updated 2 weeks ago
- ☆278Updated 3 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆282Updated last week
- Paper List of Inference/Test Time Scaling/Computing☆319Updated 2 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆326Updated last week
- ☆414Updated 3 weeks ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆96Updated 10 months ago
- ☆307Updated 5 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆354Updated 2 weeks ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆350Updated 3 weeks ago
- [🏆AAAI2025] Official Repo for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area.☆55Updated last week
- Official Repository of "Learning what reinforcement learning can't"☆68Updated last month
- [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding☆91Updated 7 months ago
- Training VLM agents with multi-turn reinforcement learning☆285Updated last week
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆90Updated last week
- A comprehensive collection of process reward models.☆115Updated 3 weeks ago
- ☆109Updated last month
- ☆192Updated 3 months ago
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆57Updated 6 months ago
- ☆335Updated 3 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆166Updated 3 weeks ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆314Updated last month
- Latest Advances on Long Chain-of-Thought Reasoning☆537Updated 3 months ago
- ☆178Updated 5 months ago
- One-shot Entropy Minimization☆186Updated 4 months ago
- Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models☆263Updated 2 months ago