dvlab-research / ARPOLinks
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆120Updated 3 months ago
Alternatives and similar repositories for ARPO
Users that are interested in ARPO are comparing it to the libraries listed below
Sorting:
- ☆210Updated 3 weeks ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆263Updated last week
- ☆330Updated last month
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆121Updated 5 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆157Updated last week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆243Updated 4 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆252Updated this week
- ☆59Updated 3 months ago
- ☆98Updated 3 weeks ago
- ☆90Updated 2 weeks ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆251Updated 3 months ago
- Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆106Updated 2 months ago
- ☆21Updated 4 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆132Updated 2 months ago
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆177Updated 4 months ago
- repo for paper https://arxiv.org/abs/2504.13837☆187Updated 2 months ago
- Repo of paper "Free Process Rewards without Process Labels"☆162Updated 5 months ago
- [NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.☆115Updated 3 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆168Updated 3 months ago
- The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆164Updated 2 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆232Updated 4 months ago
- A Self-Training Framework for Vision-Language Reasoning☆83Updated 7 months ago
- ☆205Updated 3 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆110Updated 4 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"☆143Updated 3 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆128Updated 4 months ago
- Extrapolating RLVR to General Domains without Verifiers☆158Updated last month
- ☆283Updated 3 months ago
- An Illusion of Progress? Assessing the Current State of Web Agents☆80Updated last month
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆81Updated 3 months ago