dvlab-research / ARPOLinks
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆109Updated 2 months ago
Alternatives and similar repositories for ARPO
Users that are interested in ARPO are comparing it to the libraries listed below
Sorting:
- Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆101Updated 2 months ago
- ☆55Updated 2 months ago
- ☆204Updated last week
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆58Updated 10 months ago
- ☆89Updated last month
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆122Updated last month
- ☆325Updated 3 weeks ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆118Updated 4 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆141Updated 8 months ago
- ☆21Updated 3 months ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆172Updated 2 weeks ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"☆139Updated 2 months ago
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆78Updated 3 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆155Updated last month
- ☆51Updated 2 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆163Updated 2 months ago
- ☆73Updated this week
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆165Updated 3 months ago
- A Self-Training Framework for Vision-Language Reasoning☆82Updated 7 months ago
- repo for paper https://arxiv.org/abs/2504.13837☆182Updated last month
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆86Updated 4 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆64Updated last month
- [NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.☆113Updated 3 months ago
- Resources for the Enigmata Project.☆64Updated last week
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆108Updated 3 months ago
- "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"☆78Updated 4 months ago
- An Illusion of Progress? Assessing the Current State of Web Agents☆77Updated last month
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆105Updated 2 months ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆53Updated 2 months ago
- ☆61Updated 5 months ago