dvlab-research / ARPO
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆68 · Updated this week
Alternatives and similar repositories for ARPO
Users who are interested in ARPO are comparing it to the repositories listed below.
- Scaling Computer-Use Grounding via UI Decomposition and Synthesis ☆49 · Updated this week
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" ☆43 · Updated 3 months ago
- An Illusion of Progress? Assessing the Current State of Web Agents ☆52 · Updated last week
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?" ☆57 · Updated 7 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆136 · Updated 6 months ago
- Code for "Reasoning to Learn from Latent Thoughts" ☆101 · Updated 2 months ago
- Code for Paper: Learning Adaptive Parallel Reasoning with Language Models ☆94 · Updated last month
- Code for Paper: Harnessing Webpage UIs for Text-Rich Visual Understanding ☆51 · Updated 5 months ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models ☆44 · Updated last month
- This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR2025] ☆66 · Updated last month
- General Reasoner: Advancing LLM Reasoning Across All Domains ☆117 · Updated this week
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control ☆55 · Updated 4 months ago
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning ☆44 · Updated last week
- ☆40 · Updated 3 weeks ago
- ☆145 · Updated last week
- ☆102 · Updated last month
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" ☆99 · Updated last week
- Official Repository of LatentSeek ☆30 · Updated last week
- ☆58 · Updated 2 months ago
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934 ☆30 · Updated this week
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials ☆32 · Updated 3 months ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context" ☆34 · Updated 10 months ago
- ☆21 · Updated 3 months ago
- Natural Language Reinforcement Learning ☆89 · Updated 5 months ago
- Efficient Agent Training for Computer Use ☆85 · Updated last week
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners ☆82 · Updated last week
- Official implementation of paper "Process Reward Model with Q-value Rankings" ☆59 · Updated 3 months ago
- ☆102 · Updated last month
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation ☆64 · Updated 2 weeks ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆36 · Updated last week