microsoft / GUI-Agent-RLLinks
☆39Updated 6 months ago
Alternatives and similar repositories for GUI-Agent-RL
Users that are interested in GUI-Agent-RL are comparing it to the libraries listed below
Sorting:
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆177Updated 3 months ago
- ☆254Updated last week
- ✨✨Latest Papers and Datasets on Mobile and PC GUI Agent☆149Updated last year
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆143Updated 11 months ago
- ☆70Updated 7 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆95Updated 2 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆130Updated 10 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆155Updated last year
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆314Updated last month
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆298Updated this week
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆71Updated 8 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆148Updated 8 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Updated 6 months ago
- ☆108Updated last month
- Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework☆205Updated 3 weeks ago
- [NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆145Updated 3 months ago
- ☆427Updated 3 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆256Updated 9 months ago
- MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings)☆71Updated 5 months ago
- [ICLR 2026] Efficient Agent Training for Computer Use☆135Updated 5 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆55Updated last year
- ☆229Updated last month
- ☆219Updated 8 months ago
- ☆192Updated 3 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆133Updated 10 months ago
- Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"☆61Updated last year
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆306Updated 3 months ago
- ☆165Updated 3 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆87Updated 10 months ago
- Test-time preferenece optimization (ICML 2025).☆178Updated 8 months ago