ZJU-REAL / GUI-RCPOLinks
[AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615
☆59Updated 3 months ago
Alternatives and similar repositories for GUI-RCPO
Users that are interested in GUI-RCPO are comparing it to the libraries listed below
Sorting:
- ☆271Updated last week
- ☆36Updated 4 months ago
- ☆33Updated 6 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Updated 3 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆178Updated 4 months ago
- Collection of model-centric MCP servers☆25Updated 8 months ago
- Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning☆55Updated last month
- ☆148Updated 6 months ago
- [NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents☆376Updated 3 months ago
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆94Updated 7 months ago
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆95Updated 8 months ago
- Official Repository for PosterGen☆211Updated this week
- [TMLR 2025] Reading List of Memory Augmented Multimodal Research, including multimodal context modeling, memory in vision and robotics, a…☆57Updated 3 weeks ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆48Updated last month
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant☆44Updated last year
- Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning☆65Updated last month
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost☆107Updated 6 months ago
- MemVerse: Multimodal Memory for Lifelong Learning Agents☆128Updated last month
- ASTRA is an end-to-end system for synthesizing agentic trajectories and rule-verifiable environments for SFT and RL training, developed b…☆109Updated last week
- The code and data of We-Math 2.0.☆164Updated 5 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆392Updated 5 months ago
- The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"☆136Updated 5 months ago
- ☆193Updated 3 months ago
- ☆32Updated 6 months ago
- ☆73Updated 8 months ago
- [ICLR 2026] Efficient Agent Training for Computer Use☆135Updated 5 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆89Updated 8 months ago
- [EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs☆59Updated 5 months ago
- [AAAI 2026] The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants☆46Updated 2 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆115Updated last month