Tongyi-MAI / MobileWorld
Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments
☆112 Updated this week
Alternatives and similar repositories for MobileWorld
Users who are interested in MobileWorld are comparing it to the repositories listed below.
- ☆207 Updated last week
- MiroTrain is an efficient and algorithm-first framework research agent. ☆132 Updated 5 months ago
- MiroRL is an MCP-first reinforcement learning framework for deep research agent. ☆226 Updated 5 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents ☆304 Updated 3 months ago
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization ☆95 Updated 8 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis ☆177 Updated 3 months ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning. ☆164 Updated 4 months ago
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond ☆190 Updated 6 months ago
- Scaling Preference Data Curation via Human-AI Synergy ☆137 Updated 6 months ago
- ☆92 Updated 8 months ago
- ☆180 Updated 9 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay ☆148 Updated 8 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking ☆114 Updated 3 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability" ☆147 Updated 8 months ago
- ☆82 Updated 9 months ago
- [ICLR 2026] Efficient Agent Training for Computer Use ☆135 Updated 4 months ago
- ☆192 Updated 3 months ago
- ☆100 Updated 5 months ago
- ☆76 Updated 7 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning". ☆95 Updated 2 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents ☆290 Updated 2 months ago
- Towards a Unified View of Large Language Model Post-Training ☆199 Updated 4 months ago
- Scaling Long-Horizon LLM Agent via Context-Folding ☆101 Updated last week
- ☆162 Updated last year
- ☆153 Updated this week
- [NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis ☆144 Updated 2 months ago
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost ☆107 Updated 6 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too… ☆384 Updated 5 months ago
- Evergreen, contamination-free, real-world, domain-specific AI evaluation framework ☆119 Updated 3 weeks ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆142 Updated 11 months ago