Tongyi-MAI/MobileWorld

Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments (ACL 2026)

☆240

Alternatives and similar repositories for MobileWorld

Users who are interested in MobileWorld are comparing it to the repositories listed below.

google-research / android_world
AndroidWorld is an environment and benchmark for autonomous agents
☆826 · Updated this week

Tongyi-MAI / MAI-UI
MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B
☆1,823 · Apr 20, 2026 · Updated 3 months ago

lgy0404 / MemGUI-Bench
[ACM MM 2026] MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments
☆46 · Jul 13, 2026 · Updated last week

ai-agents-2030 / ViMo
☆26 · Apr 2, 2026 · Updated 3 months ago

THUDM / MobileRL
☆93 · Dec 23, 2025 · Updated 6 months ago

Computer-use-agents / dart-gui
DART-GUI: Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation
☆94 · Feb 26, 2026 · Updated 4 months ago

xlang-ai / CUA-Gym-Hub
CUA-Gym-Hub: mock web apps as reproducible RL training environments for computer-use agents
☆65 · Jul 9, 2026 · Updated last week

xiaomi-research / guievalkit
[ICML 2026] GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents
☆23 · Feb 26, 2026 · Updated 4 months ago

OSU-NLP-Group / GUI-Agents-Paper-List
Awesome GUI Agent Paper List
☆861 · Jun 28, 2026 · Updated 3 weeks ago

OpenGVLab / GUI-Odyssey
[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…
☆159 · Jan 3, 2026 · Updated 6 months ago

ZJU-REAL / Awesome-GUI-Agents
A curated collection of resources, tools, and frameworks for developing GUI Agents.
☆444 · Jul 9, 2026 · Updated last week

wwh0411 / FedMABench
[EMNLP 2025 Main Oral] FedMABench: Benchmarking Mobile GUI Agents on Decentralized Heterogeneous User Data.
☆16 · Nov 11, 2025 · Updated 8 months ago

open-compass / MMBench-GUI
Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…
☆112 · Sep 8, 2025 · Updated 10 months ago

njucckevin / OpenMobile-Code
The model, data and code for OpenMobile
☆49 · Jul 9, 2026 · Updated last week

ZJU-REAL / KnowU-Bench
Official code for "KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation"
☆76 · Jun 13, 2026 · Updated last month

ZJU-REAL / GRIL
[ACL 2026 findings] Pause or Fabricate? Training Language Models for Grounded Reasoning
☆25 · Apr 24, 2026 · Updated 2 months ago

ai-agents-2030 / SPA-Bench
SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
☆64 · Jul 11, 2025 · Updated last year

MadeAgents / mobile-use
MobileUse: an open-source mobile GUI agent for Android phone automation, AndroidWorld/AndroidLab evaluation, hierarchical reflection, and…
☆167 · Jul 10, 2026 · Updated last week

JIA-Lab-research / ARPO
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆162 · May 29, 2025 · Updated last year

xlang-ai / OSWorld-G
[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis
☆172 · Jun 18, 2026 · Updated last month

trillion-labs / gWorld
Generative Visual Code Mobile World Model
☆61 · May 15, 2026 · Updated 2 months ago

OpenGVLab / ZeroGUI
ZeroGUI: Automating Online GUI Learning at Zero Human Cost
☆119 · Jul 17, 2025 · Updated last year

stepfun-ai / gelab-zero
STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research c…
☆2,233 · May 11, 2026 · Updated 2 months ago

ritzz-ai / GUI-R1
Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents
☆252 · May 5, 2025 · Updated last year

xlang-ai / CUA-Gym
Scalable pipeline for synthesizing verifiable RLVR training data for computer-use agents
☆177 · May 26, 2026 · Updated last month

xlang-ai / OSWorld
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
☆3,026 · Updated this week

THUDM / Android-Lab
☆322 · Aug 18, 2025 · Updated 11 months ago

OpenBMB / AgentCPM-GUI
AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient…
☆1,392 · Jan 11, 2026 · Updated 6 months ago

EternityJune25 / MVISU-Bench
[ACM MM 2025 🔥 Oral] MVISU-Bench: Benchmarking Mobile Agents for Real-World Tasks by Multi-App, Vague, Interactive, Single-App and Uneth…
☆15 · Mar 13, 2026 · Updated 4 months ago

xlang-ai / OSWorld-V2
OSWorld 2.0: Benchmarking Computer Use Agents on Long-Horizon Real-World Tasks
☆196 · Jul 9, 2026 · Updated last week

OSU-NLP-Group / Mind2Web-2
[NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge
☆111 · May 17, 2026 · Updated 2 months ago

cyysky / MAI-UI-Navigation-Agent
MAI UI Navigation Agent
☆16 · Dec 29, 2025 · Updated 6 months ago

showlab / macosworld
☆35 · Jan 28, 2026 · Updated 5 months ago

ui-voyager / UI-Voyager
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience
☆78 · Apr 3, 2026 · Updated 3 months ago

PhoneLLM / Awesome-LLM-Powered-Phone-GUI-Agents
[TMLR] LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects
☆174 · Dec 2, 2025 · Updated 7 months ago

YuxiangChai / A3
☆35 · Jan 12, 2026 · Updated 6 months ago

ZJU-REAL / ClawGUI
Build, Evaluate, and Deploy GUI Agents: online RL training, standardized benchmarks, and real-device deployment in one framework.
☆1,314 · Jun 3, 2026 · Updated last month

kwai / MobileForge
Official code for "MobileForge: Annotation-Free Adaptation for Mobile GUI Agents with Hierarchical Feedback-Guided Policy Optimization"
☆73 · Jul 9, 2026 · Updated last week

iLearn-Lab / ACL26-PersonalAlign
[ACL 2026 main] PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records
☆21 · Apr 11, 2026 · Updated 3 months ago