ai-agents-2030/SPA-Bench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ai-agents-2030/SPA-Bench)

ai-agents-2030 / SPA-Bench

SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation

☆64

Alternatives and similar repositories for SPA-Bench

Users that are interested in SPA-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LlamaTouch / AgentEnv
View on GitHub
An environment for mobile angets to interact with realistic android device or android emulator
☆13Jul 19, 2024Updated 2 years ago
YuxiangChai / OpenSlides
View on GitHub
AI-powered slide workspace for creating, editing, versioning, and presenting beautiful reveal.js decks from prompts and source files.
☆15Apr 14, 2026Updated 3 months ago
PhoneLLM / Awesome-LLM-Powered-Phone-GUI-Agents
View on GitHub
[TMLR] LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects
☆174Dec 2, 2025Updated 7 months ago
google-research / android_world
View on GitHub
AndroidWorld is an environment and benchmark for autonomous agents
☆832Jul 16, 2026Updated last week
iLearn-Lab / ACL25-GUI-explorer
View on GitHub
[ACL 2025] GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent
☆68May 28, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
DistRL-lab / distrl-open
View on GitHub
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
☆24Aug 4, 2025Updated 11 months ago
OpenGVLab / GUI-Odyssey
View on GitHub
[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…
☆159Jan 3, 2026Updated 6 months ago
OS-Copilot / OS-Genesis
View on GitHub
[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
☆188Oct 8, 2025Updated 9 months ago
MobileAgentBench / mobile-agent-bench
View on GitHub
☆37Sep 30, 2024Updated last year
DigiRL-agent / digiq
View on GitHub
☆121Apr 8, 2025Updated last year
StarWalkin / UI-NEXUS
View on GitHub
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14Jul 27, 2025Updated 11 months ago
YuxiangChai / A3
View on GitHub
☆35Jan 12, 2026Updated 6 months ago
ai-agents-2030 / ViMo
View on GitHub
☆26Apr 2, 2026Updated 3 months ago
lgy0404 / LearnAct
View on GitHub
[ACL 2026] LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark
☆48Apr 18, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
DigiRL-agent / digirl
View on GitHub
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
☆393Feb 22, 2025Updated last year
lll6gg / UI-R1
View on GitHub
[AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"
☆158Nov 24, 2025Updated 8 months ago
AndroidArenaAgent / AndroidArena
View on GitHub
☆47Apr 11, 2024Updated 2 years ago
OSU-NLP-Group / GUI-Agents-Paper-List
View on GitHub
Awesome GUI Agent Paper List
☆864Jun 28, 2026Updated 3 weeks ago
MadeAgents / ColorBench
View on GitHub
[WWW'26 Oral] ColorBench: a graph-structured benchmark for complex, long-horizon tasks in mobile GUI agents.
☆15Apr 13, 2026Updated 3 months ago
ZJU-ACES-ISE / ChatUITest
View on GitHub
Under construction
☆14Jan 15, 2025Updated last year
Tongyi-MAI / MobileWorld
View on GitHub
Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments (ACL 2026)
☆243Jul 2, 2026Updated 3 weeks ago
RUCBM / GUICourse
View on GitHub
GUICourse: From General Vision Langauge Models to Versatile GUI Agents
☆143Mar 1, 2026Updated 4 months ago
IMNearth / CoAT
View on GitHub
Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)
☆103Oct 14, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
THUDM / Android-Lab
View on GitHub
☆324Aug 18, 2025Updated 11 months ago
xbmxb / CoCo-Agent
View on GitHub
☆35Jun 20, 2024Updated 2 years ago
vyokky / LLM-Brained-GUI-Agents-Survey
View on GitHub
GitHub page for "Large Language Model-Brained GUI Agents: A Survey"
☆230Jun 23, 2025Updated last year
aburns4 / textualforesight
View on GitHub
☆12Aug 8, 2024Updated last year
YuxiangChai / AMEX-codebase
View on GitHub
☆33Sep 27, 2024Updated last year
aialt / awesome-mobile-agents
View on GitHub
✨✨Latest Papers and Datasets on Mobile and PC GUI Agent
☆159Nov 29, 2024Updated last year
wwh0411 / FedMABench
View on GitHub
[EMNLP 2025 Main Oral] FedMABench: Benchmarking Mobile GUI Agents on Decentralized Heterogeneous User Data.
☆16Nov 11, 2025Updated 8 months ago
google-research-datasets / seq2act
View on GitHub
This repository contains the opensource version of the datasets were used for different parts of training and testing of models that grou…
☆35Aug 20, 2020Updated 5 years ago
alibaba / MobiZen-GUI
View on GitHub
☆46Mar 25, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
alipay / mobile-agent
View on GitHub
☆46Mar 19, 2024Updated 2 years ago
kwai / MobileForge
View on GitHub
Official code for "MobileForge: Annotation-Free Adaptation for Mobile GUI Agents with Hierarchical Feedback-Guided Policy Optimization"
☆75Jul 9, 2026Updated 2 weeks ago
showlab / Awesome-GUI-Agent
View on GitHub
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
☆1,199Aug 17, 2025Updated 11 months ago
Yan98 / GTA1
View on GitHub
☆130Oct 3, 2025Updated 9 months ago
Euphoria16 / UI-Genie
View on GitHub
[NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents
☆60Nov 27, 2025Updated 7 months ago
OpenBMB / AgentCPM-GUI
View on GitHub
AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient…
☆1,394Jan 11, 2026Updated 6 months ago
THUDM / VisualAgentBench
View on GitHub
Towards Large Multimodal Models as Visual Foundation Agents
☆274Apr 24, 2025Updated last year