aialt/awesome-mobile-agents

✨✨Latest Papers and Datasets on Mobile and PC GUI Agent

☆159

Alternatives and similar repositories for awesome-mobile-agents

Users who are interested in awesome-mobile-agents are comparing it to the repositories listed below.

XiaoMi / mobilevlm
MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding
☆78 · Feb 27, 2025 · Updated last year
showlab / Awesome-GUI-Agent
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
☆1,197 · Aug 17, 2025 · Updated 11 months ago
opendilab / awesome-ui-agents
A curated list of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)
☆313 · Jun 17, 2026 · Updated last month
alipay / mobile-agent
☆46 · Mar 19, 2024 · Updated 2 years ago
PhoneLLM / Awesome-LLM-Powered-Phone-GUI-Agents
[TMLR] LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects
☆174 · Dec 2, 2025 · Updated 7 months ago
ai-agents-2030 / SPA-Bench
SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
☆64 · Jul 11, 2025 · Updated last year
OSU-NLP-Group / UGround
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
☆314 · Mar 11, 2026 · Updated 4 months ago
hrwise-nlp / AppBench
This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction
☆16 · Nov 4, 2024 · Updated last year
OSU-NLP-Group / GUI-Agents-Paper-List
Awesome GUI Agent Paper List
☆861 · Jun 28, 2026 · Updated 3 weeks ago
YuxiangChai / AMEX-codebase
☆33 · Sep 27, 2024 · Updated last year
Dongping-Chen / GUI-World
(ICLR 2025) The Official Code Repository for GUI-World.
☆69 · Dec 18, 2024 · Updated last year
StarWalkin / UI-NEXUS
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14 · Jul 27, 2025 · Updated 11 months ago
MobileAgentBench / mobile-agent-bench
☆37 · Sep 30, 2024 · Updated last year
uivision / UI-Vision
☆32 · Jul 3, 2025 · Updated last year
AriaUI / Aria-UI
Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents
☆406 · Feb 8, 2025 · Updated last year
THUDM / Android-Lab
☆322 · Aug 18, 2025 · Updated 11 months ago
BigTaige / MP-GUI
CVPR25
☆28 · Jul 2, 2025 · Updated last year
inspire-group / MIAdefenseSELENA
[USENIX Security 2022] Mitigating Membership Inference Attacks by Self-Distillation Through a Novel Ensemble Architecture
☆16 · Aug 29, 2022 · Updated 3 years ago
niuzaisheng / ScreenExplorer
ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World
☆26 · Jun 17, 2025 · Updated last year
KuofengGao / Verbose_Images
[ICLR 2024] Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images
☆44 · Jan 25, 2024 · Updated 2 years ago
zjunlp / WKM
[NeurIPS 2024] Agent Planning with World Knowledge Model
☆167 · Dec 17, 2024 · Updated last year
showlab / ShowUI
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
☆1,881 · Apr 24, 2026 · Updated 2 months ago
lukahhcm / Awesome_Environment_Scaling
Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …
☆71 · Jan 28, 2026 · Updated 5 months ago
vyokky / LLM-Brained-GUI-Agents-Survey
GitHub page for "Large Language Model-Brained GUI Agents: A Survey"
☆230 · Jun 23, 2025 · Updated last year
DigiRL-agent / digiq
☆121 · Apr 8, 2025 · Updated last year
wwh0411 / FedMABench
[EMNLP 2025 Main Oral] FedMABench: Benchmarking Mobile GUI Agents on Decentralized Heterogeneous User Data.
☆16 · Nov 11, 2025 · Updated 8 months ago
THUDM / VisualAgentBench
Towards Large Multimodal Models as Visual Foundation Agents
☆270 · Apr 24, 2025 · Updated last year
OS-Agent-Survey / OS-Agent-Survey
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).
☆486 · Aug 16, 2025 · Updated 11 months ago
lgy0404 / MemGUI-Bench
[ACM MM 2026] MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments
☆46 · Jul 13, 2026 · Updated last week
Yuqi-Zhou / GUI-G1
☆28 · Sep 15, 2025 · Updated 10 months ago
jylee425 / b-moca
Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation; CoLLAs 2025)
☆34 · Jul 21, 2025 · Updated last year
swordlidev / Evaluation-Multimodal-LLMs-Survey
A Survey on Benchmarks of Multimodal Large Language Models
☆156 · Jul 13, 2026 · Updated last week
UCSB-AI / Screen-Point-and-Read
Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"
☆31 · May 12, 2026 · Updated 2 months ago
LlamaTouch / LlamaTouch
LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation
☆70 · Aug 9, 2024 · Updated last year
MobileLLM / Personal_LLM_Agents_Survey
Paper list for Personal LLM Agents
☆433 · Jun 27, 2026 · Updated 3 weeks ago
jun0wanan / awesome-large-multimodal-agents
☆495 · Sep 25, 2024 · Updated last year
google-research / android_world
AndroidWorld is an environment and benchmark for autonomous agents
☆826 · Updated this week
testtestA6 / VisionDroid
VisionDroid
☆22 · Apr 2, 2024 · Updated 2 years ago
swarnaHub / SummarizationPrograms
[ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees
☆23 · Jun 19, 2023 · Updated 3 years ago