[TMLR] LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects
☆160 · Dec 2, 2025 · Updated 3 months ago
Alternatives and similar repositories for Awesome-LLM-Powered-Phone-GUI-Agents
Users that are interested in Awesome-LLM-Powered-Phone-GUI-Agents are comparing it to the libraries listed below
- ☆35 · Jan 12, 2026 · Updated last month
- ☆12 · Aug 8, 2024 · Updated last year
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation ☆60 · Jul 11, 2025 · Updated 7 months ago
- VisionDroid ☆21 · Apr 2, 2024 · Updated last year
- Under construction ☆13 · Jan 15, 2025 · Updated last year
- Official code repo for the paper "LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark" ☆46 · May 16, 2025 · Updated 9 months ago
- ☆20 · Mar 26, 2025 · Updated 11 months ago
- Building a comprehensive and handy list of papers for GUI agents ☆642 · Oct 27, 2025 · Updated 4 months ago
- [AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning" ☆147 · Nov 24, 2025 · Updated 3 months ago
- ☆35 · Sep 30, 2024 · Updated last year
- ✨✨Latest Papers and Datasets on Mobile and PC GUI Agent ☆152 · Nov 29, 2024 · Updated last year
- ☆303 · Aug 18, 2025 · Updated 6 months ago
- [ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e… ☆148 · Jan 3, 2026 · Updated 2 months ago
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration ☆20 · Mar 21, 2025 · Updated 11 months ago
- Paper list for Personal LLM Agents ☆427 · May 8, 2024 · Updated last year
- On-the-fly Definition Augmentation of LLMs for Biomedical NER ☆14 · Apr 14, 2025 · Updated 10 months ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents. ☆311 · Mar 2, 2026 · Updated last week
- Setup Clawd Bot automatically on Orgo. Free. ☆48 · Jan 24, 2026 · Updated last month
- ☆18 · Mar 19, 2025 · Updated 11 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation ☆16 · Oct 27, 2024 · Updated last year
- Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation; CoLLAs 2025) ☆35 · Jul 21, 2025 · Updated 7 months ago
- [NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis ☆150 · Nov 6, 2025 · Updated 4 months ago
- GitHub page for "Large Language Model-Brained GUI Agents: A Survey" ☆220 · Jun 23, 2025 · Updated 8 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents ☆302 · Jul 18, 2025 · Updated 7 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features" ☆17 · Mar 31, 2025 · Updated 11 months ago
- (ICLR 2025) The Official Code Repository for GUI-World. ☆68 · Dec 18, 2024 · Updated last year
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost ☆113 · Jul 17, 2025 · Updated 7 months ago
- SWE-Exp: Experience-Driven Software Issue Resolution ☆35 · Oct 17, 2025 · Updated 4 months ago
- ☆22 · May 23, 2025 · Updated 9 months ago
- This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retri… ☆18 · Mar 13, 2025 · Updated 11 months ago
- 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents. ☆1,132 · Aug 17, 2025 · Updated 6 months ago
- ☆45 · Apr 11, 2024 · Updated last year
- ☆23 · Oct 2, 2024 · Updated last year
- [NeurIPS 2025 Spotlight] Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents" ☆53 · May 21, 2025 · Updated 9 months ago
- GUICourse: From General Vision Language Models to Versatile GUI Agents ☆136 · Mar 1, 2026 · Updated last week
- CVPR25 ☆26 · Jul 2, 2025 · Updated 8 months ago
- ☆31 · Jul 3, 2025 · Updated 8 months ago
- An MCP server that hosts finite state machines as dynamic resources that multiple clients can subscribe to and be updated when their stat… ☆25 · Aug 24, 2025 · Updated 6 months ago
- ☆29 · Apr 22, 2025 · Updated 10 months ago