vyokky/LLM-Brained-GUI-Agents-Survey

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vyokky/LLM-Brained-GUI-Agents-Survey)

vyokky / LLM-Brained-GUI-Agents-Survey

GitHub page for "Large Language Model-Brained GUI Agents: A Survey"

☆230

Alternatives and similar repositories for LLM-Brained-GUI-Agents-Survey

Users that are interested in LLM-Brained-GUI-Agents-Survey are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OSU-NLP-Group / GUI-Agents-Paper-List
View on GitHub
Awesome GUI Agent Paper List
☆866Jun 28, 2026Updated last month
njucckevin / SeeClick
View on GitHub
The model, data and code for the visual GUI Agent SeeClick
☆493Jul 13, 2025Updated last year
iLearn-Lab / ACL25-GUI-explorer
View on GitHub
[ACL 2025] GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent
☆68May 28, 2025Updated last year
InfiXAI / InfiGUI-R1
View on GitHub
Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"
☆67Dec 4, 2025Updated 7 months ago
ZJU-ACES-ISE / ChatUITest
View on GitHub
Under construction
☆14Jan 15, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
OS-Agent-Survey / OS-Agent-Survey
View on GitHub
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).
☆486Aug 16, 2025Updated 11 months ago
showlab / Awesome-GUI-Agent
View on GitHub
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
☆1,201Aug 17, 2025Updated 11 months ago
OSU-NLP-Group / UGround
View on GitHub
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
☆316Mar 11, 2026Updated 4 months ago
mobilegptsys / MobileGPT
View on GitHub
☆27Oct 2, 2024Updated last year
ai-agents-2030 / SPA-Bench
View on GitHub
SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
☆64Jul 11, 2025Updated last year
YuxiangChai / AMEX-codebase
View on GitHub
☆33Sep 27, 2024Updated last year
tml1026 / Lifelong-Personalized-Agent
View on GitHub
☆18Jul 22, 2025Updated last year
RUCBM / GUICourse
View on GitHub
GUICourse: From General Vision Langauge Models to Versatile GUI Agents
☆143Mar 1, 2026Updated 4 months ago
MobileLLM / LLM-Explorer
View on GitHub
☆25Jun 1, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Android-Functional-bugs-study / home
View on GitHub
☆20Jul 26, 2023Updated 3 years ago
OS-Copilot / OS-Atlas
View on GitHub
OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
☆452Apr 20, 2025Updated last year
THUDM / Android-Lab
View on GitHub
☆324Aug 18, 2025Updated 11 months ago
lgy0404 / LearnAct
View on GitHub
[ACL 2026] LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark
☆48Apr 18, 2026Updated 3 months ago
opendilab / awesome-ui-agents
View on GitHub
A curated list of of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)
☆314Jun 17, 2026Updated last month
wendell0218 / GVA-Survey
View on GitHub
Official repository of the paper "Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms"
☆86Apr 15, 2026Updated 3 months ago
InfiXAI / InfiGUIAgent
View on GitHub
☆74May 23, 2025Updated last year
Yan98 / GTA1
View on GitHub
☆130Oct 3, 2025Updated 9 months ago
WebChoreArena / WebChoreArena
View on GitHub
COLM2026
☆36Jul 9, 2026Updated 2 weeks ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
AriaUI / Aria-UI
View on GitHub
Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents
☆407Feb 8, 2025Updated last year
xiaomi-research / guievalkit
View on GitHub
[ICML 2026] GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents
☆24Feb 26, 2026Updated 5 months ago
aialt / awesome-mobile-agents
View on GitHub
✨✨Latest Papers and Datasets on Mobile and PC GUI Agent
☆159Nov 29, 2024Updated last year
OpenGVLab / GUI-Odyssey
View on GitHub
[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…
☆159Jan 3, 2026Updated 6 months ago
cuplv / chimpcheck
View on GitHub
Combinator Library for writing test generators and test properties for Android Apps
☆12Jul 26, 2019Updated 7 years ago
xlang-ai / aguvis
View on GitHub
[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
☆389Mar 7, 2025Updated last year
sqlab-sustech / HarmonyOS-App-Test
View on GitHub
☆11Oct 17, 2024Updated last year
YXB-NKU / SE-GUI
View on GitHub
[NeurIPS 2025]"Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"
☆108Oct 21, 2025Updated 9 months ago
DigiRL-agent / digirl
View on GitHub
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
☆393Feb 22, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
showlab / ShowUI
View on GitHub
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
☆1,887Apr 24, 2026Updated 3 months ago
google-research / android_world
View on GitHub
AndroidWorld is an environment and benchmark for autonomous agents
☆835Jul 16, 2026Updated last week
PhoneLLM / Awesome-LLM-Powered-Phone-GUI-Agents
View on GitHub
[TMLR] LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects
☆174Dec 2, 2025Updated 7 months ago
AndroidArenaAgent / AndroidArena
View on GitHub
☆47Apr 11, 2024Updated 2 years ago
OS-Copilot / OS-Genesis
View on GitHub
[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
☆189Oct 8, 2025Updated 9 months ago
MobileLLM / AutoDroid
View on GitHub
Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"
☆482Mar 22, 2024Updated 2 years ago
cooelf / Auto-GUI
View on GitHub
Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)
☆261Jul 16, 2024Updated 2 years ago