MobileLLM / AutoDroid

Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"

☆384

Alternatives and similar repositories for AutoDroid

Users who are interested in AutoDroid are comparing it to the repositories listed below.

MobileLLM / DroidBot-GPT
Automating Android apps with ChatGPT-like LLM.
☆129 · Updated last year
LlamaTouch / LlamaTouch
LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation
☆63 · Updated last year
AkimotoAyako / VisionTasker
VisionTasker introduces a novel two-stage framework combining vision-based UI understanding and LLM task planning for mobile task automat…
☆83 · Updated 3 weeks ago
google-research / android_world
AndroidWorld is an environment and benchmark for autonomous agents
☆370 · Updated this week
MobileLLM / Personal_LLM_Agents_Survey
Paper list for Personal LLM Agents
☆403 · Updated last year
AndroidArenaAgent / AndroidArena
☆43 · Updated last year
coinse / droidagent
DroidAgent: Intent-Driven Mobile GUI Testing with Autonomous LLM Agents
☆31 · Updated last year
Westlake-AGI-Lab / AppAgentX
Official implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
☆484 · Updated 3 months ago
njucckevin / SeeClick
The model, data and code for the visual GUI Agent SeeClick
☆411 · Updated 3 weeks ago
xlang-ai / aguvis
[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
☆340 · Updated 5 months ago
THUDM / Android-Lab
☆228 · Updated 3 months ago
likaixin2000 / ScreenSpot-Pro-GUI-Grounding
GUI Grounding for Professional High-Resolution Computer Use
☆238 · Updated last month
niuzaisheng / ScreenAgent
ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)
☆483 · Updated 8 months ago
cooelf / Auto-GUI
Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)
☆244 · Updated last year
MobileAgentBench / mobile-agent-bench
☆30 · Updated 10 months ago
vyokky / LLM-Brained-GUI-Agents-Survey
GitHub page for "Large Language Model-Brained GUI Agents: A Survey"
☆181 · Updated last month
showlab / Awesome-GUI-Agent
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
☆819 · Updated 2 months ago
testtestA6 / VisionDroid
VisionDroid
☆18 · Updated last year
MobileLLM / AutoDroid-V2
☆19 · Updated 3 months ago
alipay / mobile-agent
☆42 · Updated last year
IMNearth / CoAT
Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)
☆91 · Updated 9 months ago
X-LANCE / Mobile-Env
A Universal Platform for Training and Evaluation of Mobile Interaction
☆51 · Updated 3 weeks ago
PhoneLLM / Awesome-LLM-Powered-Phone-GUI-Agents
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects
☆99 · Updated 3 months ago
DigiRL-agent / digirl
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
☆372 · Updated 5 months ago
bz-lab / AUITestAgent
AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of fully automating the entire pro…
☆250 · Updated last year
OSU-NLP-Group / Mind2Web
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…
☆853 · Updated 4 months ago
showlab / ShowUI
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
☆1,409 · Updated 2 months ago
kyegomez / ScreenAI
Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"
☆355 · Updated this week
OpenGVLab / GUI-Odyssey
GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 episodes from…
☆123 · Updated this week
ranpox / awesome-computer-use
This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.
☆412 · Updated 2 months ago