Official implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
☆655Apr 15, 2025Updated last year
Alternatives and similar repositories for AppAgentX
Users that are interested in AppAgentX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.☆6,766Mar 19, 2025Updated last year
- Mobile-Agent: The Powerful GUI Agent Family☆8,780May 14, 2026Updated 3 weeks ago
- MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding☆79Feb 27, 2025Updated last year
- AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient…☆1,372Jan 11, 2026Updated 4 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆14Jul 27, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.☆1,190Aug 17, 2025Updated 9 months ago
- Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"☆475Mar 22, 2024Updated 2 years ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆186Oct 8, 2025Updated 8 months ago
- [TMLR] LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects☆172Dec 2, 2025Updated 6 months ago
- MobileUse: an open-source mobile GUI agent for Android phone automation, AndroidWorld/AndroidLab evaluation, hierarchical reflection, and…☆154May 7, 2026Updated last month
- Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents☆405Feb 8, 2025Updated last year
- ☆35Jun 20, 2024Updated last year
- [NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents☆409Apr 13, 2026Updated last month
- Pioneering Automated GUI Interaction with Native Agents☆10,856Jan 27, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- GUI Grounding for Professional High-Resolution Computer Use☆374Apr 14, 2026Updated last month
- 视觉UI分析工具☆457Jul 26, 2023Updated 2 years ago
- A simple screen parsing tool towards pure vision based GUI agent☆24,851Apr 13, 2026Updated last month
- An open-sourced end-to-end VLM-based GUI Agent☆1,183Apr 4, 2025Updated last year
- Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent 🤖