coinse / droidagentLinks
DroidAgent: Intent-Driven Mobile GUI Testing with Autonomous LLM Agents
☆36Updated last year
Alternatives and similar repositories for droidagent
Users that are interested in droidagent are comparing it to the libraries listed below
Sorting:
- ☆30Updated 2 years ago
- ☆45Updated last year
- Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"☆400Updated last year
- VisionDroid☆18Updated last year
- AndroidWorld is an environment and benchmark for autonomous agents☆465Updated this week
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆64Updated last year
- ☆31Updated last year
- GitHub page for "Large Language Model-Brained GUI Agents: A Survey"☆199Updated 3 months ago
- Automating Android apps with ChatGPT-like LLM.☆135Updated last year
- ☆241Updated 2 months ago
- ✨✨Latest Papers and Datasets on Mobile and PC GUI Agent☆138Updated 10 months ago
- AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of fully automating the entire pro…☆267Updated last year
- The model, data and code for the visual GUI Agent SeeClick☆433Updated 3 months ago
- GUI Grounding for Professional High-Resolution Computer Use☆271Updated 3 weeks ago
- VisionTasker introduces a novel two-stage framework combining vision-based UI understanding and LLM task planning for mobile task automat…☆89Updated 3 months ago
- A Comprehensive Benchmark for Software Development.☆115Updated last year
- Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation and CoLLAs 20…☆32Updated 2 months ago
- ☆485Updated last year
- A Universal Platform for Training and Evaluation of Mobile Interaction☆55Updated 3 weeks ago
- Paper list for Personal LLM Agents☆412Updated last year
- LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects☆121Updated 5 months ago
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆362Updated 7 months ago
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆94Updated last year
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆280Updated 3 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆239Updated 5 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆218Updated 4 months ago
- Inference code of Lingma SWE-GPT☆245Updated 10 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆163Updated last week
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆11Updated 2 months ago
- Repository for the paper "Large Language Model-Based Agents for Software Engineering: A Survey". Keep updating.☆506Updated 7 months ago