☆36Sep 30, 2024Updated last year
Alternatives and similar repositories for mobile-agent-bench
Users that are interested in mobile-agent-bench are comparing it to the libraries listed below
Sorting:
- DroidAgent: Intent-Driven Mobile GUI Testing with Autonomous LLM Agents☆60Mar 12, 2024Updated 2 years ago
- ☆31Sep 27, 2024Updated last year
- ☆35Jan 12, 2026Updated 2 months ago
- [ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…☆149Jan 3, 2026Updated 2 months ago
- ☆307Aug 18, 2025Updated 7 months ago
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.☆392Feb 22, 2025Updated last year
- ClickAgent: Enhancing UI Location Capabilities of Autonomous Agents☆28Oct 28, 2024Updated last year
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆136Mar 1, 2026Updated 3 weeks ago
- VisionDroid☆22Apr 2, 2024Updated last year
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆256Jul 16, 2024Updated last year
- (ICLR 2025) The Official Code Repository for GUI-World.☆68Dec 18, 2024Updated last year
- AndroidWorld is an environment and benchmark for autonomous agents☆670Mar 13, 2026Updated last week
- ☆20Apr 24, 2024Updated last year
- ☆30Mar 11, 2025Updated last year
- EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO☆21Jan 24, 2026Updated last month
- Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments☆61Aug 19, 2024Updated last year
- Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations b…☆34Jun 27, 2024Updated last year
- Code of ["Spectral Prompt Tuning: Unveiling Unseen Classes for Zero-Shot Semantic Segmentation"]☆14Apr 26, 2024Updated last year
- UI auto test framework based on YOLO to recognize elements, less code, less maintenance, cross platform, cross project / 基于YOLO的UI层自动化测试框…☆15Feb 27, 2026Updated 3 weeks ago
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Jan 27, 2025Updated last year
- Official repository for the paper, "FedMABench: Benchmarking Mobile GUI Agents on Decentralized Heterogeneous User Data", EMNLP 2025 Main…☆16Nov 11, 2025Updated 4 months ago
- [AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"☆149Nov 24, 2025Updated 3 months ago
- EmojiCrypt: Prompt Encryption for Secure Communication with Large Language Models☆23Feb 21, 2024Updated 2 years ago
- An environment for mobile angets to interact with realistic android device or android emulator☆13Jul 19, 2024Updated last year
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆440Apr 20, 2025Updated 11 months ago
- ☆45Mar 19, 2024Updated 2 years ago
- Official code repo for the paper "MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments"☆32Mar 9, 2026Updated last week
- ☆10May 16, 2021Updated 4 years ago
- The model, data and code for the visual GUI Agent SeeClick☆472Jul 13, 2025Updated 8 months ago
- ☆16Jun 10, 2025Updated 9 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Jul 27, 2025Updated 7 months ago
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien…☆27Oct 10, 2025Updated 5 months ago
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆12Dec 5, 2023Updated 2 years ago
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆20Mar 21, 2025Updated last year
- Mobile adapter for IOS and android for mobile LLM agents☆46Nov 24, 2024Updated last year
- ☆23Oct 24, 2025Updated 4 months ago
- PatchBackdoor is a code base associated with paper PatchBackdoor.☆12Aug 27, 2024Updated last year
- 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.☆1,142Aug 17, 2025Updated 7 months ago
- [SIGIR 2024] TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision☆20Mar 28, 2024Updated last year