X-LANCE/Mobile-Env

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/X-LANCE/Mobile-Env)

X-LANCE / Mobile-Env

A Universal Platform for Training and Evaluation of Mobile Interaction

☆63

Alternatives and similar repositories for Mobile-Env

Users that are interested in Mobile-Env are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aburns4 / MoTIF
View on GitHub
Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments
☆61Aug 19, 2024Updated last year
OpenDFM / Rememberer
View on GitHub
[NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents
☆40May 2, 2024Updated 2 years ago
google-research-datasets / seq2act
View on GitHub
This repository contains the opensource version of the datasets were used for different parts of training and testing of models that grou…
☆35Aug 20, 2020Updated 5 years ago
LlamaTouch / LlamaTouch
View on GitHub
LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation
☆70Aug 9, 2024Updated last year
OpenGVLab / GUI-Odyssey
View on GitHub
[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…
☆159Jan 3, 2026Updated 6 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
LlamaTouch / AgentEnv
View on GitHub
An environment for mobile angets to interact with realistic android device or android emulator
☆13Jul 19, 2024Updated 2 years ago
chuyg1005 / seeclick-crawler
View on GitHub
☆20Apr 24, 2024Updated 2 years ago
X-LANCE / META-GUI-baseline
View on GitHub
[EMNLP 2022] The baseline code for META-GUI dataset
☆16Jul 9, 2024Updated 2 years ago
cooelf / Auto-GUI
View on GitHub
Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)
☆261Jul 16, 2024Updated 2 years ago
google-research / android_world
View on GitHub
AndroidWorld is an environment and benchmark for autonomous agents
☆833Jul 16, 2026Updated last week
google-deepmind / android_env
View on GitHub
RL research on Android devices.
☆1,234Updated this week
TheDuckAI / DuckTrack
View on GitHub
Multimodal computer agent data collection program
☆174Updated this week
LZhengisme / self-infilling
View on GitHub
[ICML 2024] Self-Infilling Code Generation
☆18May 5, 2024Updated 2 years ago
AndroidArenaAgent / AndroidArena
View on GitHub
☆47Apr 11, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
asappresearch / webagents-step
View on GitHub
☆41Jul 21, 2024Updated 2 years ago
DigiRL-agent / digiq
View on GitHub
☆121Apr 8, 2025Updated last year
YuxiangChai / A3
View on GitHub
☆35Jan 12, 2026Updated 6 months ago
MobileLLM / AutoDroid
View on GitHub
Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"
☆480Mar 22, 2024Updated 2 years ago
MobileLLM / DroidBot-GPT
View on GitHub
Automating Android apps with ChatGPT-like LLM.
☆157Jan 17, 2024Updated 2 years ago
Berkeley-NLP / Agent-Eval-Refine
View on GitHub
Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]
☆149Nov 26, 2024Updated last year
Dongping-Chen / GUI-World
View on GitHub
(ICLR 2025) The Official Code Repository for GUI-World.
☆69Dec 18, 2024Updated last year
3B-Group / ConvRe
View on GitHub
🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)
☆24Oct 10, 2023Updated 2 years ago
RUCBM / GUICourse
View on GitHub
GUICourse: From General Vision Langauge Models to Versatile GUI Agents
☆143Mar 1, 2026Updated 4 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
google-deepmind / pix2act
View on GitHub
☆60Jul 8, 2026Updated 2 weeks ago
DigiRL-agent / digirl
View on GitHub
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
☆393Feb 22, 2025Updated last year
uvavision / SyViC
View on GitHub
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13Sep 30, 2023Updated 2 years ago
TopSea / SimpleDiffusion
View on GitHub
A Android client of Stable Diffusion.
☆13Mar 29, 2024Updated 2 years ago
princeton-nlp / WebShop
View on GitHub
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
☆573Sep 6, 2024Updated last year
bobjiangps / vision
View on GitHub
UI auto test framework based on YOLO to recognize elements, less code, less maintenance, cross platform, cross project / 基于YOLO的UI层自动化测试框…
☆15Feb 27, 2026Updated 5 months ago
zorazrw / trove
View on GitHub
[ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks
☆33Sep 20, 2024Updated last year
MobileLLM / LLM-Explorer
View on GitHub
☆25Jun 1, 2026Updated last month
microsoft / ConstrainedReasoner
View on GitHub
☆13Aug 26, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
google-research-datasets / rico_semantics
View on GitHub
Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations b…
☆36Jun 27, 2024Updated 2 years ago
lbaermann / qaego4d
View on GitHub
Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"
☆31Aug 28, 2023Updated 2 years ago
X1AOX1A / Word2World
View on GitHub
[ACL 2026 Oral] From Word to World: Can Large Language Models be Implicit Text-based World Models?
☆66Apr 13, 2026Updated 3 months ago
psunlpgroup / VisOnlyQA
View on GitHub
This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…
☆29Jul 9, 2025Updated last year
pyronn / prompt-studio
View on GitHub
Prompt Studio MidJourney提示词可视化编辑与管理工具
☆29Apr 25, 2026Updated 3 months ago
THUDM / VisualAgentBench
View on GitHub
Towards Large Multimodal Models as Visual Foundation Agents
☆274Apr 24, 2025Updated last year
google-research-datasets / screen_qa
View on GitHub
ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K …
☆151Feb 7, 2025Updated last year