X-LANCE / Mobile-Env
A Universal Platform for Training and Evaluation of Mobile Interaction
☆31Updated last month
Related projects: ⓘ
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆48Updated 3 weeks ago
- Towards Large Multimodal Models as Visual Foundation Agents☆87Updated 3 weeks ago
- ☆11Updated 4 months ago
- ☆76Updated last month
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆86Updated 3 months ago
- ☆102Updated 2 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents☆81Updated last week
- Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments☆55Updated last month
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆115Updated 5 months ago
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆177Updated 2 months ago
- [ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"☆113Updated 8 months ago
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆68Updated 2 months ago
- ☆37Updated 9 months ago
- AndroidWorld is an environment and benchmark for autonomous agents☆86Updated this week
- The Official Code Repository for GUI-World.☆33Updated last month
- Official Repo of LangSuitE☆74Updated last month
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆82Updated last year
- Paper collections of the continuous effort start from World Models.☆127Updated 2 months ago
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…☆69Updated last month
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"☆209Updated 3 weeks ago
- Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation)☆23Updated last month
- ☆25Updated 3 months ago
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos☆54Updated 9 months ago
- The model, data and code for the visual GUI Agent SeeClick☆182Updated 3 weeks ago
- Android in the Zoo: Chain-of-Action-Thought for GUI Agents☆32Updated 2 months ago
- ☆49Updated 8 months ago
- This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Co…☆66Updated 2 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆55Updated last week
- GUI Odyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUI Odyssey consists of 7,735 episodes fr…☆57Updated 2 months ago
- ☆131Updated 4 months ago