LlamaTouch / AgentEnv
An environment for mobile agents to interact with a realistic Android device or Android emulator
☆11Updated 9 months ago
Alternatives and similar repositories for AgentEnv:
Users who are interested in AgentEnv are comparing it to the libraries listed below:
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆57Updated 8 months ago
- A Universal Platform for Training and Evaluation of Mobile Interaction☆44Updated 2 months ago
- ☆29Updated 7 months ago
- Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation)☆31Updated 4 months ago
- A Stream-based LLM Agent Framework for Continuous Context Sensing and Sharing☆36Updated 5 months ago
- ☆40Updated last year
- Langchain Agent finetuning using 7B LLAMA 2, on hotpotQA (Retroformer framework)☆15Updated last year
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation☆30Updated 2 weeks ago
- VisionTasker introduces a novel two-stage framework combining vision-based UI understanding and LLM task planning for mobile task automat…☆69Updated 2 months ago
- Awesome LLM papers, news and projects about learning to reason with LLM, OpenAI o1, reasoning techniques, chain-of-thought (COT), Large …☆26Updated 6 months ago
- AAAI24(Oral) ProAgent: Building Proactive Cooperative Agents with Large Language Models☆81Updated 2 months ago
- ☆34Updated 10 months ago
- Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"☆350Updated last year
- DroidAgent: Intent-Driven Mobile GUI Testing with Autonomous LLM Agents☆25Updated last year
- (ICLR 2025) The Official Code Repository for GUI-World.☆54Updated 4 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆139Updated 11 months ago
- ☆57Updated 2 months ago
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆233Updated 9 months ago
- ☆17Updated 7 months ago
- GUI Odyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUI Odyssey consists of 7,735 episodes fr…☆109Updated 5 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆209Updated last week
- ☆101Updated 3 weeks ago
- Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning"☆41Updated 5 months ago
- The model, data and code for the visual GUI Agent SeeClick☆365Updated 5 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (e.g. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆68Updated last year
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆34Updated last year
- Zero-Shot Chain-of-Thought Reasoning Guided by Evolutionary Algorithms in Large Language Models☆13Updated last year
- ☆17Updated last year
- A curated list of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)☆194Updated last month
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆136Updated last year