jylee425 / b-moca
Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation)
☆29Updated last month
Alternatives and similar repositories for b-moca:
Users who are interested in b-moca are comparing it to the repositories listed below
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆115Updated 2 months ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆127Updated 9 months ago
- ☆46Updated last month
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆54Updated 2 weeks ago
- ☆25Updated 9 months ago
- NeurIPS 2024 tutorial on LLM Inference☆38Updated last month
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos☆60Updated last year
- Towards Large Multimodal Models as Visual Foundation Agents☆169Updated last month
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆55Updated 2 weeks ago
- A Universal Platform for Training and Evaluation of Mobile Interaction☆41Updated last month
- ☆76Updated 6 months ago
- AndroidWorld is an environment and benchmark for autonomous agents☆194Updated last week
- PreAct: Prediction Enhances Agent's Planning Ability (COLING 2025)☆25Updated last month
- ☆88Updated last week
- Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning"☆30Updated 2 months ago
- ☆123Updated 6 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 4 months ago
- Natural Language Reinforcement Learning☆69Updated last month
- Official code for the paper: WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents☆27Updated 2 months ago
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆34Updated 11 months ago
- ☆94Updated 7 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆114Updated 3 months ago
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.☆298Updated 2 months ago
- ☆27Updated 4 months ago
- The Official Code Repository for GUI-World.☆46Updated last month
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆125Updated 10 months ago
- ☆140Updated 8 months ago
- ☆44Updated last year
- Dataset Reset Policy Optimization☆28Updated 9 months ago
- ☆48Updated last month