X-LANCE / META-GUI-baselineLinks
[EMNLP 2022] The baseline code for META-GUI dataset
☆14Updated last year
Alternatives and similar repositories for META-GUI-baseline
Users that are interested in META-GUI-baseline are comparing it to the libraries listed below
Sorting:
- A Universal Platform for Training and Evaluation of Mobile Interaction☆55Updated last week
- ☆20Updated last year
- Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments☆61Updated last year
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆128Updated last year
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆60Updated 8 months ago
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆250Updated last year
- A curated list of the papers, repositories, tutorials, and anythings related to the large language models for tools☆68Updated 2 years ago
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain☆104Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆143Updated 9 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆81Updated last year
- ☆36Updated last year
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆149Updated 10 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆237Updated 4 months ago
- This repository contains the opensource version of the datasets were used for different parts of training and testing of models that grou…☆32Updated 5 years ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆105Updated 6 months ago
- Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW…☆129Updated 2 years ago
- [ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…☆128Updated last month
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆58Updated last year
- Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics,…☆124Updated 3 months ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆63Updated last month
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated 8 months ago
- [NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs☆46Updated 9 months ago
- Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"☆56Updated last year
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆69Updated 2 years ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆122Updated 3 months ago
- (ICLR 2025) The Official Code Repository for GUI-World.☆65Updated 9 months ago
- [ACL 2024] The project of Symbol-LLM☆57Updated last year
- Self-Alignment with Principle-Following Reward Models☆165Updated 4 months ago
- An Illusion of Progress? Assessing the Current State of Web Agents☆85Updated last month
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆148Updated 9 months ago