Jl-wei / guingLinks
A mobile GUI search engine using a vision-language model
☆14Updated 9 months ago
Alternatives and similar repositories for guing
Users that are interested in guing are comparing it to the libraries listed below
Sorting:
- ☆34Updated last year
- GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents☆19Updated 2 weeks ago
- Building a comprehensive and handy list of papers for GUI agents☆628Updated 3 months ago
- [ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…☆147Updated last month
- ☆23Updated last year
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation☆57Updated 6 months ago
- An Illusion of Progress? Assessing the Current State of Web Agents☆143Updated last month
- VisualWebArena is a benchmark for multimodal agents.☆431Updated last year
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆255Updated last year
- The model, data and code for the visual GUI Agent SeeClick☆461Updated 6 months ago
- Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments☆61Updated last year
- GitHub page for "Large Language Model-Brained GUI Agents: A Survey"☆217Updated 7 months ago
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆136Updated last year
- ☆174Updated 3 months ago
- Code and data for the paper: Competing Large Language Models in Multi-Agent Gaming Environments☆95Updated 2 weeks ago
- The awesome agents in the era of large language models☆71Updated 2 years ago
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆258Updated 2 years ago
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆98Updated last year
- Code for ACL 2024 paper: PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models.☆16Updated last year
- Evaluating Safety of Autonomous Agents in Mobile Device Control (AAAI 2026 AI Alignment Track)☆32Updated 2 weeks ago
- This repository contains the opensource version of the datasets were used for different parts of training and testing of models that grou…☆33Updated 5 years ago
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆67Updated last year
- ☆25Updated 2 years ago
- A Universal Platform for Training and Evaluation of Mobile Interaction☆60Updated 4 months ago
- Codebase for LLM Textual Hallucination Benchmark☆73Updated 9 months ago
- BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).☆175Updated 2 years ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆68Updated 6 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆133Updated 10 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆299Updated 6 months ago
- ☆38Updated last year