ZJU-ACES-ISE / ChatUITestLinks
Under construction
☆11Updated 5 months ago
Alternatives and similar repositories for ChatUITest
Users that are interested in ChatUITest are comparing it to the libraries listed below
Sorting:
- VisionDroid☆15Updated last year
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆89Updated 8 months ago
- Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations b…☆28Updated 11 months ago
- ☆13Updated last year
- GUI Odyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUI Odyssey consists of 7,735 episodes fr…☆116Updated 7 months ago
- Owl Eyes: Spotting UI Display Issues via Visual Understanding☆11Updated 4 years ago
- ☆19Updated 8 months ago
- A Lightweight Visual Reasoning Benchmark for Evaluating Large Multimodal Models through Complex Diagrams in Coding Tasks☆12Updated 4 months ago
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation☆37Updated 2 weeks ago
- ☆30Updated 8 months ago
- More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆21Updated 3 weeks ago
- ☆29Updated 8 months ago
- (ICLR 2025) The Official Code Repository for GUI-World.☆60Updated 6 months ago
- Unblind Your Apps: Predicting Natural-Language Labels for Mobile GUI Components by Deep Learning☆48Updated last year
- A Universal Platform for Training and Evaluation of Mobile Interaction☆47Updated 4 months ago
- A mobile GUI search engine using a vision-language model☆12Updated last month
- Official code repo for the paper "LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark"☆27Updated last month
- CVPR25☆22Updated 3 months ago
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆28Updated 10 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆70Updated 4 months ago
- ☆35Updated last year
- Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"☆45Updated last month
- the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimo…☆19Updated 2 months ago
- ☆32Updated 3 weeks ago
- A Self-Training Framework for Vision-Language Reasoning☆80Updated 5 months ago
- ☆12Updated 10 months ago
- ☆21Updated 2 months ago
- GitHub page for "Large Language Model-Brained GUI Agents: A Survey"☆167Updated this week
- ☆19Updated last month
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆33Updated 6 months ago