uivision/UI-Vision

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/uivision/UI-Vision)

uivision / UI-Vision

☆33

Alternatives and similar repositories for UI-Vision

Users that are interested in UI-Vision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ServiceNow / GroundCUA
View on GitHub
GroundCUA
☆129Mar 24, 2026Updated 3 months ago
MM-FIRE / FIRE
View on GitHub
☆13Nov 5, 2024Updated last year
OSU-NLP-Group / Explorer
View on GitHub
[ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
☆29Feb 17, 2026Updated 5 months ago
tianyu-z / VCR
View on GitHub
Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.
☆32Feb 26, 2025Updated last year
TongUI-agent / TongUI-agent
View on GitHub
[AAAI 2026]Release of code, datasets and model for our work TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for General…
☆114Dec 1, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
code-philia / GUIPilot
View on GitHub
GUIPilot: A Consistency-based Mobile GUI Testing Approach for Detecting Application-specific Bugs
☆15Apr 22, 2026Updated 2 months ago
JiuTian-VL / SimpAgent
View on GitHub
[ICCV 2025 Highlight] Less is More: Empowering GUI Agent with Context-Aware Simplification
☆48Mar 12, 2026Updated 4 months ago
StarWalkin / UI-NEXUS
View on GitHub
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14Jul 27, 2025Updated 11 months ago
niuzaisheng / ScreenExplorer
View on GitHub
ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World
☆26Jun 17, 2025Updated last year
Dongping-Chen / GUI-World
View on GitHub
(ICLR 2025) The Official Code Repository for GUI-World.
☆69Dec 18, 2024Updated last year
bin123apple / InfantAgent
View on GitHub
[NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.
☆39Apr 23, 2026Updated 2 months ago
google-research / android_world
View on GitHub
AndroidWorld is an environment and benchmark for autonomous agents
☆828Updated this week
likaixin2000 / ScreenSpot-Pro-GUI-Grounding
View on GitHub
GUI Grounding for Professional High-Resolution Computer Use
☆383Jun 17, 2026Updated last month
xlang-ai / CUA-Gym-Hub
View on GitHub
CUA-Gym-Hub: mock web apps as reproducible RL training environments for computer-use agents
☆65Jul 9, 2026Updated last week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
open-compass / MMBench-GUI
View on GitHub
Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…
☆112Sep 8, 2025Updated 10 months ago
Zsbyqx20 / AgentHazard
View on GitHub
Mobile GUI Agents under Real-world Threats: Are We There Yet?
☆17May 18, 2026Updated 2 months ago
cyysky / MAI-UI-Navigation-Agent
View on GitHub
MAI UI Navigation Agent
☆16Dec 29, 2025Updated 6 months ago
GAI-Community / GraphOmni
View on GitHub
Enable Comprehensive LLM Evaluation on Graph Reasoning
☆79Jun 12, 2025Updated last year
penghao-wu / GUI_Reflection
View on GitHub
☆34Sep 19, 2025Updated 10 months ago
showlab / videogui
View on GitHub
[NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos
☆53Feb 22, 2026Updated 4 months ago
VeriGUI-Team / VeriWeb
View on GitHub
VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking
☆88Jan 21, 2026Updated 6 months ago
The-AI-Alliance / cube-standard
View on GitHub
Standardize benchmark wrapping so the community can wrap various otherwise-incompatible benchmarks uniformly and use them everywhere.
☆52Updated this week
OSU-NLP-Group / UGround
View on GitHub
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
☆315Mar 11, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
RUCBM / GUICourse
View on GitHub
GUICourse: From General Vision Langauge Models to Versatile GUI Agents
☆143Mar 1, 2026Updated 4 months ago
OSU-NLP-Group / GUI-Agents-Paper-List
View on GitHub
Awesome GUI Agent Paper List
☆862Jun 28, 2026Updated 3 weeks ago
agentsea / osuniverse
View on GitHub
Benchmark of complex, multimodal desktop-oriented tasks for advanced GUI-navigation AI agents
☆24May 7, 2025Updated last year
lizhh268 / ShadowMaskFormer
View on GitHub
[TAI 2025] Official implementation of TAI-accepted paper: ShadowMaskFormer: Mask Augmented Patch Embedding for Shadow Removal
☆15May 8, 2025Updated last year
HazyResearch / wonderbread
View on GitHub
WONDERBREAD benchmark + dataset for BPM tasks
☆35Jul 30, 2025Updated 11 months ago
Zhiyuan-Zeng / EvalTree
View on GitHub
[COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees
☆31Jul 11, 2025Updated last year
Yan98 / GTA1
View on GitHub
☆130Oct 3, 2025Updated 9 months ago
tanjimin / grad-cam-pytorch-light
View on GitHub
A customizable lightweight Grad-CAM implementation
☆16Nov 30, 2019Updated 6 years ago
NUS-HPC-AI-Lab / Multimodal-ICL-Retriever
View on GitHub
☆10Nov 12, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Sueqk / LMM-VQA
View on GitHub
LMM for VQA, tcsvt version
☆10Jul 19, 2024Updated 2 years ago
ZwEin27 / Sparql-Query-Parser
View on GitHub
an utility to parse sparql query into json format
☆11Nov 22, 2016Updated 9 years ago
xuehao / SimpleCxxLib
View on GitHub
A simple C++ library for introductory CS. Forked from StanfordCPPLib, originally used in Stanford CS106B.
☆14Sep 2, 2024Updated last year
OpenGVLab / GUI-Odyssey
View on GitHub
[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…
☆159Jan 3, 2026Updated 6 months ago
Louise-LuLin / GCL-SPAN
View on GitHub
Code for the paper "Spectrum Guided Topology Augmentation for Graph Contrastive Learning"
☆11Jul 18, 2023Updated 3 years ago
JIA-Lab-research / ARPO
View on GitHub
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆162May 29, 2025Updated last year
lzy7976 / union-set-model-adaptation
View on GitHub
Union-set Multi-source Model Adaptation for Semantic Segmentation
☆12Oct 24, 2022Updated 3 years ago