alibaba/UI-Ins

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alibaba/UI-Ins)

alibaba / UI-Ins

Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning

☆77

Alternatives and similar repositories for UI-Ins

Users that are interested in UI-Ins are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YXB-NKU / SE-GUI
View on GitHub
[NeurIPS 2025]"Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"
☆108Oct 21, 2025Updated 9 months ago
JIA-Lab-research / ARPO
View on GitHub
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆162May 29, 2025Updated last year
penghao-wu / GUI_Reflection
View on GitHub
☆34Sep 19, 2025Updated 10 months ago
uivision / UI-Vision
View on GitHub
☆33Jul 3, 2025Updated last year
OSU-NLP-Group / Explorer
View on GitHub
[ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
☆29Feb 17, 2026Updated 5 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Yan98 / GTA1
View on GitHub
☆130Oct 3, 2025Updated 9 months ago
runamu / monday
View on GitHub
[CVPR 2025] Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents
☆33Jun 3, 2025Updated last year
bytedance / GUI-ReWalk
View on GitHub
The official code for "GUI-ReWalk: Massive Data Generation for GUI Agent via Stochastic Exploration and Intent-Aware Reasoning"
☆40May 1, 2026Updated 2 months ago
OpenGVLab / ZeroGUI
View on GitHub
ZeroGUI: Automating Online GUI Learning at Zero Human Cost
☆119Jul 17, 2025Updated last year
tiangeluo / RegionFocus
View on GitHub
A simple visual test-time scaling method for GUI agent grounding
☆26Dec 7, 2025Updated 7 months ago
InfiXAI / InfiGUI-R1
View on GitHub
Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"
☆67Dec 4, 2025Updated 7 months ago
JiuTian-VL / SimpAgent
View on GitHub
[ICCV 2025 Highlight] Less is More: Empowering GUI Agent with Context-Aware Simplification
☆48Mar 12, 2026Updated 4 months ago
microsoft / GUI-Actor
View on GitHub
[NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
☆410Apr 13, 2026Updated 3 months ago
Tongyi-MAI / MobileWorld
View on GitHub
Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments (ACL 2026)
☆242Jul 2, 2026Updated 3 weeks ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
microsoft / FIVE-UI-Evol
View on GitHub
☆31Apr 15, 2026Updated 3 months ago
OSU-NLP-Group / GUI-Agents-Paper-List
View on GitHub
Awesome GUI Agent Paper List
☆862Jun 28, 2026Updated 3 weeks ago
Han1018 / ZonUI-3B
View on GitHub
[WACV 2026] ZonUI-3B — A lightweight, resolution-aware GUI grounding model trained with only 24K samples on a single RTX 4090.
☆26Jan 2, 2026Updated 6 months ago
likaixin2000 / ScreenSpot-Pro-GUI-Grounding
View on GitHub
GUI Grounding for Professional High-Resolution Computer Use
☆383Jun 17, 2026Updated last month
flyfox666 / MAI-UI-WebUI
View on GitHub
WebUI for MAI-UI: Real-World Centric Foundation GUI Agents.
☆26Jan 5, 2026Updated 6 months ago
meituan / EvoCUA
View on GitHub
EvoCUA: Evolving Computer Use Agent
☆332Mar 31, 2026Updated 3 months ago
ZJUSCL / MVP
View on GitHub
Multi-View prediction enhances GUI Grounding
☆21Feb 22, 2026Updated 5 months ago
StarWalkin / UI-NEXUS
View on GitHub
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14Jul 27, 2025Updated 11 months ago
cyysky / MAI-UI-Navigation-Agent
View on GitHub
MAI UI Navigation Agent
☆16Dec 29, 2025Updated 6 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
MadeAgents / mobile-use
View on GitHub
MobileUse: an open-source mobile GUI agent for Android phone automation, AndroidWorld/AndroidLab evaluation, hierarchical reflection, and…
☆167Jul 10, 2026Updated 2 weeks ago
Euphoria16 / UI-Genie
View on GitHub
[NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents
☆60Nov 27, 2025Updated 7 months ago
showlab / macosworld
View on GitHub
☆35Jan 28, 2026Updated 5 months ago
MagicAgent-GUI / MagicGUI
View on GitHub
☆80Sep 3, 2025Updated 10 months ago
MadeAgents / ColorBench
View on GitHub
[WWW'26 Oral] ColorBench: a graph-structured benchmark for complex, long-horizon tasks in mobile GUI agents.
☆15Apr 13, 2026Updated 3 months ago
google-research / android_world
View on GitHub
AndroidWorld is an environment and benchmark for autonomous agents
☆830Jul 16, 2026Updated last week
choidami / inductive-oocr
View on GitHub
☆16Mar 22, 2025Updated last year
WebChoreArena / WebChoreArena
View on GitHub
COLM2026
☆36Jul 9, 2026Updated 2 weeks ago
jcottaar / seismic
View on GitHub
Jeroen Cottaar's work for the Kaggle Geophysical Waveform Inversion competition (2nd place)
☆13Aug 11, 2025Updated 11 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
UITron-hub / UItron
View on GitHub
☆67Sep 6, 2025Updated 10 months ago
ZJU-REAL / Awesome-GUI-Agents
View on GitHub
A curated collection of resources, tools, and frameworks for developing GUI Agents.
☆446Jul 9, 2026Updated 2 weeks ago
microsoft / AgentAsJudge
View on GitHub
An agentic evaluation framework
☆20Feb 11, 2026Updated 5 months ago
showlab / Awesome-GUI-Agent
View on GitHub
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
☆1,198Aug 17, 2025Updated 11 months ago
showlab / ShowUI
View on GitHub
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
☆1,886Apr 24, 2026Updated 3 months ago
conghui1002 / DG-UCDIR
View on GitHub
☆13Oct 4, 2023Updated 2 years ago
ash-neupane / multi-token-pred
View on GitHub
Train toy models using multi-token prediction objective
☆14Apr 18, 2026Updated 3 months ago