UITron-hub/UItron

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UITron-hub/UItron)

UITron-hub / UItron

☆67

Alternatives and similar repositories for UItron

Users that are interested in UItron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

UITron-hub / UITron-Speech
View on GitHub
☆21Jan 22, 2026Updated 5 months ago
DocTron-hub / Chart-R1
View on GitHub
Chart-R1: Chain-of-Thought Supervision and Reinforcement for Advanced Chart Reasoner
☆24Aug 7, 2025Updated 11 months ago
Euphoria16 / UI-Genie
View on GitHub
[NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents
☆60Nov 27, 2025Updated 7 months ago
aburns4 / textualforesight
View on GitHub
☆12Aug 8, 2024Updated last year
DocTron-hub / OCRVerse
View on GitHub
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models
☆30Feb 4, 2026Updated 5 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
DocTron-hub / FD-RL
View on GitHub
[CVPR 2026] Reading or Reasoning? Format Decoupled Reinforcement Learning for Document OCR
☆18Mar 23, 2026Updated 3 months ago
xiaomi-research / guievalkit
View on GitHub
[ICML 2026] GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents
☆23Feb 26, 2026Updated 4 months ago
xbmxb / EnvDistraction
View on GitHub
☆24Oct 11, 2024Updated last year
open-compass / MMBench-GUI
View on GitHub
Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…
☆112Sep 8, 2025Updated 10 months ago
Yuqi-Zhou / GUI-G1
View on GitHub
☆28Sep 15, 2025Updated 10 months ago
StarWalkin / UI-NEXUS
View on GitHub
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14Jul 27, 2025Updated 11 months ago
JIA-Lab-research / ARPO
View on GitHub
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆162May 29, 2025Updated last year
InfiXAI / InfiGUI-R1
View on GitHub
Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"
☆67Dec 4, 2025Updated 7 months ago
tiangeluo / RegionFocus
View on GitHub
A simple visual test-time scaling method for GUI agent grounding
☆26Dec 7, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OS-Copilot / OS-Symphony
View on GitHub
[ACL 2026 Main] Official repository for paper: OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agents
☆47Apr 7, 2026Updated 3 months ago
ZJU-REAL / Awesome-GUI-Agents
View on GitHub
A curated collection of resources, tools, and frameworks for developing GUI Agents.
☆445Jul 9, 2026Updated last week
McGill-NLP / agent-reward-bench
View on GitHub
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
☆47Aug 7, 2025Updated 11 months ago
OpenGVLab / ZeroGUI
View on GitHub
ZeroGUI: Automating Online GUI Learning at Zero Human Cost
☆119Jul 17, 2025Updated last year
longrongyang / STGC
View on GitHub
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
☆13Feb 11, 2025Updated last year
OSU-NLP-Group / GUI-Agents-Paper-List
View on GitHub
Awesome GUI Agent Paper List
☆862Jun 28, 2026Updated 3 weeks ago
iLearn-Lab / CVPR26-HiconAgent
View on GitHub
[CVPR 2026] HiconAgent: History Context-aware Policy Optimization for GUI Agents
☆31Mar 9, 2026Updated 4 months ago
volcengine / vePhone
View on GitHub
vePhone sample code (for Android, iOS, and Web/H5)
☆22Jul 1, 2026Updated 2 weeks ago
IMNearth / CoAT
View on GitHub
Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)
☆103Oct 14, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
WadeYin9712 / UI-Simulator
View on GitHub
Code for 🌍 UI-Simulator: LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training
☆21Oct 17, 2025Updated 9 months ago
hithqd / DynamicControl
View on GitHub
☆41Jan 10, 2025Updated last year
microsoft / GUI-Actor
View on GitHub
[NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
☆410Apr 13, 2026Updated 3 months ago
Tongyi-MAI / MobileWorld
View on GitHub
Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments (ACL 2026)
☆240Jul 2, 2026Updated 2 weeks ago
JackLingjie / VisCodex
View on GitHub
Official Implementation for the paper "VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models"
☆23Aug 14, 2025Updated 11 months ago
ZJULiHongxin / UIPro
View on GitHub
Advanced GUI agents
☆16Feb 3, 2026Updated 5 months ago
njucckevin / SeeClick
View on GitHub
The model, data and code for the visual GUI Agent SeeClick
☆492Jul 13, 2025Updated last year
showlab / macosworld
View on GitHub
☆35Jan 28, 2026Updated 5 months ago
ZJU-ACES-ISE / ChatUITest
View on GitHub
Under construction
☆13Jan 15, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
SEU-VIPGroup / Understanding_Vision_Tasks
View on GitHub
☆13Feb 2, 2025Updated last year
showlab / Awesome-GUI-Agent
View on GitHub
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
☆1,197Aug 17, 2025Updated 11 months ago
XiaoMi / mobilevlm
View on GitHub
MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding
☆78Feb 27, 2025Updated last year
chengyou-jia / AgentStore
View on GitHub
[ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant
☆46Dec 19, 2024Updated last year
wwfnb / Laser
View on GitHub
☆16Sep 16, 2025Updated 10 months ago
iLearn-Lab / ACL26-PersonalAlign
View on GitHub
[ACL 2026 main] PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records
☆21Apr 11, 2026Updated 3 months ago
cyysky / MAI-UI-Navigation-Agent
View on GitHub
MAI UI Navigation Agent
☆16Dec 29, 2025Updated 6 months ago