ZJU-REAL / Awesome-GUI-AgentsLinks

A curated collection of resources, tools, and frameworks for developing GUI Agents.

☆212

Alternatives and similar repositories for Awesome-GUI-Agents

Users that are interested in Awesome-GUI-Agents are comparing it to the libraries listed below

Sorting:

lwpyh / Awesome-MLLM-Reasoning-Collection
A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.
☆359Updated last week
ZJU-REAL / GUI-G2
[AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding
☆246Updated last month
OS-Agent-Survey / OS-Agent-Survey
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).
☆373Updated 4 months ago
SunzeY / SEAgent
Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"
☆215Updated 4 months ago
ritzz-ai / GUI-R1
Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents
☆207Updated 7 months ago
lll6gg / UI-R1
[AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"
☆142Updated last month
ZrrSkywalker / MAVIS
[ICLR 2025] Mathematical Visual Instruction Tuning for Multi-modal Large Language Models
☆152Updated last year
cmriat / l0
A scalable, end-to-end training pipeline for general-purpose agents
☆362Updated 5 months ago
dle666 / R-CoT
Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models
☆181Updated last year
Hui-design / TSPO
[AAAI 2026] ✨ TSPO: Temporal Sampling Policy Optimization for Long-form Video Language Understanding
☆108Updated last month
ZrrSkywalker / MathVerse
[ECCV 2024] Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
☆175Updated 7 months ago
RLHFlow / Self-rewarding-reasoning-LLM
Recipes to train the self-rewarding reasoning LLMs.
☆229Updated 9 months ago
OPPO-PersonalAI / TaskCraft
A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.
☆174Updated 5 months ago
yfzhang114 / r1_reward
✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
☆272Updated 7 months ago
zhangbaijin / From-Redundancy-to-Relevance
[NAACL 2025 Oral] 🎉 From redundancy to relevance: Enhancing explainability in multimodal large language models
☆128Updated 10 months ago
uw-nsl / TinyV
Your efficient and accurate answer verification system for RL training.
☆43Updated 6 months ago
syr-cn / AutoRefine
[NeurIPS 2025 Poster] Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning
☆113Updated last week
ZLKong / Awesome-Collection-Token-Reduction
A collection of token reduction (token pruning, merging, clustering, etc.) techniques for ML/AI
☆267Updated this week
dongyh20 / Chain-of-Spot
Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models
☆100Updated last year
RLHFlow / Minimal-RL
☆254Updated 7 months ago
CR400AF-A / SparseMM
[ICCV 2025] SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
☆79Updated 2 months ago
xiaoachen98 / Open-LLaVA-NeXT
An open-source implementation for training LLaVA-NeXT.
☆428Updated last year
Osilly / Awesome-Interleaving-Reasoning
Interleaving Reasoning: Next-Generation Reasoning Systems for AGI
☆220Updated 2 months ago
dvlab-research / VisionThink
[NeurIPS 2025] Efficient Reasoning Vision Language Models
☆436Updated 3 months ago
VITA-MLLM / Long-VITA
✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy
☆307Updated 7 months ago
ZJU-REAL / Mind-the-Gap
[NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684
☆45Updated 2 months ago
InfiXAI / InfiGUI-R1
Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"
☆61Updated 3 weeks ago
RLHFlow / Online-DPO-R1
Codebase for Iterative DPO Using Rule-based Rewards
☆263Updated 8 months ago
tsinghua-fib-lab / AgentSquare
The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space""
☆211Updated last month
HUST-AI-HYZ / MemoryAgentBench
Open source code for Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions
☆183Updated 3 weeks ago