ritzz-ai / GUI-R1

Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents

☆205

Alternatives and similar repositories for GUI-R1

Users who are interested in GUI-R1 are comparing it to the repositories listed below.

open-compass / MMBench-GUI
Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…
☆86 Updated 2 months ago
mat-agent / MAT-Agent
MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)
☆74 Updated 5 months ago
njucckevin / MM-Self-Improve
A Self-Training Framework for Vision-Language Reasoning
☆87 Updated 10 months ago
Osilly / Awesome-Interleaving-Reasoning
Interleaving Reasoning: Next-Generation Reasoning Systems for AGI
☆213 Updated last month
lll6gg / UI-R1
[AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"
☆142 Updated 2 weeks ago
dvlab-research / ARPO
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆138 Updated 6 months ago
Tencent / SelfEvolvingAgent
Research works from Tencent AI Lab regarding self-evolving agents
☆69 Updated 3 months ago
NUS-TRAIL / NoisyRollout
[NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆98 Updated 2 months ago
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
☆168 Updated 6 months ago
InfiXAI / InfiGUI-R1
Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"
☆61 Updated 6 months ago
ADaM-BJTU / Mind_with_eyes_Awesome_MLLMs_Reasoning
This repository continuously collects the latest papers, technical reports, and benchmarks on multimodal reasoning.
☆52 Updated 8 months ago
OpenDCAI / Awesome_MLLMs_Reasoning
☆110 Updated 2 months ago
OpenRLHF / OpenRLHF-M
An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.
☆149 Updated 2 months ago
InfiMM / Awesome-Multimodal-LLM-for-Math-STEM
Paper collections of multi-modal LLM for Math/STEM/Code.
☆130 Updated 2 weeks ago
LightChen233 / M3CoT
☆84 Updated last year
OpenBMB / RLPR
Extrapolating RLVR to General Domains without Verifiers
☆180 Updated 3 months ago
LengSicong / MMR1
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources
☆210 Updated 2 months ago
MME-Benchmarks / MME-CoT
MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency
☆135 Updated 4 months ago
OpenGVLab / GUI-Odyssey
[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…
☆135 Updated 4 months ago
OS-Copilot / OS-Genesis
[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
☆168 Updated last month
XiaoYee / Awesome_Efficient_LRM_Reasoning
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
☆318 Updated last month
ltzheng / SimpleTIR
End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
☆332 Updated 2 months ago
TideDra / VL-RLHF
An RLHF Infrastructure for Vision-Language Models
☆187 Updated last year
IMNearth / CoAT
Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)
☆95 Updated last year
yhy-2000 / VideoDeepResearch
☆123 Updated 3 weeks ago
mll-lab-nu / VAGEN
Training VLM agents with multi-turn reinforcement learning
☆338 Updated last week
EvolvingLMMs-Lab / multimodal-search-r1
MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…
☆359 Updated 3 months ago
DataArcTech / ChartMoE
[ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding
☆92 Updated 8 months ago
TheRoadQaQ / ReLIFT
Official Repository of "Learning what reinforcement learning can't"
☆69 Updated 3 weeks ago
zitian-gao / one-shot-em
One-shot Entropy Minimization
☆187 Updated 5 months ago