penghao-wu/GUI_Reflection

penghao-wu / GUI_Reflection

☆34

Alternatives and similar repositories for GUI_Reflection

Users who are interested in GUI_Reflection are comparing it to the libraries listed below.

showlab / macosworld
☆35 · Jan 28, 2026 · Updated 5 months ago
UITron-hub / UITron-Speech
☆21 · Jan 22, 2026 · Updated 5 months ago
penghao-wu / ProxyV
[ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM
☆20 · May 22, 2025 · Updated last year
InfiXAI / InfiGUI-R1
Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"
☆67 · Dec 4, 2025 · Updated 7 months ago
VisualSphinx / VisualSphinx
☆17 · Jun 3, 2025 · Updated last year
TongUI-agent / TongUI-agent
[AAAI 2026] Release of code, datasets and model for our work TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for General…
☆114 · Dec 1, 2025 · Updated 7 months ago
Euphoria16 / UI-Genie
[NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents
☆60 · Nov 27, 2025 · Updated 7 months ago
ModalMinds / gym-v
A unified framework for vision-language environments with Gymnasium-compatible interface
☆35 · Mar 17, 2026 · Updated 4 months ago
WebChoreArena / WebChoreArena
COLM2026
☆36 · Jul 9, 2026 · Updated last week
tongjingqi / Game-RL
Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning
☆156 · Updated this week
alibaba / UI-Ins
Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning
☆77 · Apr 20, 2026 · Updated 3 months ago
LINs-lab / LIE
[preprint] Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning
☆19 · Feb 18, 2026 · Updated 5 months ago
THUDM / MobileRL
☆93 · Dec 23, 2025 · Updated 6 months ago
njucckevin / OpenMobile-Code
The model, data and code for OpenMobile
☆49 · Jul 9, 2026 · Updated last week
OpenGVLab / ZeroGUI
ZeroGUI: Automating Online GUI Learning at Zero Human Cost
☆119 · Jul 17, 2025 · Updated last year
BigTaige / MP-GUI
CVPR25
☆28 · Jul 2, 2025 · Updated last year
ZJU-REAL / GUI-RCPO
[AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615
☆67 · Nov 8, 2025 · Updated 8 months ago
alexmartin1722 / wikivideo
WikiVideo: Article Generation from Multiple Videos
☆15 · Nov 14, 2025 · Updated 8 months ago
SalesforceAIResearch / strefer
Strefer: Empowering Video LLMs with Space-Time Referring and Reasoning via Synthetic Instruction Data
☆19 · Jun 2, 2026 · Updated last month
Jiahao004 / DeepTheorem
☆26 · Jun 10, 2025 · Updated last year
schelterlabs / deml-lab
Lab tasks for the course on "Data Engineering for Machine Learning"
☆10 · May 1, 2023 · Updated 3 years ago
open-compass / MMBench-GUI
Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…
☆112 · Sep 8, 2025 · Updated 10 months ago
StarWalkin / UI-NEXUS
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14 · Jul 27, 2025 · Updated 11 months ago
penghao-wu / visual_jigsaw
☆78 · Apr 9, 2026 · Updated 3 months ago
InfiXAI / InfiGUIAgent
☆74 · May 23, 2025 · Updated last year
IMYangJinheng / DeepCFD-for-Prediction-of-flow-field-in-Laval-nozzle
This is a U-Net-based deep learning model, which we call DeepCFD. You can use this model to predict the temperature, velocity, and pressu…
☆15 · Sep 30, 2025 · Updated 9 months ago
tiangeluo / RegionFocus
A simple visual test-time scaling method for GUI agent grounding
☆26 · Dec 7, 2025 · Updated 7 months ago
Han1018 / ZonUI-3B
[WACV 2026] ZonUI-3B: A lightweight, resolution-aware GUI grounding model trained with only 24K samples on a single RTX 4090.
☆26 · Jan 2, 2026 · Updated 6 months ago
MadeAgents / mobile-use
MobileUse: an open-source mobile GUI agent for Android phone automation, AndroidWorld/AndroidLab evaluation, hierarchical reflection, and…
☆167 · Jul 10, 2026 · Updated last week
nosna / miragenews
☆16 · May 14, 2025 · Updated last year
HowieHwong / ProbeLLM
ProbeLLM: Automating Principled Diagnosis of LLM Failures
☆17 · Feb 11, 2026 · Updated 5 months ago
showlab / FocusUI
[CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection
☆35 · Jun 7, 2026 · Updated last month
ArtificialZeng / llama3_explained
The newest version of llama3, with source code explained line by line in Chinese
☆22 · Apr 19, 2024 · Updated 2 years ago
JiazhengZhang / AgentV-RL
☆15 · Apr 17, 2026 · Updated 3 months ago
Ropedia / S-Agent
S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence
☆71 · Jun 26, 2026 · Updated 3 weeks ago
foreverlasting1202 / QuestA
☆22 · Jan 2, 2026 · Updated 6 months ago
jefferyZhan / GThinker
[CVPR 2026] GThinker, Reasoning MLLM, Visual Cues, Visual Rethinking
☆18 · Mar 9, 2026 · Updated 4 months ago
KANABOON1 / LatentMem
LatentMem: Customizing Latent Memory for Multi-Agent Systems
☆48 · Feb 9, 2026 · Updated 5 months ago
TeamPigeonLab / CS-DJ
Accepted by CVPR 2025 (highlight)
☆25 · Jun 8, 2025 · Updated last year