InfiXAI/InfiGUIAgent

InfiXAI / InfiGUIAgent

☆74

Alternatives and similar repositories for InfiGUIAgent

Users who are interested in InfiGUIAgent are comparing it to the repositories listed below.

InfiXAI / InfiGUI-R1
Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"
☆67 · Dec 4, 2025 · Updated 7 months ago
YurunChen / HarmonyGuard
Official implementation for “HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Opt…
☆29 · Jan 10, 2026 · Updated 6 months ago
YurunChen / Graph2Eval
[CVPR'26] Official implementation for “Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs”
☆26 · Jan 10, 2026 · Updated 6 months ago
OS-Agent-Survey / OS-Agent-Survey
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).
☆486 · Aug 16, 2025 · Updated 11 months ago
QuZhan51496 / paper2anything
An agent skills pack that turns an academic paper PDF into slides, a poster, a webpage, a Xiaohongshu post, or a WeChat article (paper2sl…
☆277 · Updated this week
penghao-wu / GUI_Reflection
☆34 · Sep 19, 2025 · Updated 10 months ago
Yi-Biao / EcoAgent
EcoAgent: An Efficient Device–Cloud Collaborative Multi-Agent Framework for Mobile Automation (AAAI 2026)
☆20 · Apr 12, 2026 · Updated 3 months ago
ZJULiHongxin / UIPro
Advanced GUI agents
☆16 · Feb 3, 2026 · Updated 5 months ago
IshiKura-a / ModelGPT
☆23 · Mar 18, 2024 · Updated 2 years ago
AriaUI / Aria-UI
Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents
☆406 · Feb 8, 2025 · Updated last year
GAIR-NLP / AIME-Preview
☆84 · Mar 11, 2025 · Updated last year
OSU-NLP-Group / GUI-Agents-Paper-List
Awesome GUI Agent Paper List
☆861 · Jun 28, 2026 · Updated 3 weeks ago
OS-Copilot / OS-Atlas
OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
☆452 · Apr 20, 2025 · Updated last year
hua-zi / FedCFA
[AAAI2025] FedCFA: Alleviating Simpson’s Paradox in Model Aggregation with Counterfactual Federated Learning
☆24 · Jan 23, 2025 · Updated last year
CEMPAplicaciones / MIA
☆14 · Jul 23, 2025 · Updated 11 months ago
OS-Copilot / OS-Genesis
[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
☆188 · Oct 8, 2025 · Updated 9 months ago
OpenGVLab / ZeroGUI
ZeroGUI: Automating Online GUI Learning at Zero Human Cost
☆119 · Jul 17, 2025 · Updated last year
InfiAgent / InfiAgent
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)
☆198 · May 29, 2025 · Updated last year
lll6gg / UI-R1
[AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"
☆158 · Nov 24, 2025 · Updated 7 months ago
video-production-buddy / video-production-buddy
Video Production Buddy - AI Video Production Studio
☆518 · Updated this week
ZJU-CTAG / B4
Code for ASE'24 paper "B4: Towards Optimal Assessment of Plausible Code Solutions with Plausible Tests"
☆11 · Sep 10, 2024 · Updated last year
xlang-ai / aguvis
[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
☆389 · Mar 7, 2025 · Updated last year
aiming-lab / WebHarbor
☆27 · Updated this week
YujieLu10 / CLAP
☆14 · Apr 21, 2023 · Updated 3 years ago
showlab / Awesome-GUI-Agent
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
☆1,197 · Aug 17, 2025 · Updated 11 months ago
YurunChen / repo-docs-skills
Living project docs for coding agents: keep guides, progress logs, change maps, and handoff context updated as your repo evolves.
☆402 · Jul 12, 2026 · Updated last week
rhyang2021 / ARIA
Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".
☆30 · Aug 9, 2025 · Updated 11 months ago
OSU-NLP-Group / WebDreamer
[TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"
☆104 · Oct 5, 2025 · Updated 9 months ago
tml1026 / Lifelong-Personalized-Agent
☆17 · Jul 22, 2025 · Updated 11 months ago
Yu-Qi-hang / ThinkRec
☆34 · Jan 29, 2026 · Updated 5 months ago
shengyuzhang / VideoTitling
Comprehensive Information Integration Modeling Framework for Video Titling
☆11 · Aug 27, 2020 · Updated 5 years ago
zhao-ht / LearnAct
Code for the paper "Empowering Large Language Model Agents through Action Learning"
☆34 · Aug 8, 2024 · Updated last year
ZZZhr-1 / Robust_GUI_Grounding
On the Robustness of GUI Grounding Models Against Image Attacks
☆12 · Apr 8, 2025 · Updated last year
tsinghua-fib-lab / SmartAgent
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
☆27 · Aug 20, 2025 · Updated 11 months ago
showlab / MovieSeq
[ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences
☆46 · Mar 11, 2025 · Updated last year
OSU-NLP-Group / EIA_against_webagent
☆40 · Oct 2, 2024 · Updated last year
showlab / ShowUI
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
☆1,881 · Apr 24, 2026 · Updated 2 months ago
vyokky / LLM-Brained-GUI-Agents-Survey
GitHub page for "Large Language Model-Brained GUI Agents: A Survey"
☆230 · Jun 23, 2025 · Updated last year
google-research / android_world
AndroidWorld is an environment and benchmark for autonomous agents
☆826 · Updated this week