Reallm-Labs / InfiGUIAgentLinks

☆63

Alternatives and similar repositories for InfiGUIAgent

Users that are interested in InfiGUIAgent are comparing it to the libraries listed below

Sorting:

OSU-NLP-Group / UGround
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
☆262Updated last month
Dongping-Chen / GUI-World
(ICLR 2025) The Official Code Repository for GUI-World.
☆61Updated 6 months ago
OpenGVLab / ZeroGUI
ZeroGUI: Automating Online GUI Learning at Zero Human Cost
☆75Updated last week
OS-Copilot / OS-Genesis
[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
☆147Updated this week
JiuTian-VL / Optimus-1
[NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
☆78Updated last month
facebookresearch / sweet_rl
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
☆223Updated 2 months ago
ByteDance-Seed / Agent-R
Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"
☆150Updated last month
THUDM / VisualAgentBench
Towards Large Multimodal Models as Visual Foundation Agents
☆221Updated 2 months ago
vsubramaniam851 / multiagent-ft
☆210Updated 4 months ago
satori-reasoning / Satori
[ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
☆103Updated last month
aialt / awesome-mobile-agents
✨✨Latest Papers and Datasets on Mobile and PC GUI Agent
☆129Updated 7 months ago
xlang-ai / OSWorld-G
Scaling Computer-Use Grounding via UI Decomposition and Synthesis
☆85Updated 3 weeks ago
OSU-NLP-Group / WebDreamer
"Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"
☆78Updated 3 months ago
open-compass / GTA
[NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents
☆112Updated 3 months ago
ltzheng / agent-studio
[ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents
☆212Updated last month
OS-Copilot / OS-Atlas
OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
☆356Updated 2 months ago
siyuyuan / evoagent
Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"
☆115Updated 8 months ago
Berkeley-NLP / Agent-Eval-Refine
Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]
☆138Updated 7 months ago
kyle8581 / Web-Shepherd
Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"
☆37Updated last month
RUCBM / GUICourse
GUICourse: From General Vision Langauge Models to Versatile GUI Agents
☆119Updated last year
FoundationAgents / AFlow
🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.
☆168Updated this week
sunblaze-ucb / Intuitor
Code for the paper: "Learning to Reason without External Rewards"
☆319Updated last week
THU-KEG / Agentic-Reward-Modeling
[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆96Updated last month
WeiminXiong / MPO
MPO: Boosting LLM Agents with Meta Plan Optimization
☆62Updated 4 months ago
sail-sg / FlowReasoner
☆126Updated 2 months ago
zjunlp / WorfBench
[ICLR 2025] Benchmarking Agentic Workflow Generation
☆106Updated 4 months ago
RAGEN-AI / VAGEN
☆186Updated this week
GAIR-NLP / PC-Agent-E
Efficient Agent Training for Computer Use
☆114Updated last month
yihedeng9 / OpenVLThinker
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
☆93Updated last week
neulab / MultiUI
Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding
☆52Updated 7 months ago