xlang-ai/OSWorld-G

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xlang-ai/OSWorld-G)

xlang-ai / OSWorld-G

[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis

☆172

Alternatives and similar repositories for OSWorld-G

Users that are interested in OSWorld-G are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Yan98 / GTA1
View on GitHub
☆130Oct 3, 2025Updated 9 months ago
xlang-ai / AgentTrek
View on GitHub
[ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials
☆60Feb 21, 2025Updated last year
xlang-ai / OpenCUA
View on GitHub
[NeurIPS 2025 Spotlight] OpenCUA: Open Foundations for Computer-Use Agents
☆806May 25, 2026Updated 2 months ago
showlab / macosworld
View on GitHub
☆35Jan 28, 2026Updated 6 months ago
xlang-ai / OSWorld
View on GitHub
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
☆3,045Updated this week
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
xlang-ai / computer-agent-arena
View on GitHub
[ICLR 2026] Computer Agent Arena: Toward Human-Centric Evaluation and Analysis of Computer-Use Agents
☆67Feb 26, 2026Updated 5 months ago
xlang-ai / CUA-Gym
View on GitHub
Scalable pipeline for synthesizing verifiable RLVR training data for computer-use agents
☆180Updated this week
xlang-ai / OSWorld-V2
View on GitHub
OSWorld 2.0: Benchmarking Computer Use Agents on Long-Horizon Real-World Tasks
☆213Updated this week
xlang-ai / VideoAgentTrek
View on GitHub
The official repo of VideoAgentTrek
☆58Oct 24, 2025Updated 9 months ago
likaixin2000 / ScreenSpot-Pro-GUI-Grounding
View on GitHub
GUI Grounding for Professional High-Resolution Computer Use
☆383Jun 17, 2026Updated last month
xlang-ai / aguvis
View on GitHub
[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
☆389Mar 7, 2025Updated last year
JIA-Lab-research / ARPO
View on GitHub
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆162May 29, 2025Updated last year
StarWalkin / UI-NEXUS
View on GitHub
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14Jul 27, 2025Updated last year
ServiceNow / GroundCUA
View on GitHub
GroundCUA
☆132Mar 24, 2026Updated 4 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
kiaia / GIRAFFE
View on GitHub
Extending context length of visual language models
☆12Dec 18, 2024Updated last year
OS-Copilot / OS-Atlas
View on GitHub
OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
☆452Apr 20, 2025Updated last year
OSU-NLP-Group / GUI-Agents-Paper-List
View on GitHub
Awesome GUI Agent Paper List
☆866Jun 28, 2026Updated last month
microsoft / GUI-Actor
View on GitHub
[NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
☆410Apr 13, 2026Updated 3 months ago
Timothyxxx / KVCachePapers
View on GitHub
☆20May 24, 2024Updated 2 years ago
uivision / UI-Vision
View on GitHub
☆33Jul 3, 2025Updated last year
OSU-NLP-Group / ACuRL
View on GitHub
An Autonomous Curriculum Reinforcement Learning framework that steers agents to continually learn in specific environments with zero huma…
☆38Jun 7, 2026Updated last month
xlang-ai / CUA-Gym-Hub
View on GitHub
CUA-Gym-Hub: mock web apps as reproducible RL training environments for computer-use agents
☆69Updated this week
meituan / EvoCUA
View on GitHub
EvoCUA: Evolving Computer Use Agent
☆334Mar 31, 2026Updated 3 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
HKUNLP / critic-rl
View on GitHub
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆127May 6, 2025Updated last year
McGill-NLP / agent-reward-bench
View on GitHub
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
☆48Aug 7, 2025Updated 11 months ago
ritzz-ai / GUI-R1
View on GitHub
Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents
☆252May 5, 2025Updated last year
OS-Copilot / OS-Genesis
View on GitHub
[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
☆189Oct 8, 2025Updated 9 months ago
YXB-NKU / SE-GUI
View on GitHub
[NeurIPS 2025]"Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"
☆108Oct 21, 2025Updated 9 months ago
OS-Copilot / OS-Symphony
View on GitHub
[ACL 2026 Main] Official repository for paper: OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agents
☆48Apr 7, 2026Updated 3 months ago
google-research / android_world
View on GitHub
AndroidWorld is an environment and benchmark for autonomous agents
☆835Jul 16, 2026Updated last week
TongUI-agent / TongUI-agent
View on GitHub
[AAAI 2026]Release of code, datasets and model for our work TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for General…
☆114Dec 1, 2025Updated 7 months ago
web-arena-x / visualwebarena
View on GitHub
VisualWebArena is a benchmark for multimodal agents.
☆484Nov 9, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Timothyxxx / TestTimeTrainingPapers
View on GitHub
☆59Apr 13, 2026Updated 3 months ago
OpenGVLab / ZeroGUI
View on GitHub
ZeroGUI: Automating Online GUI Learning at Zero Human Cost
☆119Jul 17, 2025Updated last year
GAIR-NLP / PC-Agent-E
View on GitHub
[ICLR 2026] Efficient Agent Training for Computer Use
☆146Sep 5, 2025Updated 10 months ago
Tongyi-MAI / MobileWorld
View on GitHub
Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments (ACL 2026)
☆245Jul 2, 2026Updated 3 weeks ago
VisualWebBench / VisualWebBench
View on GitHub
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
☆68Oct 19, 2024Updated last year
OS-Copilot / ScienceBoard
View on GitHub
[ICLR 2026] Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"
☆132Feb 2, 2026Updated 5 months ago
ZJULiHongxin / UIPro
View on GitHub
Advanced GUI agents
☆16Feb 3, 2026Updated 5 months ago