[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
β2,930Jun 10, 2026Updated this week
Alternatives and similar repositories for OSWorld
Users that are interested in OSWorld are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Windows Agent Arena (WAA) πͺ is a scalable OS platform for testing and benchmarking of multi-modal AI agents.β867Apr 13, 2026Updated 2 months ago
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interactionβ392Mar 7, 2025Updated last year
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"β1,512Nov 26, 2025Updated 6 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agentsβ452Apr 20, 2025Updated last year
- VisualWebArena is a benchmark for multimodal agents.β479Nov 9, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesisβ167Nov 6, 2025Updated 7 months ago
- AndroidWorld is an environment and benchmark for autonomous agentsβ794Updated this week
- The model, data and code for the visual GUI Agent SeeClickβ483Jul 13, 2025Updated 11 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agentsβ317Mar 11, 2026Updated 3 months ago
- Pioneering Automated GUI Interaction with Native Agentsβ10,929Jan 27, 2026Updated 4 months ago
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wildβ4,838Nov 18, 2024Updated last year
- ππͺ BrowserGym, a Gym environment for web task automationβ1,244Mar 17, 2026Updated 2 months ago
- [ICLR 2026] Computer Agent Arena: Toward Human-Centric Evaluation and Analysis of Computer-Use Agentsβ64Feb 26, 2026Updated 3 months ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multβ¦β848Feb 3, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.β563Apr 15, 2026Updated 2 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist wβ¦β1,000Nov 5, 2025Updated 7 months ago
- Awesome GUI Agent Paper Listβ818Jun 5, 2026Updated last week
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.β1,776Sep 9, 2024Updated last year
- π» A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.β1,193Aug 17, 2025Updated 9 months ago
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)β103Oct 14, 2024Updated last year
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.β394Feb 22, 2025Updated last year
- GUI Grounding for Professional High-Resolution Computer Useβ377Apr 14, 2026Updated 2 months ago
- Agent S: an open agentic framework that uses computers like a humanβ11,844May 13, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecβ¦β19,496Updated this week
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reβ¦β586Mar 17, 2026Updated 2 months ago
- Out-of-the-box (OOTB) GUI Agent for Windows and macOSβ1,945May 21, 2025Updated last year
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-userβ¦β1,341Feb 13, 2025Updated last year
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.β1,852Apr 24, 2026Updated last month
- β41Jul 21, 2024Updated last year
- GUICourse: From General Vision Langauge Models to Versatile GUI Agentsβ142Mar 1, 2026Updated 3 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesisβ187Oct 8, 2025Updated 8 months ago
- Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"β1,095Mar 4, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- AIOS: AI Agent Operating Systemβ5,874May 8, 2026Updated last month
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)β3,492Feb 8, 2026Updated 4 months ago
- Towards Large Multimodal Models as Visual Foundation Agentsβ267Apr 24, 2025Updated last year
- Multimodal computer agent data collection programβ171Dec 5, 2025Updated 6 months ago
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-beβ¦β3,081Apr 24, 2025Updated last year
- Platform for stateful agents: AI with advanced memory that can learn and self-improve over time.β23,318May 14, 2026Updated last month
- Mobile-Agent: The Powerful GUI Agent Familyβ8,811May 14, 2026Updated last month