[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
β2,866May 21, 2026Updated this week
Alternatives and similar repositories for OSWorld
Users that are interested in OSWorld are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Windows Agent Arena (WAA) πͺ is a scalable OS platform for testing and benchmarking of multi-modal AI agents.β860Apr 13, 2026Updated last month
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interactionβ391Mar 7, 2025Updated last year
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agentsβ445Apr 20, 2025Updated last year
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"β1,479Nov 26, 2025Updated 6 months ago
- VisualWebArena is a benchmark for multimodal agents.β476Nov 9, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesisβ166Nov 6, 2025Updated 6 months ago
- AndroidWorld is an environment and benchmark for autonomous agentsβ774Apr 9, 2026Updated last month
- The model, data and code for the visual GUI Agent SeeClickβ482Jul 13, 2025Updated 10 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agentsβ315Mar 11, 2026Updated 2 months ago
- Pioneering Automated GUI Interaction with Native Agentsβ10,678Jan 27, 2026Updated 4 months ago
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wildβ4,829Nov 18, 2024Updated last year
- ππͺ BrowserGym, a Gym environment for web task automationβ1,224Mar 17, 2026Updated 2 months ago
- [ICLR 2026] Computer Agent Arena: Toward Human-Centric Evaluation and Analysis of Computer-Use Agentsβ60Feb 26, 2026Updated 3 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist wβ¦β991Nov 5, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multβ¦β846Feb 3, 2025Updated last year
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.β554Apr 15, 2026Updated last month
- Awesome GUI Agent Paper Listβ785May 12, 2026Updated 2 weeks ago
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.β1,772Sep 9, 2024Updated last year
- π» A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.β1,191Aug 17, 2025Updated 9 months ago
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)β102Oct 14, 2024Updated last year
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.β395Feb 22, 2025Updated last year
- GUI Grounding for Professional High-Resolution Computer Useβ372Apr 14, 2026Updated last month
- Agent S: an open agentic framework that uses computers like a humanβ11,570May 13, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecβ¦β19,258May 18, 2026Updated last week
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reβ¦β582Mar 17, 2026Updated 2 months ago
- Out-of-the-box (OOTB) GUI Agent for Windows and macOSβ1,942May 21, 2025Updated last year
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-userβ¦β1,340Feb 13, 2025Updated last year
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.β1,843Apr 24, 2026Updated last month
- β41Jul 21, 2024Updated last year
- GUICourse: From General Vision Langauge Models to Versatile GUI Agentsβ141Mar 1, 2026Updated 2 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesisβ186Oct 8, 2025Updated 7 months ago
- AIOS: AI Agent Operating Systemβ5,752May 8, 2026Updated 2 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)β3,444Feb 8, 2026Updated 3 months ago
- Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"β1,089Mar 4, 2024Updated 2 years ago
- Towards Large Multimodal Models as Visual Foundation Agentsβ266Apr 24, 2025Updated last year
- Multimodal computer agent data collection programβ169Dec 5, 2025Updated 5 months ago
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-beβ¦β3,075Apr 24, 2025Updated last year
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.β22,903May 14, 2026Updated last week
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Frameworkβ21,514Updated this week