[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
β2,689Mar 17, 2026Updated last week
Alternatives and similar repositories for OSWorld
Users that are interested in OSWorld are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interactionβ383Mar 7, 2025Updated last year
- Windows Agent Arena (WAA) πͺ is a scalable OS platform for testing and benchmarking of multi-modal AI agents.β839Feb 11, 2026Updated last month
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agentsβ442Apr 20, 2025Updated 11 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"β1,400Nov 26, 2025Updated 3 months ago
- VisualWebArena is a benchmark for multimodal agents.β450Nov 9, 2024Updated last year
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesisβ157Nov 6, 2025Updated 4 months ago
- AndroidWorld is an environment and benchmark for autonomous agentsβ679Updated this week
- The model, data and code for the visual GUI Agent SeeClickβ475Jul 13, 2025Updated 8 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agentsβ304Mar 11, 2026Updated 2 weeks ago
- Pioneering Automated GUI Interaction with Native Agentsβ9,928Jan 27, 2026Updated last month
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wildβ4,728Nov 18, 2024Updated last year
- [ICLR 2026] Computer Agent Arena: Toward Human-Centric Evaluation and Analysis of Computer-Use Agentsβ58Feb 26, 2026Updated 3 weeks ago
- ππͺ BrowserGym, a Gym environment for web task automationβ1,170Mar 17, 2026Updated last week
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multβ¦β834Feb 3, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.β525Nov 7, 2025Updated 4 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist wβ¦β963Nov 5, 2025Updated 4 months ago
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.β1,759Sep 9, 2024Updated last year
- Building a comprehensive and handy list of papers for GUI agentsβ653Oct 27, 2025Updated 4 months ago
- π» A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.β1,147Aug 17, 2025Updated 7 months ago
- GUI Grounding for Professional High-Resolution Computer Useβ347Mar 4, 2026Updated 3 weeks ago
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.β392Feb 22, 2025Updated last year
- Agent S: an open agentic framework that uses computers like a humanβ10,423Feb 21, 2026Updated last month
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecβ¦β18,796Mar 16, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)β100Oct 14, 2024Updated last year
- Out-of-the-box (OOTB) GUI Agent for Windows and macOSβ1,914May 21, 2025Updated 10 months ago
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-userβ¦β1,331Feb 13, 2025Updated last year
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reβ¦β541Mar 17, 2026Updated last week
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.β1,755Jan 20, 2026Updated 2 months ago
- β41Jul 21, 2024Updated last year
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesisβ182Oct 8, 2025Updated 5 months ago
- GUICourse: From General Vision Langauge Models to Versatile GUI Agentsβ137Mar 1, 2026Updated 3 weeks ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)β3,253Feb 8, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- AIOS: AI Agent Operating Systemβ5,375Jan 22, 2026Updated 2 months ago
- Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"β1,053Mar 4, 2024Updated 2 years ago
- Towards Large Multimodal Models as Visual Foundation Agentsβ259Apr 24, 2025Updated 11 months ago
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.β21,680Mar 16, 2026Updated last week
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-beβ¦β3,062Apr 24, 2025Updated 11 months ago
- Mobile-Agent: The Powerful GUI Agent Familyβ8,301Updated this week
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replayβ153May 29, 2025Updated 9 months ago