[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
β2,817Apr 25, 2026Updated last week
Alternatives and similar repositories for OSWorld
Users that are interested in OSWorld are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Windows Agent Arena (WAA) πͺ is a scalable OS platform for testing and benchmarking of multi-modal AI agents.β857Apr 13, 2026Updated 3 weeks ago
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interactionβ389Mar 7, 2025Updated last year
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"β1,449Nov 26, 2025Updated 5 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agentsβ444Apr 20, 2025Updated last year
- VisualWebArena is a benchmark for multimodal agents.β466Nov 9, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesisβ163Nov 6, 2025Updated 6 months ago
- AndroidWorld is an environment and benchmark for autonomous agentsβ750Apr 9, 2026Updated 3 weeks ago
- The model, data and code for the visual GUI Agent SeeClickβ478Jul 13, 2025Updated 9 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agentsβ312Mar 11, 2026Updated last month
- Pioneering Automated GUI Interaction with Native Agentsβ10,164Jan 27, 2026Updated 3 months ago
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wildβ4,785Nov 18, 2024Updated last year
- ππͺ BrowserGym, a Gym environment for web task automationβ1,215Mar 17, 2026Updated last month
- [ICLR 2026] Computer Agent Arena: Toward Human-Centric Evaluation and Analysis of Computer-Use Agentsβ60Feb 26, 2026Updated 2 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist wβ¦β985Nov 5, 2025Updated 6 months ago
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multβ¦β845Feb 3, 2025Updated last year
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.β549Apr 15, 2026Updated 3 weeks ago
- Building a comprehensive and handy list of papers for GUI agentsβ747Apr 25, 2026Updated last week
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.β1,766Sep 9, 2024Updated last year
- π» A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.β1,175Aug 17, 2025Updated 8 months ago
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)β102Oct 14, 2024Updated last year
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.β395Feb 22, 2025Updated last year
- GUI Grounding for Professional High-Resolution Computer Useβ367Apr 14, 2026Updated 3 weeks ago
- Agent S: an open agentic framework that uses computers like a humanβ11,011Feb 21, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecβ¦β19,101Apr 27, 2026Updated last week
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reβ¦β575Mar 17, 2026Updated last month
- Out-of-the-box (OOTB) GUI Agent for Windows and macOSβ1,935May 21, 2025Updated 11 months ago
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-userβ¦β1,336Feb 13, 2025Updated last year
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.β1,821Apr 24, 2026Updated last week
- β41Jul 21, 2024Updated last year
- GUICourse: From General Vision Langauge Models to Versatile GUI Agentsβ142Mar 1, 2026Updated 2 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesisβ187Oct 8, 2025Updated 6 months ago
- AIOS: AI Agent Operating Systemβ5,596Apr 23, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)β3,377Feb 8, 2026Updated 2 months ago
- Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"β1,078Mar 4, 2024Updated 2 years ago
- Towards Large Multimodal Models as Visual Foundation Agentsβ265Apr 24, 2025Updated last year
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.β22,391Apr 12, 2026Updated 3 weeks ago
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-beβ¦β3,071Apr 24, 2025Updated last year
- Mobile-Agent: The Powerful GUI Agent Familyβ8,615Apr 14, 2026Updated 3 weeks ago
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Frameworkβ21,046Updated this week