xlang-ai / OSWorld
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
☆1,651Updated this week
Alternatives and similar repositories for OSWorld:
Users who are interested in OSWorld are comparing it to the libraries listed below:
- A self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,621Updated 5 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,389Updated 2 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,520Updated 2 months ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆718Updated last month
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,618Updated 7 months ago
- [ICLR 2025] Agent S: an open agentic framework that uses computers like a human☆821Updated this week
- A project-structure-aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% of tasks (pass@1) in SWE-be…☆2,847Updated this week
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆883Updated 3 weeks ago
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,636Updated 9 months ago
- [ICLR 2025] Automated Design of Agentic Systems☆1,200Updated last month
- AIOS: AI Agent Operating System☆3,867Updated this week
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆620Updated 9 months ago
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user…☆1,288Updated 2 weeks ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,666Updated 6 months ago
- Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs)…☆1,170Updated last week
- Code for "WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"☆639Updated last year
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 …☆835Updated 7 months ago
- Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api☆1,038Updated last month
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆730Updated 7 months ago
- AIDE: AI-Driven Exploration in the Space of Code. State-of-the-art machine learning engineering agents that automate AI R&D.☆762Updated this week
- A curated list of awesome LLM agents frameworks.☆770Updated this week
- Out-of-the-box (OOTB) GUI Agent for Windows and macOS☆1,333Updated this week
- ☆1,338Updated 3 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"☆785Updated 7 months ago
- SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?☆2,555Updated this week
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,208Updated 8 months ago
- A library for advanced large language model reasoning☆2,000Updated last week
- AutoGroq is a groundbreaking tool that revolutionizes the way users interact with Autogen™ and other AI assistants. By dynamically genera…☆1,416Updated 2 months ago
- 🌎💪 BrowserGym, a Gym environment for web task automation☆549Updated last month
- Common interface for interacting with AI agents. The protocol is tech stack agnostic - you can use it with any framework for building age…☆1,110Updated last month