ServiceNow / BrowserGym
ππͺ BrowserGym, a Gym environment for web task automation
β597Updated last week
Alternatives and similar repositories for BrowserGym:
Users that are interested in BrowserGym are comparing it to the libraries listed below
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reβ¦β262Updated this week
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?β167Updated 3 weeks ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"β912Updated last month
- VisualWebArena is a benchmark for multimodal agents.β315Updated 4 months ago
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.β264Updated this week
- [ICLR 2025] Agent S: an open agentic framework that uses computers like a humanβ842Updated this week
- Windows Agent Arena (WAA) πͺ is a scalable OS platform for testing and benchmarking of multi-modal AI agents.β617Updated this week
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycleβ230Updated this week
- Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"β661Updated last year
- AWM: Agent Workflow Memoryβ254Updated last month
- Aguvis: Unified Pure Vision Agents for Autonomous GUI Interactionβ254Updated this week
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multβ¦β727Updated last month
- β374Updated last month
- Agentlessπ±: an agentless approach to automatically solve software development problemsβ1,552Updated 2 months ago
- Code for the paper π³ Tree Search for Language Model Agentsβ184Updated 7 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agentsβ293Updated 3 weeks ago
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhanβ¦β705Updated 9 months ago
- Code and Data for Tau-Benchβ326Updated last month
- An agent benchmark with tasks in a simulated software company.β258Updated this week
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gymβ391Updated this week
- β366Updated last month
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RLβ320Updated last month
- [NeurIPS 2022] πWebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agentsβ309Updated 6 months ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilitiesβ144Updated last month
- π€ Agent-as-a-Judge and DevAI datasetβ342Updated last month
- π¦οΈ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/β293Updated 3 months ago
- β578Updated last month
- π» A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.β547Updated this week
- Building a comprehensive and handy list of papers for GUI agentsβ243Updated 3 weeks ago