ππͺ BrowserGym, a Gym environment for web task automation
β1,162Mar 17, 2026Updated this week
Alternatives and similar repositories for BrowserGym
Users that are interested in BrowserGym are comparing it to the libraries listed below
Sorting:
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reβ¦β535Updated this week
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?β238Feb 23, 2026Updated 3 weeks ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"β1,388Nov 26, 2025Updated 3 months ago
- VisualWebArena is a benchmark for multimodal agents.β445Nov 9, 2024Updated last year
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycleβ303Dec 16, 2025Updated 3 months ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilitiesβ160Feb 11, 2025Updated last year
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RLβ513Jun 6, 2025Updated 9 months ago
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environmentsβ2,667Mar 13, 2026Updated last week
- An Illusion of Progress? Assessing the Current State of Web Agentsβ156Jan 2, 2026Updated 2 months ago
- AWM: Agent Workflow Memoryβ408Dec 22, 2025Updated 2 months ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.β380Mar 10, 2026Updated last week
- An agent benchmark with tasks in a simulated software company.β658Nov 17, 2025Updated 4 months ago
- β41Jul 21, 2024Updated last year
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist wβ¦β961Nov 5, 2025Updated 4 months ago
- Code for the paper π³ Tree Search for Language Model Agentsβ220Jul 25, 2024Updated last year
- Building a comprehensive and handy list of papers for GUI agentsβ649Oct 27, 2025Updated 4 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]β650Jul 29, 2025Updated 7 months ago
- Interaction-first method for generating demonstrations for web-agents on any websiteβ54Apr 29, 2025Updated 10 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agentsβ440Apr 20, 2025Updated 11 months ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multβ¦β834Feb 3, 2025Updated last year
- All-in-one Web Agent framework for post-training. Start building with a few clicks!β276Jul 7, 2025Updated 8 months ago
- Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"β1,050Mar 4, 2024Updated 2 years ago
- β68Mar 6, 2025Updated last year
- Towards Large Multimodal Models as Visual Foundation Agentsβ258Apr 24, 2025Updated 10 months ago
- Helping AI practitioners better understand their datasets and models in text classification. From ServiceNow.β72Dec 23, 2024Updated last year
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interactionβ383Mar 7, 2025Updated last year
- An LLM-based Web Navigating Agent (KDD'24)β930Sep 27, 2024Updated last year
- Windows Agent Arena (WAA) πͺ is a scalable OS platform for testing and benchmarking of multi-modal AI agents.β833Feb 11, 2026Updated last month
- β18Jan 3, 2025Updated last year
- π» A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.β1,142Aug 17, 2025Updated 7 months ago
- Multimodal computer agent data collection programβ165Dec 5, 2025Updated 3 months ago
- [NeurIPS 2022] πWebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agentsβ507Sep 6, 2024Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"β70Dec 9, 2024Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]β148Nov 26, 2024Updated last year
- Agentlessπ±: an agentless approach to automatically solve software development problemsβ2,019Dec 22, 2024Updated last year
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.β392Feb 22, 2025Updated last year
- MiniWoB++: a web interaction benchmark for reinforcement learningβ375May 5, 2025Updated 10 months ago
- Challenges for general-purpose web-browsing AI agentsβ67Jun 2, 2025Updated 9 months ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)β3,238Feb 8, 2026Updated last month