ServiceNow / BrowserGym
BrowserGym, a Gym environment for web task automation
★719 · Updated this week
Alternatives and similar repositories for BrowserGym
Users who are interested in BrowserGym are comparing it to the libraries listed below.
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re… · ★322 · Updated this week
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks? · ★182 · Updated last week
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents" · ★985 · Updated 3 months ago
- Windows Agent Arena (WAA) is a scalable OS platform for testing and benchmarking of multi-modal AI agents. · ★691 · Updated last week
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle · ★267 · Updated last week
- VisualWebArena is a benchmark for multimodal agents. · ★336 · Updated 6 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] · ★455 · Updated this week
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects. · ★358 · Updated 3 weeks ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult… · ★742 · Updated 3 months ago
- AWM: Agent Workflow Memory · ★270 · Updated 3 months ago
- Code for "WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models" · ★769 · Updated last year
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering · ★696 · Updated last week
- Code and Data for Tau-Bench · ★485 · Updated 3 months ago
- xLAM: A Family of Large Action Models to Empower AI Agent Systems · ★427 · Updated this week
- ★595 · Updated 3 months ago
- AIDE: AI-Driven Exploration in the Space of Code. A state-of-the-art machine learning engineering agent that automates AI R&D. · ★896 · Updated 3 weeks ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" · ★820 · Updated last month
- Code for the paper Tree Search for Language Model Agents · ★198 · Updated 9 months ago
- [ICLR 2025] Automated Design of Agentic Systems · ★1,286 · Updated 3 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL · ★373 · Updated 2 weeks ago
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e… · ★462 · Updated 2 months ago
- An intuitive LLM prompting framework for multifunctional agents, by explicitly constructing a complex "thought process" from simple natur… · ★447 · Updated 4 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents · ★332 · Updated 3 weeks ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" · ★518 · Updated last month
- Agentless: an agentless approach to automatically solve software development problems · ★1,663 · Updated 4 months ago
- Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap… · ★191 · Updated this week
- The First Coding Agent-as-a-Judge · ★484 · Updated this week
- An agent benchmark with tasks in a simulated software company. · ★350 · Updated this week
- ★406 · Updated 2 weeks ago
- Atom of Thoughts for Markov LLM Test-Time Scaling · ★562 · Updated last week