ServiceNow / BrowserGym
🌎💪 BrowserGym, a Gym environment for web task automation
☆401Updated last week
Alternatives and similar repositories for BrowserGym:
Users that are interested in BrowserGym are comparing it to the libraries listed below
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?☆138Updated last week
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…☆148Updated this week
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆788Updated last week
- AWM: Agent Workflow Memory☆218Updated 3 weeks ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆665Updated last month
- ☆290Updated last month
- VisualWebArena is a benchmark for multimodal agents.☆258Updated last month
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle☆163Updated this week
- Code and Data for Tau-Bench☆230Updated last week
- This is a collection of resources for computer-use agents, including videos, blogs, papers, and projects.☆135Updated last month
- Agentless🐱: an agentless approach to automatically solve software development problems☆891Updated last week
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆330Updated 6 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆452Updated 8 months ago
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆515Updated 6 months ago
- ☆329Updated 3 weeks ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆255Updated this week
- Code for the paper 🌳 Tree Search for Language Model Agents☆142Updated 4 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆200Updated 3 weeks ago
- Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.☆512Updated 3 weeks ago
- Agent S: an open agentic framework that uses computers like a human☆689Updated this week
- Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"☆441Updated 9 months ago
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆288Updated 3 months ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆124Updated last week
- An Analytical Evaluation Board of Multi-turn LLM Agents☆260Updated 6 months ago
- An intuitive LLM prompting framework for multifunctional agents, by explicitly constructing a complex "thought process" from simple natur…☆422Updated last month
- Environments, tools, and benchmarks for general computer agents☆184Updated last month
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆126Updated 2 weeks ago
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898☆197Updated 7 months ago
- ☆542Updated 2 months ago
- AndroidWorld is an environment and benchmark for autonomous agents☆151Updated this week