AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.
⭐572 · Mar 17, 2026 · Updated last month
Alternatives and similar repositories for AgentLab
Users who are interested in AgentLab are comparing it to the libraries listed below.
- 🌎💪 BrowserGym, a Gym environment for web task automation (⭐1,209 · Mar 17, 2026 · Updated last month)
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks? (⭐245 · Feb 23, 2026 · Updated 2 months ago)
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle (⭐312 · Dec 16, 2025 · Updated 4 months ago)
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents" (⭐1,443 · Nov 26, 2025 · Updated 5 months ago)
- VisualWebArena is a benchmark for multimodal agents. (⭐463 · Nov 9, 2024 · Updated last year)
- Interaction-first method for generating demonstrations for web-agents on any website (⭐55 · Apr 29, 2025 · Updated last year)
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL (⭐518 · Jun 6, 2025 · Updated 10 months ago)
- An agent benchmark with tasks in a simulated software company. (⭐690 · Nov 17, 2025 · Updated 5 months ago)
- An Illusion of Progress? Assessing the Current State of Web Agents (⭐171 · Jan 2, 2026 · Updated 3 months ago)
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments (⭐2,807 · Apr 17, 2026 · Updated last week)
- DoomArena is a Framework for Testing AI Agents Against Evolving Security Threats (⭐57 · Sep 12, 2025 · Updated 7 months ago)
- Towards Large Multimodal Models as Visual Foundation Agents (⭐265 · Apr 24, 2025 · Updated last year)
- Helping AI practitioners better understand their datasets and models in text classification. From ServiceNow. (⭐72 · Dec 23, 2024 · Updated last year)
- WebLINX is a benchmark for building web navigation agents with conversational capabilities (⭐160 · Feb 11, 2025 · Updated last year)
- Building a comprehensive and handy list of papers for GUI agents (⭐734 · Apr 17, 2026 · Updated last week)
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w… (⭐984 · Nov 5, 2025 · Updated 5 months ago)
- [TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents" (⭐98 · Oct 5, 2025 · Updated 6 months ago)
- GUI Grounding for Professional High-Resolution Computer Use (⭐365 · Apr 14, 2026 · Updated 2 weeks ago)
- SafeArena is a benchmark for assessing the harmful capabilities of web agents (⭐22 · Apr 23, 2025 · Updated last year)
- AWM: Agent Workflow Memory (⭐425 · Dec 22, 2025 · Updated 4 months ago)
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis (⭐188 · Oct 8, 2025 · Updated 6 months ago)
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates. (⭐403 · Updated this week)
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents (⭐28 · Feb 17, 2026 · Updated 2 months ago)
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction (⭐388 · Mar 7, 2025 · Updated last year)
- A verified version of the WebArena Benchmark (⭐36 · Mar 8, 2026 · Updated last month)
- Official Repo for InSTA: Towards Internet-Scale Training For Agents (⭐56 · Jul 11, 2025 · Updated 9 months ago)
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials (⭐54 · Feb 21, 2025 · Updated last year)
- Setup scripts for the WebArena benchmark (⭐21 · Jun 19, 2025 · Updated 10 months ago)
- (⭐28 · Jun 5, 2025 · Updated 10 months ago)
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning. (⭐394 · Feb 22, 2025 · Updated last year)
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents (⭐310 · Mar 11, 2026 · Updated last month)
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control (⭐68 · Jan 7, 2026 · Updated 3 months ago)
- AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource… (⭐407 · Feb 17, 2026 · Updated 2 months ago)
- SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis. (⭐119 · Apr 14, 2025 · Updated last year)
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" (⭐70 · Dec 9, 2024 · Updated last year)
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT) (⭐21 · Oct 28, 2025 · Updated 6 months ago)
- Web-grounded natural language instructions (⭐18 · Nov 25, 2024 · Updated last year)
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents (⭐595 · Aug 10, 2025 · Updated 8 months ago)
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research (⭐308 · Updated this week)