AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.
☆535 · Mar 17, 2026 · Updated this week
Alternatives and similar repositories for AgentLab
Users who are interested in AgentLab are comparing it to the libraries listed below.
- 🌎💪 BrowserGym, a Gym environment for web task automation ☆1,162 · Updated this week
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks? ☆238 · Feb 23, 2026 · Updated 3 weeks ago
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle ☆303 · Dec 16, 2025 · Updated 3 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents" ☆1,388 · Nov 26, 2025 · Updated 3 months ago
- VisualWebArena is a benchmark for multimodal agents. ☆445 · Nov 9, 2024 · Updated last year
- Interaction-first method for generating demonstrations for web-agents on any website ☆54 · Apr 29, 2025 · Updated 10 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆513 · Jun 6, 2025 · Updated 9 months ago
- An agent benchmark with tasks in a simulated software company. ☆658 · Nov 17, 2025 · Updated 4 months ago
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments ☆2,667 · Updated this week
- An Illusion of Progress? Assessing the Current State of Web Agents ☆156 · Jan 2, 2026 · Updated 2 months ago
- Towards Large Multimodal Models as Visual Foundation Agents ☆258 · Apr 24, 2025 · Updated 10 months ago
- Helping AI practitioners better understand their datasets and models in text classification. From ServiceNow. ☆72 · Dec 23, 2024 · Updated last year
- WebLINX is a benchmark for building web navigation agents with conversational capabilities ☆160 · Feb 11, 2025 · Updated last year
- [TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents" ☆96 · Oct 5, 2025 · Updated 5 months ago
- GUI Grounding for Professional High-Resolution Computer Use ☆347 · Mar 4, 2026 · Updated 2 weeks ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w… ☆961 · Nov 5, 2025 · Updated 4 months ago
- Building a comprehensive and handy list of papers for GUI agents ☆649 · Oct 27, 2025 · Updated 4 months ago
- SafeArena is a benchmark for assessing the harmful capabilities of web agents ☆21 · Apr 23, 2025 · Updated 10 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis ☆181 · Oct 8, 2025 · Updated 5 months ago
- AWM: Agent Workflow Memory ☆408 · Dec 22, 2025 · Updated 2 months ago
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents ☆27 · Feb 17, 2026 · Updated last month
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates. ☆380 · Mar 10, 2026 · Updated last week
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction ☆383 · Mar 7, 2025 · Updated last year
- ☆27 · Jun 5, 2025 · Updated 9 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents ☆56 · Jul 11, 2025 · Updated 8 months ago
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials ☆53 · Feb 21, 2025 · Updated last year
- Setup scripts for the WebArena benchmark ☆20 · Jun 19, 2025 · Updated 9 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents ☆302 · Mar 11, 2026 · Updated last week
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control ☆68 · Jan 7, 2026 · Updated 2 months ago
- AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource… ☆388 · Feb 17, 2026 · Updated last month
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning. ☆392 · Feb 22, 2025 · Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆70 · Dec 9, 2024 · Updated last year
- SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis. ☆115 · Apr 14, 2025 · Updated 11 months ago
- All-in-one Web Agent framework for post-training. Start building with a few clicks! ☆276 · Jul 7, 2025 · Updated 8 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT) ☆20 · Oct 28, 2025 · Updated 4 months ago
- Web-grounded natural language instructions ☆18 · Nov 25, 2024 · Updated last year
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research ☆296 · Updated this week
- Pioneering Automated GUI Interaction with Native Agents ☆9,875 · Jan 27, 2026 · Updated last month
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆587 · Aug 10, 2025 · Updated 7 months ago