MinorJerry / WebVoyagerLinks
Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"
β937Updated last year
Alternatives and similar repositories for WebVoyager
Users that are interested in WebVoyager are comparing it to the libraries listed below
Sorting:
- Windows Agent Arena (WAA) πͺ is a scalable OS platform for testing and benchmarking of multi-modal AI agents.β775Updated 5 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"β1,186Updated 3 weeks ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multβ¦β791Updated 8 months ago
- ππͺ BrowserGym, a Gym environment for web task automationβ941Updated this week
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist wβ¦β886Updated 6 months ago
- Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-apiβ1,183Updated 4 months ago
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.β447Updated 4 months ago
- Agentlessπ±: an agentless approach to automatically solve software development problemsβ1,943Updated 10 months ago
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reβ¦β434Updated this week
- Code and Data for Tau-Benchβ901Updated 2 months ago
- An agent benchmark with tasks in a simulated software company.β570Updated 2 weeks ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agentsβ391Updated 6 months ago
- xLAM: A Family of Large Action Models to Empower AI Agent Systemsβ564Updated 2 months ago
- VisualWebArena is a benchmark for multimodal agents.β392Updated 11 months ago
- [ICLR 2025] Automated Design of Agentic Systemsβ1,438Updated 8 months ago
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhanβ¦β1,408Updated last year
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environmentsβ2,255Updated last week
- E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can connect to any LLM for secure computer use.β1,127Updated last week
- agent q - oss advanced reasoning and learning for autonomous ai agentsβ496Updated last year
- AWM: Agent Workflow Memoryβ335Updated 8 months ago
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interactionβ362Updated 7 months ago
- Open-source resources on agents for computer use.β378Updated 2 weeks ago
- Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs)β¦β1,413Updated 7 months ago
- βοΈ The First Coding Agent-as-a-Judgeβ646Updated 5 months ago
- π¦οΈ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/β376Updated 3 months ago
- Learn how to use CUA (our Computer Using Agent) via the API on multiple computer environments.β1,180Updated 6 months ago
- β634Updated 9 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agentsβ283Updated 3 months ago
- Autonomous Agents (LLMs) research papers. Updated Daily.β1,044Updated this week
- AI computer use powered by open source LLMs and E2B Desktop Sandboxβ1,618Updated 4 months ago