[NeurIPS 2022] πWebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
β551Sep 6, 2024Updated last year
Alternatives and similar repositories for WebShop
Users that are interested in WebShop are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist wβ¦β1,000Nov 5, 2025Updated 7 months ago
- ALFWorld: Aligning Text and Embodied Environments for Interactive Learningβ769Feb 8, 2026Updated 4 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"β1,512Nov 26, 2025Updated 6 months ago
- MiniWoB++: a web interaction benchmark for reinforcement learningβ384May 27, 2026Updated 2 weeks ago
- Code for the paper "LASER: LLM Agent with State-Space Exploration for Web Navigation"β35Sep 26, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- VisualWebArena is a benchmark for multimodal agents.β477Nov 9, 2024Updated last year
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)β3,492Feb 8, 2026Updated 4 months ago
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)β260Jul 16, 2024Updated last year
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898β249May 5, 2024Updated 2 years ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilitiesβ161Feb 11, 2025Updated last year
- [ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Modelsβ3,970Feb 6, 2024Updated 2 years ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]β149Nov 26, 2024Updated last year
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.β363Dec 3, 2025Updated 6 months ago
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"β840Jul 30, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]β419May 20, 2024Updated 2 years ago
- A codebase for "Language Models can Solve Computer Tasks"β240May 1, 2024Updated 2 years ago
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.β394Feb 22, 2025Updated last year
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"β206Apr 17, 2025Updated last year
- β121Apr 8, 2025Updated last year
- FireAct: Toward Language Agent Fine-tuningβ294Oct 22, 2023Updated 2 years ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMsβ1,497Oct 31, 2023Updated 2 years ago
- A Universal Platform for Training and Evaluation of Mobile Interactionβ62Sep 24, 2025Updated 8 months ago
- Workflow-Guided Exploration: sample-efficient RL agent for web tasksβ118Jun 5, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiβ¦β801May 30, 2026Updated 2 weeks ago
- GUICourse: From General Vision Langauge Models to Versatile GUI Agentsβ141Mar 1, 2026Updated 3 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agentsβ54Feb 27, 2025Updated last year
- π AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resourceβ¦β441Feb 17, 2026Updated 3 months ago
- Towards Large Multimodal Models as Visual Foundation Agentsβ267Apr 24, 2025Updated last year
- β15Mar 26, 2024Updated 2 years ago
- [NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learningβ3,180Jan 14, 2025Updated last year
- Code for the paper π³ Tree Search for Language Model Agentsβ222Jul 25, 2024Updated last year
- β116Jul 2, 2024Updated last year
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- β222Dec 20, 2024Updated last year
- An extensible benchmark for evaluating large language models on planningβ465Jun 2, 2026Updated last week
- List of language agents based on paper "Cognitive Architectures for Language Agents"β1,229Jan 16, 2025Updated last year
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.β5,665May 21, 2025Updated last year
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RLβ526Jun 6, 2025Updated last year
- ππͺ BrowserGym, a Gym environment for web task automationβ1,244Mar 17, 2026Updated 2 months ago
- β55Feb 19, 2025Updated last year