MiniWoB++: a web interaction benchmark for reinforcement learning
☆384May 27, 2026Updated last week
Alternatives and similar repositories for MiniWoB-plusplus
Users that are interested in MiniWoB-plusplus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Workflow-Guided Exploration: sample-efficient RL agent for web tasks☆118Jun 5, 2023Updated 3 years ago
- Demos for the MiniWoB++ benchmark☆21Feb 23, 2018Updated 8 years ago
- WebGym: Web-browser-based tasks for RL Agents☆24Feb 4, 2021Updated 5 years ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆999Nov 5, 2025Updated 7 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,499Nov 26, 2025Updated 6 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A codebase for "Language Models can Solve Computer Tasks"☆240May 1, 2024Updated 2 years ago
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆548Sep 6, 2024Updated last year
- ☆60Jan 9, 2024Updated 2 years ago
- Mapping natural language commands to web elements☆38Jul 26, 2022Updated 3 years ago
- ☆16Apr 9, 2021Updated 5 years ago
- VisualWebArena is a benchmark for multimodal agents.☆477Nov 9, 2024Updated last year
- ☆18Apr 17, 2026Updated last month
- An API conversion tool for popular external reinforcement learning environments☆210May 10, 2026Updated 3 weeks ago
- Collection of in-progress libraries for entity neural networks.☆29Jun 24, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆260Jul 16, 2024Updated last year
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆69Jan 7, 2026Updated 5 months ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆160Feb 11, 2025Updated last year
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?☆252Apr 25, 2026Updated last month
- This repository contains the opensource version of the datasets were used for different parts of training and testing of models that grou…☆34Aug 20, 2020Updated 5 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆31Sep 10, 2025Updated 8 months ago
- ☆19Mar 1, 2023Updated 3 years ago
- The model, data and code for the visual GUI Agent SeeClick☆483Jul 13, 2025Updated 10 months ago
- [EMNLP 2022] The baseline code for META-GUI dataset☆15Jul 9, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Nov 22, 2022Updated 3 years ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆3,472Feb 8, 2026Updated 4 months ago
- Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments☆61Aug 19, 2024Updated last year
- 🌎💪 BrowserGym, a Gym environment for web task automation☆1,241Mar 17, 2026Updated 2 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.☆394Feb 22, 2025Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆57May 21, 2023Updated 3 years ago
- RL research on Android devices.☆1,221May 21, 2026Updated 2 weeks ago
- A Universal Platform for Training and Evaluation of Mobile Interaction☆62Sep 24, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆2,911Updated this week
- Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning"☆52Nov 10, 2024Updated last year
- An engine for high performance multi-agent environments with very large numbers of agents, along with a set of reference environments☆333May 23, 2026Updated 2 weeks ago
- ☆41Jul 21, 2024Updated last year
- ☆20Apr 24, 2024Updated 2 years ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆361Dec 3, 2025Updated 6 months ago
- AndroidWorld is an environment and benchmark for autonomous agents☆777Apr 9, 2026Updated 2 months ago