MiniWoB++: a web interaction benchmark for reinforcement learning
☆371May 5, 2025Updated 10 months ago
Alternatives and similar repositories for miniwob-plusplus
Users that are interested in miniwob-plusplus are comparing it to the libraries listed below
Sorting:
- Workflow-Guided Exploration: sample-efficient RL agent for web tasks☆118Jun 5, 2023Updated 2 years ago
- Graph-based Deep Q Network for Web Navigation☆48Jul 8, 2019Updated 6 years ago
- Demos for the MiniWoB++ benchmark☆21Feb 23, 2018Updated 8 years ago
- WebGym: Web-browser-based tasks for RL Agents☆24Feb 4, 2021Updated 5 years ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,353Nov 26, 2025Updated 3 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆952Nov 5, 2025Updated 4 months ago
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆496Sep 6, 2024Updated last year
- A codebase for "Language Models can Solve Computer Tasks"☆240May 1, 2024Updated last year
- VisualWebArena is a benchmark for multimodal agents.☆440Nov 9, 2024Updated last year
- ☆59Jan 9, 2024Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆31Sep 10, 2025Updated 5 months ago
- An API conversion tool for popular external reinforcement learning environments☆204Dec 15, 2025Updated 2 months ago
- ☆16Apr 9, 2021Updated 4 years ago
- Mapping natural language commands to web elements☆38Jul 26, 2022Updated 3 years ago
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆255Jul 16, 2024Updated last year
- GPT implementation in Flax☆18Jan 8, 2022Updated 4 years ago
- Collection of in-progress libraries for entity neural networks.☆30Jun 24, 2022Updated 3 years ago
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?☆234Feb 23, 2026Updated last week
- ☆19Mar 1, 2023Updated 3 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Nov 22, 2022Updated 3 years ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆160Feb 11, 2025Updated last year
- This repository contains the opensource version of the datasets were used for different parts of training and testing of models that grou…☆34Aug 20, 2020Updated 5 years ago
- ☆20Apr 24, 2024Updated last year
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆3,187Feb 8, 2026Updated 3 weeks ago
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.☆389Feb 22, 2025Updated last year
- 🌎💪 BrowserGym, a Gym environment for web task automation☆1,140Feb 10, 2026Updated 3 weeks ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆68Jan 7, 2026Updated 2 months ago
- A Universal Platform for Training and Evaluation of Mobile Interaction☆61Sep 24, 2025Updated 5 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.☆35Feb 25, 2026Updated last week
- The model, data and code for the visual GUI Agent SeeClick☆469Jul 13, 2025Updated 7 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆300Jul 18, 2025Updated 7 months ago
- An engine for high performance multi-agent environments with very large numbers of agents, along with a set of reference environments☆321Nov 16, 2025Updated 3 months ago
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆2,608Feb 28, 2026Updated last week
- BabyAI platform. A testbed for training agents to understand and execute language commands.☆756Oct 1, 2023Updated 2 years ago
- RL research on Android devices.☆1,190Feb 26, 2026Updated last week
- EfficientSAM + YOLO World base model for use with Autodistill.☆10Feb 21, 2024Updated 2 years ago
- hanabi_learning_environment is a research platform for Hanabi experiments.☆11May 17, 2022Updated 3 years ago