MiniWoB++: a web interaction benchmark for reinforcement learning
☆377 · Apr 6, 2026 · Updated last week
Alternatives and similar repositories for miniwob-plusplus
Users that are interested in miniwob-plusplus are comparing it to the libraries listed below.
- Graph-based Deep Q Network for Web Navigation ☆48 · Jul 8, 2019 · Updated 6 years ago
- Demos for the MiniWoB++ benchmark ☆21 · Feb 23, 2018 · Updated 8 years ago
- WebGym: Web-browser-based tasks for RL Agents ☆24 · Feb 4, 2021 · Updated 5 years ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w… ☆975 · Nov 5, 2025 · Updated 5 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents" ☆1,428 · Nov 26, 2025 · Updated 4 months ago
- A codebase for "Language Models can Solve Computer Tasks" ☆240 · May 1, 2024 · Updated last year
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents ☆520 · Sep 6, 2024 · Updated last year
- ☆60 · Jan 9, 2024 · Updated 2 years ago
- Mapping natural language commands to web elements ☆38 · Jul 26, 2022 · Updated 3 years ago
- VisualWebArena is a benchmark for multimodal agents. ☆456 · Nov 9, 2024 · Updated last year
- ☆16 · Apr 9, 2021 · Updated 5 years ago
- ☆18 · Mar 18, 2026 · Updated 3 weeks ago
- An API conversion tool for popular external reinforcement learning environments ☆207 · Updated this week
- Collection of in-progress libraries for entity neural networks. ☆30 · Jun 24, 2022 · Updated 3 years ago
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024) ☆256 · Jul 16, 2024 · Updated last year
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control ☆68 · Jan 7, 2026 · Updated 3 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 · Aug 8, 2022 · Updated 3 years ago
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks? ☆242 · Feb 23, 2026 · Updated last month
- WebLINX is a benchmark for building web navigation agents with conversational capabilities ☆160 · Feb 11, 2025 · Updated last year
- This repository contains the opensource version of the datasets were used for different parts of training and testing of models that grou… ☆34 · Aug 20, 2020 · Updated 5 years ago
- A web based platform for collecting human actions in reinforcement learning environments ☆31 · Sep 10, 2025 · Updated 7 months ago
- ☆19 · Mar 1, 2023 · Updated 3 years ago
- The model, data and code for the visual GUI Agent SeeClick ☆478 · Jul 13, 2025 · Updated 9 months ago
- [EMNLP 2022] The baseline code for META-GUI dataset ☆14 · Jul 9, 2024 · Updated last year
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools ☆18 · Nov 22, 2022 · Updated 3 years ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24) ☆3,316 · Feb 8, 2026 · Updated 2 months ago
- 🌎💪 BrowserGym, a Gym environment for web task automation ☆1,193 · Mar 17, 2026 · Updated 3 weeks ago
- Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments ☆61 · Aug 19, 2024 · Updated last year
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆23 · Oct 28, 2024 · Updated last year
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning. ☆394 · Feb 22, 2025 · Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch ☆57 · May 21, 2023 · Updated 2 years ago
- Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning" ☆52 · Nov 10, 2024 · Updated last year
- RL research on Android devices. ☆1,204 · Feb 26, 2026 · Updated last month
- A Universal Platform for Training and Evaluation of Mobile Interaction ☆61 · Sep 24, 2025 · Updated 6 months ago
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments ☆2,757 · Apr 2, 2026 · Updated 2 weeks ago
- An engine for high performance multi-agent environments with very large numbers of agents, along with a set of reference environments ☆329 · Nov 16, 2025 · Updated 5 months ago
- ☆41 · Jul 21, 2024 · Updated last year
- ☆20 · Apr 24, 2024 · Updated last year
- AndroidWorld is an environment and benchmark for autonomous agents ☆712 · Apr 9, 2026 · Updated last week