β41Jul 21, 2024Updated last year
Alternatives and similar repositories for webagents-step
Users that are interested in webagents-step are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper π³ Tree Search for Language Model Agentsβ221Jul 25, 2024Updated last year
- WONDERBREAD benchmark + dataset for BPM tasksβ34Jul 30, 2025Updated 8 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"β1,428Nov 26, 2025Updated 4 months ago
- β16Apr 9, 2021Updated 5 years ago
- VisualWebArena is a benchmark for multimodal agents.β454Nov 9, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- WebLINX is a benchmark for building web navigation agents with conversational capabilitiesβ160Feb 11, 2025Updated last year
- GUICourse: From General Vision Langauge Models to Versatile GUI Agentsβ140Mar 1, 2026Updated last month
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]β149Nov 26, 2024Updated last year
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"β13Oct 20, 2024Updated last year
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?β242Feb 23, 2026Updated last month
- ππͺ BrowserGym, a Gym environment for web task automationβ1,190Mar 17, 2026Updated 3 weeks ago
- A Universal Platform for Training and Evaluation of Mobile Interactionβ61Sep 24, 2025Updated 6 months ago
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"β35Nov 11, 2025Updated 5 months ago
- β73Jun 10, 2025Updated 10 months ago
- NordVPN Threat Protection Proβ’ β’ AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- β24Apr 1, 2026Updated last week
- β22May 3, 2025Updated 11 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist wβ¦β972Nov 5, 2025Updated 5 months ago
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environmentsβ2,757Apr 2, 2026Updated last week
- Automating enterprise workflows with multimodal agentsβ117Oct 9, 2024Updated last year
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multβ¦β841Feb 3, 2025Updated last year
- β15Jan 24, 2025Updated last year
- An LLM-based Web Navigating Agent (KDD'24)β934Sep 27, 2024Updated last year
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception oβ¦β29Jul 9, 2025Updated 9 months ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Conceptual Construct Representationsβ11Feb 23, 2023Updated 3 years ago
- Multimodal computer agent data collection programβ167Dec 5, 2025Updated 4 months ago
- β15Nov 3, 2022Updated 3 years ago
- A codebase for "Language Models can Solve Computer Tasks"β240May 1, 2024Updated last year
- β13May 16, 2025Updated 10 months ago
- [ACL 2024] Novel reranking method to select the best solutions for code generationβ16Jun 9, 2024Updated last year
- Official Project Webpage for paper "DiffSRL: Learning Dynamic-aware State Representation for Control via Differentiable Simulation"β12Apr 4, 2022Updated 4 years ago
- β13Jun 14, 2023Updated 2 years ago
- β20Apr 24, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- AWM: Agent Workflow Memoryβ415Dec 22, 2025Updated 3 months ago
- The first large scale formally verified reasoning dataset for Verilogβ21May 16, 2025Updated 10 months ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generationβ27Jun 7, 2024Updated last year
- Sotopia-Ο: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)β83May 7, 2024Updated last year
- Setup scripts for the WebArena benchmarkβ20Jun 19, 2025Updated 9 months ago
- β35Mar 24, 2023Updated 3 years ago
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorchβ29Mar 22, 2026Updated 3 weeks ago