☆41Jul 21, 2024Updated last year
Alternatives and similar repositories for webagents-step
Users who are interested in webagents-step are comparing it to the libraries listed below
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆12Oct 20, 2024Updated last year
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆21Jan 6, 2026Updated last month
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Aug 7, 2025Updated 6 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆148Nov 26, 2024Updated last year
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,353Nov 26, 2025Updated 3 months ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆160Feb 11, 2025Updated last year
- ☆16Apr 9, 2021Updated 4 years ago
- [WSDM 2024] Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding☆17Mar 6, 2024Updated last year
- VisualWebArena is a benchmark for multimodal agents.☆440Nov 9, 2024Updated last year
- GUICourse: From General Vision Language Models to Versatile GUI Agents☆136Updated this week
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Nov 2, 2021Updated 4 years ago
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?☆234Feb 23, 2026Updated last week
- Multimodal computer agent data collection program☆164Dec 5, 2025Updated 2 months ago
- ☆32Aug 17, 2025Updated 6 months ago
- ☆20Apr 24, 2024Updated last year
- A Universal Platform for Training and Evaluation of Mobile Interaction☆60Sep 24, 2025Updated 5 months ago
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆2,608Updated this week
- Automating enterprise workflows with multimodal agents☆115Oct 9, 2024Updated last year
- This repository contains expert evaluation interface and data evaluation script for the OpenScholar project.☆36Nov 19, 2024Updated last year
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆28Jul 9, 2025Updated 7 months ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆826Feb 3, 2025Updated last year
- AWM: Agent Workflow Memory☆397Dec 22, 2025Updated 2 months ago
- ☆23Jul 24, 2024Updated last year
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆947Nov 5, 2025Updated 3 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆300Jul 18, 2025Updated 7 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆81May 7, 2024Updated last year
- GPI-Space: Memory Driven Computing and Big Data☆10Jan 2, 2025Updated last year
- Jax like function transformation engine but micro, microjax☆34Oct 25, 2024Updated last year
- The Library for LLM-based multi-agent applications☆102Jul 18, 2025Updated 7 months ago
- A codebase for "Language Models can Solve Computer Tasks"☆240May 1, 2024Updated last year
- This repository contains the code to reproduce the experiments of the poster "Supervised Contrastive Learning for Product Matching"☆38Feb 11, 2022Updated 4 years ago
- FPGA Low latency 10GBASE-R PCS☆12May 23, 2023Updated 2 years ago
- Demo repository showcasing how to use reusable workflows to build artifact attestations☆14Feb 16, 2026Updated 2 weeks ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆22Jan 19, 2026Updated last month
- A working FE Bypass for all Roblox clients☆19Jan 10, 2026Updated last month
- Desktop client for Walltaker powered by golang☆12Sep 13, 2022Updated 3 years ago
- Extract information from XBRL files in the ESEF format☆13Jan 3, 2026Updated last month
- An LLM-based Web Navigating Agent (KDD'24)☆929Sep 27, 2024Updated last year
- ☆40Jul 26, 2024Updated last year