Benchmark your browser agent on ~2.5k READ and ACTION based tasks
★88 · Jul 29, 2025 · Updated 7 months ago
Alternatives and similar repositories for WebBench
Users that are interested in WebBench are comparing it to the libraries listed below
- Challenges for general-purpose web-browsing AI agents · ★67 · Jun 2, 2025 · Updated 9 months ago
- Opensource benchmark evaluating web operators/agents performance · ★47 · Apr 11, 2025 · Updated 10 months ago
- ★18 · Nov 1, 2024 · Updated last year
- Accessible Python client to debug and interact with screenpipe. · ★25 · Jan 11, 2025 · Updated last year
- ★36 · Jul 24, 2025 · Updated 7 months ago
- GitHub CLI extension that adds full inline PR review comment support: view, navigate, reply to, and resolve review threads directly from… · ★92 · Jan 28, 2026 · Updated last month
- ★23 · Jul 24, 2024 · Updated last year
- Library to stream operating system events to AI · ★41 · Apr 8, 2025 · Updated 11 months ago
- ★43 · Jan 18, 2025 · Updated last year
- ★17 · Sep 3, 2025 · Updated 6 months ago
- Airports information per language · ★14 · Oct 17, 2015 · Updated 10 years ago
- ★40 · May 26, 2023 · Updated 2 years ago
- Efficient computer use agent powered by Meta Llama 4 Maverick · ★46 · Apr 17, 2025 · Updated 10 months ago
- ★25 · Mar 2, 2026 · Updated last week
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents · ★21 · Jan 6, 2026 · Updated 2 months ago
- ★13 · Aug 3, 2024 · Updated last year
- ★10 · May 19, 2024 · Updated last year
- ★11 · Dec 30, 2025 · Updated 2 months ago
- ★33 · Jan 27, 2026 · Updated last month
- ★12 · Mar 17, 2025 · Updated 11 months ago
- Reasoning-based Evaluation and Ranking of Translations. · ★19 · Jul 18, 2025 · Updated 7 months ago
- Summarize commits of your teammates using LLM to save time · ★13 · Jan 17, 2025 · Updated last year
- Documentation for Fly.io Sprites · ★32 · Feb 23, 2026 · Updated 2 weeks ago
- Windows SSPI wrapper in pure Python · ★15 · Nov 29, 2023 · Updated 2 years ago
- Reference implementation of models from Nyonic Model Factory · ★12 · May 13, 2024 · Updated last year
- ★14 · Jun 3, 2025 · Updated 9 months ago
- ★32 · Sep 19, 2025 · Updated 5 months ago
- ★11 · Jan 28, 2025 · Updated last year
- A simple utility to draft scheduling emails. · ★12 · Sep 13, 2023 · Updated 2 years ago
- Widget loader example for the series of articles about web widgets. · ★12 · Jun 14, 2021 · Updated 4 years ago
- Execute Shellcode And Other Goodies From MMC · ★14 · Jun 17, 2015 · Updated 10 years ago
- ★10 · Oct 6, 2021 · Updated 4 years ago
- Fine-tune copilot based on your codebase · ★12 · Mar 26, 2024 · Updated last year
- ★22 · Jan 15, 2026 · Updated last month
- Google Sheets Rest API transformed into GraphQL to be added as a remote schema in Hasura · ★13 · Dec 11, 2022 · Updated 3 years ago
- A gallery to display recent images from the #the-faraday-cage discord channel · ★10 · Dec 10, 2021 · Updated 4 years ago
- ★12 · Apr 18, 2025 · Updated 10 months ago
- A language-agnostic framework that makes a codebase aware of its own architecture, decisions, and failure modes, and enforces that knowl… · ★26 · Jan 22, 2026 · Updated last month
- ★15 · Apr 26, 2025 · Updated 10 months ago