π Benchmark your browser agent on ~2.5k READ and ACTION based tasks
β90Jul 29, 2025Updated 8 months ago
Alternatives and similar repositories for WebBench
Users that are interested in WebBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Opensource benchmark evaluating web operators/agents performanceβ47Apr 11, 2025Updated 11 months ago
- Benchmark of complex, multimodal desktop-oriented tasks for advanced GUI-navigation AI agentsβ24May 7, 2025Updated 10 months ago
- Fine-tune copilot based on your codebaseβ12Mar 26, 2024Updated 2 years ago
- GitHub CLI extension that adds full inline PR review comment support β view, navigate, reply to, and resolve review threads directly fromβ¦β107Jan 28, 2026Updated 2 months ago
- Dual API router (Anthropic + OpenAI compatible) for Claude MAX Plan - Use flat-rate billing with ANY AI tool: OpenAI SDK, LangChain, Aβ¦β55Dec 21, 2025Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A Productivity-Boosting Burp Suite extension written in Kotlin that enables persistent sticky session handling in web application testingβ¦β12Oct 8, 2025Updated 5 months ago
- Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variabβ¦β29Nov 21, 2025Updated 4 months ago
- β18Jan 3, 2025Updated last year
- Multi-Agent Reinforcement Learning Environment for the card game SkyJo, compatible with PettingZoo and RLLIBβ16Feb 21, 2026Updated last month
- β18Jun 11, 2024Updated last year
- Source code for "An Empirical Study of Code Smells in Transformer-based Code Generation Techniques".β11Oct 4, 2022Updated 3 years ago
- Tools for running Home Assistant on Windowsβ10Apr 12, 2017Updated 8 years ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Largβ¦β26Mar 6, 2025Updated last year
- OpenAPI specs for integrating DataForSEO APIs with ChatGPT Actionsβ29Mar 3, 2026Updated 3 weeks ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- β36Jul 24, 2025Updated 8 months ago
- Language Models for Code Completion: a Practical Evaluationβ13Jan 19, 2024Updated 2 years ago
- β35Mar 5, 2026Updated 3 weeks ago
- ClickAttention: Click Region Similarity Guided Interactive Segmentationβ23Jan 3, 2025Updated last year
- Reasoning-based Evaluation and Ranking of Translations.β20Jul 18, 2025Updated 8 months ago
- β33Sep 19, 2025Updated 6 months ago
- β18Nov 1, 2024Updated last year
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!β10Apr 4, 2024Updated last year
- A Prompt Learning Framework for Source Code Summarizationβ14Dec 26, 2023Updated 2 years ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- β18Mar 2, 2026Updated 3 weeks ago
- Code for "A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction"β15Mar 15, 2024Updated 2 years ago
- β13Aug 3, 2024Updated last year
- Tools for developing and optimizing background agents.β31Updated this week
- β14Jun 3, 2025Updated 9 months ago
- β40May 26, 2023Updated 2 years ago
- This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation".β10Apr 30, 2023Updated 2 years ago
- π¦Ύπ»π distributed training & serverless inference at scale on RunPodβ19May 26, 2024Updated last year
- β12Jan 2, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- π Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)β25Oct 15, 2023Updated 2 years ago
- LLM agent to generate workflow UML diagramsβ17Mar 25, 2024Updated 2 years ago
- β17Jul 12, 2025Updated 8 months ago
- A sidekick to help you write code in notebooksβ15Jul 10, 2022Updated 3 years ago
- π A code review tool with Github by ChatGPTβ15Jul 23, 2024Updated last year
- Summarize commits of your teammates using LLM to save timeβ13Jan 17, 2025Updated last year
- Machinery data, made easy. Easily download and prepare common industrial datasets.β23Feb 13, 2024Updated 2 years ago