π Benchmark your browser agent on ~2.5k READ and ACTION based tasks
β96Jul 29, 2025Updated 10 months ago
Alternatives and similar repositories for WebBench
Users that are interested in WebBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Uncertainty Quantification with Pre-trained Language Models: An Empirical Analysisβ15Oct 11, 2022Updated 3 years ago
- Accessible Python client to debug and interact with screenpipe.β25Jan 11, 2025Updated last year
- Benchmark of complex, multimodal desktop-oriented tasks for advanced GUI-navigation AI agentsβ24May 7, 2025Updated last year
- Fine-tune copilot based on your codebaseβ12Mar 26, 2024Updated 2 years ago
- β13Feb 22, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Tiny institutional ineptitude tracker.β17Sep 7, 2024Updated last year
- Few-shot Bayesian Imitation Learning with Policies as Logic over Programsβ21Oct 19, 2025Updated 7 months ago
- Interface for interacting with Gradient AI in Pythonβ15Jun 28, 2024Updated last year
- GPT Table Semantic Parsing with complex & non-intuitive structure.β17Jul 16, 2025Updated 10 months ago
- [LREC-Coling 2024] PECC: Problem Extraction and Coding Challengesβ14May 30, 2024Updated 2 years ago
- β17Oct 30, 2023Updated 2 years ago
- Sample audio and video files for the YouTube Video Tutorials on HTML5 Audio and Videoβ16Mar 4, 2021Updated 5 years ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Largβ¦β26Mar 6, 2025Updated last year
- Source code for "An Empirical Study of Code Smells in Transformer-based Code Generation Techniques".β11Oct 4, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Natural Language or Notβ11Jun 20, 2022Updated 3 years ago
- Language Models for Code Completion: a Practical Evaluationβ13Jan 19, 2024Updated 2 years ago
- Official code repository for "Web Agents with World Models [ICLR 2025]".β30Mar 2, 2025Updated last year
- Reasoning-based Evaluation and Ranking of Translations.β20Jul 18, 2025Updated 10 months ago
- β18Nov 1, 2024Updated last year
- A dataset of Java bugs for automatic repair, derived from the C bugs of IntroClassβ15Aug 11, 2021Updated 4 years ago
- β18Mar 2, 2026Updated 2 months ago
- Answering Ambiguous Questions via Iterative Promptingβ14May 25, 2024Updated 2 years ago
- On-the-fly Definition Augmentation of LLMs for Biomedical NERβ14Apr 14, 2025Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A ChatGPT-like application built with Streamlit for interactive conversation with OpenAI's GPT-3.5 model.β17Aug 8, 2024Updated last year
- β13Aug 3, 2024Updated last year
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledgeβ17Nov 16, 2021Updated 4 years ago
- β14Jun 3, 2025Updated 11 months ago
- The infoZilla unstructured software engineering data mining tool. It can find and extract source code regions, patches, stack traces, enuβ¦β15Jan 24, 2019Updated 7 years ago
- β40May 26, 2023Updated 3 years ago
- MUSIC: MUtation analySIs tool with High Configurability and Extensibilityβ18Apr 24, 2026Updated last month
- A mutation tool for source and IRβ13Sep 6, 2018Updated 7 years ago
- β12Jan 2, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Declarative AI Pipelinesβ22Oct 2, 2024Updated last year
- π Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)β26Oct 15, 2023Updated 2 years ago
- Cyber-Physical V&V Challenges for the Evaluation of State of the Art Model Checkersβ13Feb 12, 2020Updated 6 years ago
- β23Mar 4, 2025Updated last year
- β17Jul 12, 2025Updated 10 months ago
- Machinery data, made easy. Easily download and prepare common industrial datasets.β23Feb 13, 2024Updated 2 years ago
- Code Snippet Recommendation from Stack Overflow Postβ19Jun 30, 2021Updated 4 years ago