EvalBench is a flexible framework designed to measure the quality of generative AI (GenAI) workflows around database specific tasks.
☆36Apr 3, 2026Updated this week
Alternatives and similar repositories for evalbench
Users that are interested in evalbench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Google Cloud Creative Studio is a comprehensive, all-in-one Generative AI Platform designed as a deployable solution for your own Google …☆48Mar 31, 2026Updated last week
- ☆16Nov 28, 2025Updated 4 months ago
- ☆21Mar 18, 2026Updated 3 weeks ago
- ☆17Mar 25, 2026Updated 2 weeks ago
- Sample apps and notebooks for Cloud Spanner on Google Cloud☆28Updated this week
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [ICLR 2026] Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing☆27Jan 27, 2026Updated 2 months ago
- The Python Proto Converter converts between protos in Python. Proto conversion is often needed when converting between Database Access Ob…☆17Dec 3, 2025Updated 4 months ago
- ☆39Updated this week
- ☆154Nov 6, 2025Updated 5 months ago
- TypeScript repo for the XDK auto-generated code.☆42Feb 28, 2026Updated last month
- A modular, HTTP-driven orchestrator to define, practice, and refine pipelines and generative tools☆19Apr 27, 2025Updated 11 months ago
- ☆34Mar 25, 2026Updated 2 weeks ago
- A Structured Output Benchmark whose 'ground-truth' is actually right☆19Dec 5, 2025Updated 4 months ago
- A design approach to implementing a good GitHub Wiki system of documentation. This repository explains how to create a GitHub Wiki and th…☆17Mar 19, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Accelerate ingestion/transformation of pathology images into DICOMWeb☆25Apr 2, 2026Updated last week
- Rickbot ADK - a multi-personality chatbot built using Google ADK and Gemini, and leveraging Agent-Starter-Pack and Gemini CLI☆17Feb 15, 2026Updated last month
- [CVPR 2026] Official Implementation of Edit2Perceive☆35Feb 21, 2026Updated last month
- [ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval☆17Aug 24, 2022Updated 3 years ago
- ☆25Mar 30, 2026Updated last week
- A Mimetic Procedural Benchmark Generator for the Abstraction and Reasoning Corpus☆41Updated this week
- ☆41Dec 9, 2025Updated 4 months ago
- 𝔸𝕄𝔹ℝ𝕆𝕊𝕀𝔸: A Benchmark for Parsing Ambiguous Questions into Database Queries☆15Oct 31, 2024Updated last year
- ☆41Mar 30, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Cyber Pilot is the governance and validation layer for AI-assisted software delivery☆49Updated this week
- ☆23Jan 10, 2026Updated 2 months ago
- AI-in-One Dashboard Power BI template for comprehensive AI usage analytics☆31Updated this week
- ☆27Mar 20, 2026Updated 2 weeks ago
- Create and manage Artifact Registry repositories☆24Feb 24, 2026Updated last month
- A dbt adapter for TiDB☆15Dec 14, 2023Updated 2 years ago