GoogleCloudPlatform / evalbenchLinks
EvalBench is a flexible framework designed to measure the quality of generative AI (GenAI) workflows around database specific tasks.
☆27Updated last week
Alternatives and similar repositories for evalbench
Users that are interested in evalbench are comparing it to the libraries listed below
Sorting:
- ☆42Updated this week
- Automated knowledge graph creation SDK☆124Updated last year
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆181Updated last week
- DSPY on action with OpenSource LLMs.☆103Updated last year
- A jump start solution using GKE or Cloud Run with Cloud SQL and VertexAI☆60Updated last month
- ☆78Updated 2 months ago
- ☆45Updated 2 months ago
- The Open Data QnA python library enables you to chat with your databases by leveraging LLM Agents on Google Cloud. Open Data QnA enables…☆220Updated last week
- Validation Tools for A2A Agents☆335Updated this week
- ☆75Updated last year
- Official Repo for CRMArena and CRMArena-Pro☆132Updated this week
- Build MLOps Pipelines in Minutes☆255Updated 2 weeks ago
- ☆38Updated 2 weeks ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆140Updated 5 months ago
- ☆44Updated last month
- ☆45Updated last year
- Framework for building data agent workflows☆84Updated last year
- Data management with LLMs☆182Updated last year
- Langtrace SDK for Python Applications☆45Updated 6 months ago
- Neo4j Extensions and Integrations with Vertex AI and LangChain☆27Updated 9 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆121Updated last week
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 9 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆84Updated last year
- ☆42Updated 11 months ago
- Hugging Face Deep Learning Containers (DLCs) for Google Cloud☆162Updated last week
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzu☆73Updated 4 months ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆80Updated 9 months ago
- Fast and flexible memory for agents and AI applications using Redis☆165Updated last week
- Official page for ICLR 2025 paper "Sufficient Context: A New Lens on Retrieval Augmented Generation Systems"☆63Updated 6 months ago
- Query language for blending SQL and local language models across structured + unstructured data, with type constraints.☆159Updated this week