GoogleCloudPlatform / evalbench

EvalBench is a flexible framework designed to measure the quality of generative AI (GenAI) workflows around database-specific tasks.

☆18

Alternatives and similar repositories for evalbench

Users who are interested in evalbench are comparing it to the libraries listed below.

microsoft / benchmark-qed
Automated benchmarking of Retrieval-Augmented Generation (RAG) systems
☆35 Updated 2 weeks ago
aws-samples / building-a-knowledge-graph-with-generative-ai
☆12 Updated last year
bbcCorp / kafka-actions
Action to install kafka
☆8 Updated 4 years ago
interp-reasoning / thought-anchors.com
⚓️ Interactive playground for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.
☆15 Updated last week
aws-samples / protein-similarity-search
This repository contains code and notebooks detailing a protein similarity search solution using the ProtT5-XL-UniRef50 model, Amazon Ope…
☆10 Updated last year
pavanjava / llama_workflow_and_agents
This repository is a combination of llama workflows and agents together which is a powerful concept.
☆17 Updated 11 months ago
TuneHQ / cookbook
☆16 Updated 8 months ago
aws-samples / rag-qna-bot-for-your-website-using-langchain-amazon-aurorapg-and-amazon-bedrock
☆14 Updated last year
risingwavelabs / risingwave-py
Python stream processing with RisingWave
☆20 Updated last month
jjovalle99 / DSPy-Text2SQL
DSPy in action with open-source LLMs.
☆72 Updated last year
joaomarcoscrs / change-my-clothes
Using computer vision and GenAI to change your clothes
☆10 Updated 8 months ago
langroid / Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
☆15 Updated last year
marqo-ai / ecommerce-search
Multimodal ecommerce search application built using Marqo Cloud and Marqo's SOTA ecommerce embedding models.
☆18 Updated 8 months ago
kuzudb / api-server
REST-style API server for the Kuzu graph database powered by Express.js.
☆15 Updated this week
guidance-ai / jsonschemabench
☆46 Updated last month
prrao87 / lancedb-study
Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search
☆26 Updated last year
backblaze-b2-samples / ai-rag-examples
Code samples showing how to include data stored in Backblaze B2 in a RAG application
☆11 Updated 9 months ago
microsoft / recipe-tool
Experimental tool for creating "recipes" to drive automations
☆17 Updated this week
antl3x / ToolRAG
Unlimited LLM tools, zero context penalties — ToolRAG serves exactly the LLM tools your user-query demands.
☆12 Updated 3 months ago
huggingface / inference-playground
☆45 Updated this week
microsoft / byoeb
BYOeB is a tool to build a chatbot with a custom knowledge base and an expert-in-the-loop.
☆25 Updated 5 months ago
cfahlgren1 / hf-data-explorer
Chrome Extension for exploring Hugging Face datasets 🔎
☆50 Updated 9 months ago
jxnl / spiral-mcp
☆16 Updated 2 months ago
aws-samples / text-embeddings-pipeline-for-rag
A pipeline to convert contextual knowledge stored in documents and databases into text embeddings, and store them in a vector store
☆18 Updated 3 months ago
google / rag-playground
☆34 Updated 6 months ago
GoogleCloudPlatform / cloudsql-jump-start-solution-for-genai
A jump start solution using GKE or Cloud Run with Cloud SQL and VertexAI
☆52 Updated 3 months ago
boorich / mcp-human-loop
Evaluate if a task requires human intervention
☆14 Updated 6 months ago
atlanhq / dbt-action
Whenever you make a change to a dbt model, Atlan will add downstream lineage impact context right in your pull requests.
☆12 Updated 3 weeks ago
astronomer / orchestrating-workflows-for-genai-deeplearning-ai
Companion repository for the Orchestrating Workflows for GenAI Applications course on Deeplearning.AI: https://bit.ly/45P4WQN
☆17 Updated last month
weaviate / how-to-ingest-pdfs-with-unstructured
☆19 Updated 2 years ago