microsoft / benchmark-qedLinks
Automated benchmarking of Retrieval-Augmented Generation (RAG) systems
☆35Updated 2 weeks ago
Alternatives and similar repositories for benchmark-qed
Users that are interested in benchmark-qed are comparing it to the libraries listed below
Sorting:
- ☆12Updated last year
- Code samples showing how to include data stored in Backblaze B2 in a RAG application☆11Updated 9 months ago
- EvalBench is a flexible framework designed to measure the quality of generative AI (GenAI) workflows around database specific tasks.☆18Updated 2 weeks ago
- Blueprint by Mozilla.ai for answering questions about structured documents☆35Updated 4 months ago
- Let Claude control a web browser on your machine.☆34Updated last month
- ☆40Updated 3 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆91Updated 2 weeks ago
- ☆13Updated last month
- Unlimited LLM tools, zero context penalties — ToolRAG serves exactly the LLM tools your user-query demands.☆12Updated 3 months ago
- This repository is a combination of llama workflows and agents together which is a powerful concept.☆17Updated 11 months ago
- Framework for creating reliable LLM-based conversational agents☆48Updated 2 weeks ago
- AI agent with RAG+ReAct on Indian Constitution & BNS☆69Updated 2 weeks ago
- Computer use-like MCP for webapps and electron apps, to enable AI agents to test their changes☆27Updated last week
- AgenticSearch operates within an agentic workflow, utilizing Gemini 2.0 and an extensive tool registry to handle complex questions. By in…☆20Updated 5 months ago
- Experimental tool for creating "recipes" to drive automations☆17Updated this week
- Autonomous software engineering department with github/roo☆11Updated last month
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-…☆106Updated 4 months ago
- This project involves using llamaindex Multi Agents concierge system and Qdrant vector database to customize the RAG application with use…☆54Updated 10 months ago
- Multi-Agent Systems with Google's Agent Development Kit + A2A + MCP☆35Updated 2 months ago
- ☆14Updated last month
- Galleries for Models, Datasets, and Plugins used by Transformer Lab☆23Updated this week
- ☆17Updated last month
- ☆15Updated last year
- Task management for AI agents☆15Updated 2 weeks ago
- Eunomia is the open-source authorization layer for AI Agents☆56Updated last week
- FalkorDB-Browser is a visualization UI for FalkorDB.☆38Updated this week
- Validation Tools for A2A Agents☆96Updated this week
- Fastest way to scaffold FastHTML applications.☆24Updated 2 months ago
- ☆52Updated 3 weeks ago
- Deep Research through Multi-Agents, using GraphRAG☆76Updated 8 months ago