microsoft / benchmark-qedLinks
Automated benchmarking of Retrieval-Augmented Generation (RAG) systems
☆72Updated this week
Alternatives and similar repositories for benchmark-qed
Users that are interested in benchmark-qed are comparing it to the libraries listed below
Sorting:
- Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning☆473Updated 4 months ago
- CUGA is an open-source generalist agent for the enterprise, supporting complex task execution on web and APIs, OpenAPI/MCP integrations, …☆665Updated 2 weeks ago
- ☆80Updated 4 months ago
- ☆278Updated last week
- [EMNLP'25 findings] This is the official repo for the paper, HiRAG: Retrieval-Augmented Generation with Hierarchical Knowledge.☆511Updated 2 months ago
- Ranking LLMs on agentic tasks☆211Updated 2 months ago
- ☆237Updated 2 months ago
- ☆334Updated last month
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.☆642Updated last week
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆140Updated 5 months ago
- This is the official repository for Auto-RAG.☆232Updated 6 months ago
- Official page for ICLR 2025 paper "Sufficient Context: A New Lens on Retrieval Augmented Generation Systems"☆63Updated 6 months ago
- Repo for "Adaptation of Agentic AI"☆585Updated 2 weeks ago
- Readymade evaluators for agent trajectories☆473Updated 5 months ago
- Workflows are an event-driven, async-first, step-based way to control the execution flow of AI applications like agents.☆308Updated this week
- RAG evaluation without the need for "golden answers"☆338Updated last month
- 🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.☆692Updated this week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆459Updated 5 months ago
- An open-source tool for LLM prompt optimization.☆759Updated last week
- Agentic Web: Weaving the Next Web with AI Agents.☆408Updated 2 weeks ago
- ☆223Updated 7 months ago
- UniversalRAG: Retrieval-Augmented Generation over Corpora of Diverse Modalities and Granularities☆157Updated 8 months ago
- Docling LangChain integration☆63Updated 2 months ago
- Benchmark and optimize LLM inference across frameworks with ease☆161Updated 4 months ago
- Building LLM-Enabled Multi Agent Applications from Scratch☆353Updated this week
- Official code of the ACL 2025 paper "SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation"☆133Updated 6 months ago
- This repository contains the toolkit for replicating results from our technical report.☆200Updated 5 months ago
- Evolve your language agent with Agentic Context Engineering (ACE)☆576Updated 2 weeks ago
- Extract structured data from CUAD contracts using LangChain, build a knowledge graph, and query insights through a LangGraph agent - tran…☆146Updated 8 months ago
- Salesforce Enterprise Deep Research☆1,064Updated last week