Evaluation tools for Retrieval-augmented Generation (RAG) methods.
☆170Nov 18, 2024Updated last year
Alternatives and similar repositories for rageval
Users that are interested in rageval are comparing it to the libraries listed below
Sorting:
- A curated list of resources dedicated to retrieval-augmented generation (RAG).☆128Oct 31, 2025Updated 4 months ago
- ☆11Oct 15, 2022Updated 3 years ago
- ☆12Jan 11, 2026Updated last month
- ☆215Apr 2, 2025Updated 11 months ago
- TrustRAG:The RAG Framework within Reliable input,Trusted output☆1,230Jan 7, 2026Updated last month
- KDD2024-WhoIsWho-Top3☆16Jun 17, 2024Updated last year
- Automated Evaluation of RAG Systems☆695Mar 28, 2025Updated 11 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆201Aug 16, 2024Updated last year
- Evaluation for AI apps and agent☆44Jan 18, 2024Updated 2 years ago
- Code of LeCoRE☆13Feb 15, 2023Updated 3 years ago
- ☆11Jan 3, 2024Updated 2 years ago
- Chain-of-thought 방식을 활용하여 llama2를 fine-tuning☆10Nov 18, 2023Updated 2 years ago
- Supercharge Your LLM Application Evaluations 🚀☆12,736Feb 24, 2026Updated last week
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.☆12Oct 12, 2024Updated last year
- Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model"☆11Jun 1, 2021Updated 4 years ago
- InternLM-7B微调, SFT/LoRA, instruction finetune☆13May 17, 2024Updated last year
- Evaluate Dify assistants with promptfoo!☆18Mar 6, 2024Updated last year
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆166Oct 14, 2025Updated 4 months ago
- Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)☆426Apr 3, 2025Updated 11 months ago
- ☆51Jun 14, 2024Updated last year
- Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.☆34Apr 29, 2024Updated last year
- 🔥 AgentScale: A Scalable Microservices-based Agent Orchestration Framework☆27Jul 25, 2024Updated last year
- ☆356May 17, 2024Updated last year
- The source code used for paper "Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts", in WSDM 2023.☆15May 27, 2023Updated 2 years ago
- Code that accompanies the PyData New York (2022) talk: Addressing the sensitivity of Large language models☆13Nov 7, 2022Updated 3 years ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆239Updated this week
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆159Jul 25, 2025Updated 7 months ago
- 大模型检索增强生成技术最佳实践。☆88Sep 4, 2024Updated last year
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆177Jun 20, 2024Updated last year
- ☆164Apr 17, 2023Updated 2 years ago
- An Open-Source Package for Information Retrieval☆168Updated this week
- ☆2,123May 8, 2024Updated last year
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆56May 22, 2025Updated 9 months ago
- ☆83Apr 18, 2024Updated last year
- Learning High-Quality and General-Purpose Phrase Representations. Findings of EACL 2024☆16Feb 29, 2024Updated 2 years ago
- This repository contains the metadata and data of different databases that we use for testing☆14Jan 29, 2025Updated last year
- Fine-Tuning Embedding for RAG with Synthetic Data☆523Sep 11, 2023Updated 2 years ago
- ☆16Jul 12, 2024Updated last year
- ☆19Aug 23, 2024Updated last year