amazon-science / auto-rag-eval
Code repo for the ICML 2024 paper "Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation"
☆72Updated 9 months ago
Alternatives and similar repositories for auto-rag-eval:
Users who are interested in auto-rag-eval are comparing it to the repositories listed below
- GitHub repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆166Updated 4 months ago
- Comprehensive benchmark for RAG☆154Updated 4 months ago
- Codebase accompanying the Summary of a Haystack paper.☆76Updated 6 months ago
- RefChecker provides an automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Langua…☆357Updated 4 months ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆104Updated this week
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆105Updated 6 months ago
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets☆149Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆152Updated last year
- Attribute (or cite) statements generated by LLMs back to in-context information.☆221Updated 5 months ago
- Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)☆286Updated 4 months ago
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers☆126Updated last year
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆203Updated 3 months ago
- awesome synthetic (text) datasets☆265Updated 5 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆202Updated 5 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆273Updated 8 months ago
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆129Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆131Updated 4 months ago
- ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆102Updated 11 months ago
- Automated Evaluation of RAG Systems☆569Updated this week
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆68Updated last year
- This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343.☆79Updated 2 months ago
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks☆55Updated last year
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generation☆191Updated 11 months ago
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆107Updated 6 months ago