amazon-science / auto-rag-eval
Code repo for the ICML 2024 paper "Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation"
☆77Updated 11 months ago
Alternatives and similar repositories for auto-rag-eval
Users who are interested in auto-rag-eval are comparing it to the libraries listed below.
- GitHub repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆180Updated 6 months ago
- ☆143Updated 10 months ago
- RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Langua…☆370Updated 3 weeks ago
- Benchmarking library for RAG☆206Updated this week
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆157Updated last year
- Comprehensive benchmark for RAG☆185Updated 7 months ago
- Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks".☆162Updated last month
- ☆45Updated 9 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆128Updated last year
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆142Updated 5 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆110Updated 8 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆208Updated last week
- ☆38Updated 10 months ago
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆106Updated 7 months ago
- Codebase accompanying the Summary of a Haystack paper.☆78Updated 8 months ago
- Large language models for document ranking.☆54Updated 3 weeks ago
- ☆74Updated 4 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆84Updated 9 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆53Updated 6 months ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆128Updated last year
- ☆41Updated 3 months ago
- ☆149Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆135Updated 6 months ago
- ☆173Updated 9 months ago
- ☆69Updated last year
- The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey.☆158Updated last month
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆131Updated last year
- ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆104Updated last year
- Finetune mistral-7b-instruct for sentence embeddings☆82Updated last year
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆453Updated this week