facebookresearch / CRAG
Comprehensive benchmark for RAG
☆133Updated 4 months ago
Alternatives and similar repositories for CRAG:
Users that are interested in CRAG are comparing it to the libraries listed below
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆156Updated 3 months ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆130Updated 2 months ago
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.☆120Updated 8 months ago
- Benchmarking library for RAG☆177Updated this week
- LOFT: A 1 Million+ Token Long-Context Benchmark☆176Updated last week
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆150Updated last year
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆104Updated 4 months ago
- Codebase accompanying the Summary of a Haystack paper.☆75Updated 5 months ago
- ☆159Updated 7 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆136Updated 4 months ago
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]☆130Updated 2 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆114Updated 4 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆85Updated last month
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆151Updated last year
- Code implementation of synthetic continued pretraining☆93Updated 2 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆194Updated this week
- ☆142Updated 10 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆77Updated last month
- Benchmark baseline for retrieval qa applications☆103Updated 10 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆409Updated this week
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".☆127Updated 10 months ago
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆203Updated 2 months ago
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆194Updated 9 months ago
- ☆119Updated 5 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆103Updated 6 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆125Updated 11 months ago
- ☆174Updated 2 years ago
- ☆131Updated last month
- Codebase for reproducing the experiments of the semantic uncertainty paper (paragraph-length experiments).☆54Updated 11 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆233Updated last year