Comprehensive benchmark for RAG
☆264 · Jun 14, 2025 · Updated 8 months ago
Alternatives and similar repositories for CRAG
Users who are interested in CRAG are comparing it to the libraries listed below
- ☆21 · Jun 12, 2024 · Updated last year
- ☆52 · Aug 14, 2024 · Updated last year
- ☆59 · Jan 19, 2025 · Updated last year
- ☆14 · Apr 16, 2024 · Updated last year
- Official repository for RAG-Gym ☆121 · Mar 4, 2025 · Updated last year
- Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval And Synthesis For SLMs ☆55 · Oct 7, 2025 · Updated 4 months ago
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation. ☆145 · Jan 6, 2026 · Updated last month
- ☆216 · Apr 2, 2025 · Updated 11 months ago
- ☆27 · May 23, 2024 · Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ☆12 · Jun 28, 2025 · Updated 8 months ago
- ☆356 · May 17, 2024 · Updated last year
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning ☆45 · Jun 24, 2025 · Updated 8 months ago
- Automated Evaluation of RAG Systems ☆695 · Mar 28, 2025 · Updated 11 months ago
- Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024) ☆426 · Apr 3, 2025 · Updated 11 months ago
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge ☆17 · Nov 16, 2021 · Updated 4 years ago
- ☆105 · Mar 25, 2025 · Updated 11 months ago
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025] ☆1,172 · Nov 17, 2025 · Updated 3 months ago
- Benchmark baseline for retrieval qa applications ☆120 · Apr 14, 2024 · Updated last year
- Code for Robust Fine-tuning (RbFT) ☆17 · Jan 31, 2025 · Updated last year
- CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models ☆361 · May 20, 2025 · Updated 9 months ago
- Corrective Retrieval Augmented Generation ☆446 · Oct 8, 2024 · Updated last year
- ☆33 · Jul 15, 2025 · Updated 7 months ago
- This is the official repository for Auto-RAG. ☆233 · Jul 18, 2025 · Updated 7 months ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models" ☆43 · Oct 1, 2024 · Updated last year
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark ☆166 · Oct 14, 2025 · Updated 4 months ago
- ☆21 · Apr 17, 2023 · Updated 2 years ago
- RAGChecker: A Fine-grained Framework For Diagnosing RAG ☆1,059 · Dec 13, 2024 · Updated last year
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆190 · Sep 13, 2025 · Updated 5 months ago
- [ICLR 2025] ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590 ☆79 · Jul 31, 2025 · Updated 7 months ago
- ☆16 · Sep 17, 2024 · Updated last year
- ☆14 · Apr 14, 2025 · Updated 10 months ago
- [ICLR 2025] Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation ☆156 · Jan 27, 2025 · Updated last year
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented… ☆50 · Apr 12, 2025 · Updated 10 months ago
- meta-comprehensive-rag-benchmark-kdd-cup-2024 phase1 task1 rank3 ☆21 · Jun 21, 2024 · Updated last year
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution ☆69 · Dec 8, 2025 · Updated 2 months ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning ☆46 · Jul 17, 2025 · Updated 7 months ago
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks ☆50 · Sep 4, 2025 · Updated 6 months ago
- Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker. ☆1,094 · Jul 5, 2025 · Updated 8 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆4,085 · Nov 13, 2025 · Updated 3 months ago