ParticleMedia / RAGTruth
Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"
☆157Updated 3 months ago
Alternatives and similar repositories for RAGTruth:
Users that are interested in RAGTruth are comparing it to the libraries listed below
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆194Updated 9 months ago
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generaton☆190Updated 11 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆108Updated 8 months ago
- ☆174Updated 2 years ago
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.☆122Updated 8 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆89Updated last month
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆254Updated last year
- [NAACL 2024] End-to-End Beam Retrieval for Multi-Hop Question Answering☆95Updated 11 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆136Updated 4 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆104Updated 6 months ago
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆104Updated 4 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆234Updated last year
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆116Updated 4 months ago
- ☆142Updated 11 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆126Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆152Updated last year
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆123Updated last year
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆78Updated last month
- Code accompanying "How I learned to start worrying about prompt formatting".☆102Updated 5 months ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆477Updated 5 months ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆132Updated 3 months ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆65Updated 7 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆131Updated 4 months ago
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆203Updated 3 months ago
- Generative Judge for Evaluating Alignment☆230Updated last year
- ☆275Updated last year
- Benchmarking library for RAG☆178Updated this week
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆212Updated 4 months ago
- Comprehensive benchmark for RAG☆144Updated 4 months ago
- Finetune mistral-7b-instruct for sentence embeddings☆81Updated 10 months ago