ParticleMedia / RAGTruthLinks
Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"
☆185Updated 6 months ago
Alternatives and similar repositories for RAGTruth
Users that are interested in RAGTruth are comparing it to the libraries listed below
Sorting:
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.☆133Updated last month
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆141Updated last month
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆215Updated last year
- Comprehensive benchmark for RAG☆191Updated last week
- ☆178Updated last week
- ☆283Updated last year
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆144Updated 6 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆114Updated 11 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆134Updated 7 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆129Updated last year
- LOFT: A 1 Million+ Token Long-Context Benchmark☆201Updated last week
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generaton☆193Updated last year
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆107Updated 7 months ago
- ☆44Updated last week
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆170Updated 2 weeks ago
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]☆164Updated 5 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆144Updated 7 months ago
- Multilingual Large Language Models Evaluation Benchmark☆124Updated 10 months ago
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆204Updated 6 months ago
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.☆99Updated last year
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆97Updated 4 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆157Updated last year
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆489Updated 8 months ago
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks☆57Updated last year
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆128Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆105Updated last week
- Generative Judge for Evaluating Alignment☆239Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆243Updated last year
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024)☆67Updated last month
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆127Updated 10 months ago