Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"
☆227Dec 2, 2024Updated last year
Alternatives and similar repositories for RAGTruth
Users that are interested in RAGTruth are comparing it to the libraries listed below
Sorting:
- List of papers on hallucination detection in LLMs.☆1,053Jan 11, 2026Updated last month
- ☆49Jan 7, 2024Updated 2 years ago
- ☆13Aug 26, 2024Updated last year
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models☆602Jun 26, 2024Updated last year
- The implement of paper:"ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability"☆61Jun 3, 2025Updated 8 months ago
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.☆554Feb 12, 2024Updated 2 years ago
- Official code for the paper Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception. The code is based on t…☆19Aug 5, 2025Updated 6 months ago
- ☆22Jan 13, 2025Updated last year
- ☆21Jun 12, 2024Updated last year
- ☆18Oct 6, 2022Updated 3 years ago
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆309May 1, 2025Updated 9 months ago
- Code for the paper "HALoGEN: Fantastic LLM Hallucinations and Where To Find Them"☆24May 18, 2025Updated 9 months ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆152Mar 11, 2024Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆63Dec 25, 2023Updated 2 years ago
- ☆22Feb 3, 2024Updated 2 years ago
- ☆21Aug 19, 2024Updated last year
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆2,326May 25, 2024Updated last year
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025.☆25May 15, 2025Updated 9 months ago
- ☆28Apr 14, 2025Updated 10 months ago
- ☆14Feb 2, 2025Updated last year
- ☆14Apr 29, 2025Updated 9 months ago
- Source code of DRAGIN, ACL 2024 main conference Long Paper (Oral)☆186Dec 5, 2025Updated 2 months ago
- StrategyQA 데이터 세트 번역☆23Apr 12, 2024Updated last year
- Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large …☆1,076Sep 27, 2025Updated 5 months ago
- A Benchmark for Multi-Stage Legal Case Documents Generation☆15Feb 24, 2025Updated last year
- this is based on the paper Chain-of-Retrieval Augmented Generation☆14Mar 29, 2025Updated 10 months ago
- Codes for "Benchmarking the Generation of Fact Checking Explanations"☆10Aug 16, 2024Updated last year
- ☆10Oct 14, 2020Updated 5 years ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆415Apr 13, 2025Updated 10 months ago
- ☆187Jul 2, 2025Updated 7 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆87Aug 12, 2024Updated last year
- ☆76Feb 16, 2024Updated 2 years ago
- This repositary hosts my experiments for the project, I did with OffNote Labs.☆10Apr 12, 2021Updated 4 years ago
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 5 months ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detec…☆30Nov 14, 2023Updated 2 years ago
- Reformatted Alignment☆111Sep 23, 2024Updated last year
- LettuceDetect is a hallucination detection framework for RAG applications.☆533Sep 9, 2025Updated 5 months ago
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts☆329Sep 9, 2024Updated last year