EdinburghNLP/awesome-hallucination-detection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/EdinburghNLP/awesome-hallucination-detection)

EdinburghNLP / awesome-hallucination-detection

List of papers on hallucination detection in LLMs.

☆1,119

Alternatives and similar repositories for awesome-hallucination-detection

Users that are interested in awesome-hallucination-detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LuckyyySTA / Awesome-LLM-hallucination
View on GitHub
LLM hallucination paper list
☆338Mar 11, 2024Updated 2 years ago
HillZhang1999 / llm-hallucination-survey
View on GitHub
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large …
☆1,085Sep 27, 2025Updated 10 months ago
RUCAIBox / HaluEval
View on GitHub
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
☆592Feb 12, 2024Updated 2 years ago
potsawee / selfcheckgpt
View on GitHub
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
☆628Jun 26, 2024Updated 2 years ago
jxzhangjhu / Awesome-LLM-Uncertainty-Reliability-Robustness
View on GitHub
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
☆833Jun 5, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
shmsw25 / FActScore
View on GitHub
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…
☆450Apr 13, 2025Updated last year
balevinstein / Probes
View on GitHub
☆58Jun 30, 2023Updated 3 years ago
hkust-nlp / felm
View on GitHub
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆65Dec 25, 2023Updated 2 years ago
zjunlp / FactCHD
View on GitHub
[IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection
☆90Apr 28, 2024Updated 2 years ago
amazon-science / RefChecker
View on GitHub
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Langua…
☆435May 16, 2025Updated last year
vectara / hallucination-leaderboard
View on GitHub
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
☆3,295May 11, 2026Updated 2 months ago
showlab / Awesome-MLLM-Hallucination
View on GitHub
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
☆1,034Sep 27, 2025Updated 10 months ago
abhika-m / FAVA
View on GitHub
☆77Feb 16, 2024Updated 2 years ago
ParticleMedia / RAGTruth
View on GitHub
Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"
☆260Dec 2, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
likenneth / honest_llama
View on GitHub
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
☆581Jan 28, 2025Updated last year
oneal2000 / MIND
View on GitHub
Source code of our paper MIND, ACL 2024 Long Paper
☆65Nov 14, 2025Updated 8 months ago
HKUST-KnowComp / Knowledge-Constrained-Decoding
View on GitHub
Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detec…
☆30Nov 14, 2023Updated 2 years ago
voidism / DoLa
View on GitHub
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
☆557Jul 12, 2026Updated 2 weeks ago
wangcunxiang / LLM-Factuality-Survey
View on GitHub
The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>
☆339Mar 28, 2026Updated 3 months ago
zjunlp / KnowledgeEditingPapers
View on GitHub
Must-read Papers on Knowledge Editing for Large Language Models.
☆1,242Jun 25, 2026Updated last month
voidism / Lookback-Lens
View on GitHub
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
☆152Oct 13, 2025Updated 9 months ago
salesforce / factCC
View on GitHub
Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper
☆305May 1, 2025Updated last year
intuit / sac3
View on GitHub
Official repo for SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency
☆39Jan 18, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
xieyuquanxx / awesome-Large-MultiModal-Hallucination
View on GitHub
😎 curated list of awesome LMM hallucinations papers, methods & resources.
☆150Mar 23, 2024Updated 2 years ago
jlko / semantic_uncertainty
View on GitHub
Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments).
☆421Apr 12, 2024Updated 2 years ago
patronus-ai / Lynx-hallucination-detection
View on GitHub
☆47Jul 10, 2024Updated 2 years ago
yuh-zha / AlignScore
View on GitHub
ACL2023 - AlignScore, a metric for factual consistency evaluation.
☆164Mar 11, 2024Updated 2 years ago
EleutherAI / lm-evaluation-harness
View on GitHub
A framework for few-shot evaluation of language models.
☆13,415Jul 13, 2026Updated last week
RUCAIBox / HaluEval-2.0
View on GitHub
☆50Jan 7, 2024Updated 2 years ago
IINemo / lm-polygraph
View on GitHub
☆497May 18, 2026Updated 2 months ago
lorenzkuhn / semantic_uncertainty
View on GitHub
☆186Jun 20, 2024Updated 2 years ago
hkust-nlp / Activation_Decoding
View on GitHub
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆64Mar 30, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yuzhaouoe / SAE-based-representation-engineering
View on GitHub
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆83Jun 20, 2026Updated last month
zjunlp / EasyEdit
View on GitHub
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
☆2,883Jul 14, 2026Updated last week
maitrix-org / llm-reasoners
View on GitHub
A library for advanced large language model reasoning
☆2,341Jun 10, 2025Updated last year
ruizheliUOA / Awesome-Interpretability-in-Large-Language-Models
View on GitHub
This repository collects all relevant resources about interpretability in LLMs
☆402Nov 1, 2024Updated last year
AmourWaltz / Awesome-Reliable-LLM
View on GitHub
☆193Mar 8, 2026Updated 4 months ago
wasiahmad / Awesome-LLM-Synthetic-Data
View on GitHub
A reading list on LLM based Synthetic Data Generation 🔥
☆1,545Jun 5, 2025Updated last year
huggingface / trl
View on GitHub
Train transformer language models with reinforcement learning.
☆18,934Updated this week