microsoft/HaDes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/HaDes)

microsoft / HaDes

Token-level Reference-free Hallucination Detection

☆97

Alternatives and similar repositories for HaDes

Users that are interested in HaDes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

violet-zct / fairseq-detect-hallucination
View on GitHub
Detect hallucinated tokens for conditional sequence generation.
☆64Apr 15, 2022Updated 4 years ago
McGill-NLP / FaithDial
View on GitHub
☆51Feb 5, 2023Updated 3 years ago
RUCAIBox / HaluEval
View on GitHub
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
☆592Feb 12, 2024Updated 2 years ago
KaijuML / dtt-multi-branch
View on GitHub
Code for Controlling Hallucinations at Word Level in Data-to-Text Generation (C. Rebuffel, M. Roberti, L. Soulier, G. Scoutheeten, R. Can…
☆16Jun 12, 2023Updated 3 years ago
nouhadziri / Neural-Path-Hunter
View on GitHub
Code for the EMNLP'21 paper "Neural Path Hunter: Reducing Hallucination in Dialogue Systems via Path Grounding"
☆16Mar 13, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lcl-hse / heptabot
View on GitHub
A full-text error corrector for English based on transformers and deep learning
☆10Jan 8, 2023Updated 3 years ago
potsawee / selfcheckgpt
View on GitHub
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
☆627Jun 26, 2024Updated 2 years ago
mcao516 / EntFA
View on GitHub
☆27Nov 6, 2022Updated 3 years ago
yuh-zha / AlignScore
View on GitHub
ACL2023 - AlignScore, a metric for factual consistency evaluation.
☆164Mar 11, 2024Updated 2 years ago
mega002 / qdmr-based-question-generation
View on GitHub
The official code of TACL 2022, "Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition".
☆12Oct 18, 2021Updated 4 years ago
hkust-nlp / felm
View on GitHub
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆65Dec 25, 2023Updated 2 years ago
yale-nlp / InstruSum
View on GitHub
☆23Feb 26, 2024Updated 2 years ago
nayeon7lee / FactualityPrompt
View on GitHub
☆90Nov 11, 2022Updated 3 years ago
salesforce / factCC
View on GitHub
Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper
☆305May 1, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
google-research / true
View on GitHub
Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".
☆92Jun 16, 2026Updated last month
ZhangShiyue / extractive_is_not_faithful
View on GitHub
☆17May 19, 2023Updated 3 years ago
google / BEGIN-dataset
View on GitHub
A benchmark dataset for evaluating dialog system and natural language generation metrics.
☆39Jun 13, 2022Updated 4 years ago
shmsw25 / FActScore
View on GitHub
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…
☆448Apr 13, 2025Updated last year
soco-ai / SF-QA
View on GitHub
Evaluation framework for open-domain question answering.
☆20May 16, 2021Updated 5 years ago
potsawee / mqag0
View on GitHub
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency
☆31Sep 11, 2023Updated 2 years ago
orhonovich / q-squared
View on GitHub
☆30Sep 5, 2021Updated 4 years ago
hkust-nlp / Activation_Decoding
View on GitHub
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆63Mar 30, 2024Updated 2 years ago
launchnlp / LitCab
View on GitHub
☆25Jun 10, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
xieyxclack / factual_coco
View on GitHub
The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch.
☆17Nov 11, 2021Updated 4 years ago
flageval-baai / HalluDial
View on GitHub
☆21Aug 19, 2024Updated last year
yxuansu / Contrastive_Search_Is_What_You_Need
View on GitHub
[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation
☆122Mar 5, 2023Updated 3 years ago
AI21Labs / factor
View on GitHub
Code and data for the FACTOR paper
☆54Nov 15, 2023Updated 2 years ago
google-research-datasets / xsum_hallucination_annotations
View on GitHub
Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…
☆84Nov 26, 2020Updated 5 years ago
xu1998hz / SEScore2
View on GitHub
☆17Mar 3, 2025Updated last year
seanie12 / SWEP
View on GitHub
[ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA
☆16May 11, 2022Updated 4 years ago
Shikib / fed
View on GitHub
Code for SIGdial 2020 paper: Unsupervised Evaluation of Interactive Dialog with DialoGPT (https://arxiv.org/abs/2006.12719)
☆28Jun 8, 2020Updated 6 years ago
jderiu / spot-the-bot-code
View on GitHub
☆13Mar 1, 2022Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
amazon-science / fact-check-summarization
View on GitHub
☆77May 3, 2024Updated 2 years ago
zthang / Focus
View on GitHub
☆24Feb 3, 2024Updated 2 years ago
D3Mlab / cr-lt-kgqa
View on GitHub
CR-LT KGQA Dataset Repository
☆10Jun 1, 2025Updated last year
youngbin-ro / Multi2OIE
View on GitHub
Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020)
☆55Aug 14, 2022Updated 3 years ago
google-research / dialog-inpainting
View on GitHub
☆97Aug 6, 2022Updated 3 years ago
facebookresearch / HalluLens
View on GitHub
Codebase for LLM Textual Hallucination Benchmark
☆84Apr 25, 2025Updated last year
bckim92 / colloquial-claims
View on GitHub
✅ How Robust are Fact Checking Systems on Colloquial Claims?. In NAACL-HLT, 2021.
☆22Jul 1, 2021Updated 5 years ago