facebookresearch/HalluLens

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/HalluLens)

facebookresearch / HalluLens

Codebase for LLM Textual Hallucination Benchmark

☆84

Alternatives and similar repositories for HalluLens

Users that are interested in HalluLens are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AbhilashaRavichander / HALoGEN
View on GitHub
Code for the paper "HALoGEN: Fantastic LLM Hallucinations and Where To Find Them"
☆25May 18, 2025Updated last year
flageval-baai / HalluDial
View on GitHub
☆21Aug 19, 2024Updated last year
RUCAIBox / HaluEval-2.0
View on GitHub
☆50Jan 7, 2024Updated 2 years ago
open-compass / ANAH
View on GitHub
[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO
☆66Apr 30, 2025Updated last year
THU-KEG / LRM-FactEval
View on GitHub
☆17Jun 25, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
derenlei / FactCG
View on GitHub
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data (NAACL 2025)
☆17Jul 14, 2025Updated last year
ParticleMedia / RAGTruth
View on GitHub
Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"
☆260Dec 2, 2024Updated last year
Yixiao-Song / VeriScore
View on GitHub
☆39Dec 17, 2025Updated 7 months ago
dqxiu / KAssess
View on GitHub
☆14Oct 28, 2023Updated 2 years ago
RUCAIBox / HaluAgent
View on GitHub
☆23Jul 1, 2024Updated 2 years ago
vectara / FaithBench
View on GitHub
☆16May 12, 2025Updated last year
hkust-nlp / felm
View on GitHub
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆65Dec 25, 2023Updated 2 years ago
debjitpaul / Causal_CoT
View on GitHub
About The corresponding code from our paper " Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning…
☆13Jan 14, 2026Updated 6 months ago
bebr2 / RACE
View on GitHub
Code for RACE.
☆15Nov 12, 2025Updated 8 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
THU-BPM / ICT
View on GitHub
Official repo for ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models
☆28Mar 24, 2025Updated last year
shmsw25 / FActScore
View on GitHub
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…
☆450Apr 13, 2025Updated last year
DSN-2024 / DSN
View on GitHub
DSN jailbreak Attack & Evaluation Ensemble
☆17Feb 7, 2026Updated 5 months ago
armingh2000 / FactScoreLite
View on GitHub
FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package bu…
☆14Apr 25, 2024Updated 2 years ago
moussaKam / FrugalScore
View on GitHub
FrugalScore is an approach to learn a fixed, low cost version of any expensive NLG metric, while retaining most of its original performan…
☆16Sep 21, 2022Updated 3 years ago
thu-coai / BARREL
View on GitHub
[ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs
☆18May 21, 2025Updated last year
s-nlp / PsiloQA
View on GitHub
The PsiloQA pipeline automates the construction of a multilingual, span-level hallucination detection dataset with contexts.
☆16Apr 24, 2026Updated 3 months ago
thu-coai / TransferAttack
View on GitHub
[ACL 2025] Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints
☆19May 23, 2025Updated last year
GaryStack / Trustworthy-Evaluation
View on GitHub
Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)
☆19Jul 19, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
TIGER-AI-Lab / MMLU-Pro
View on GitHub
The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]
☆414Mar 18, 2026Updated 4 months ago
baixianghuang / HalluEditBench
View on GitHub
Can Knowledge Editing Really Correct Hallucinations? (ICLR 2025)
☆26Aug 10, 2025Updated 11 months ago
injadlu / DAMA
View on GitHub
[ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"
☆16May 24, 2025Updated last year
layer6ai-labs / CMLMC
View on GitHub
Code for the ICLR'22 paper "Improving Non-Autoregressive Translation Models Without Distillation"
☆18Mar 11, 2022Updated 4 years ago
launchnlp / LitCab
View on GitHub
☆25Jun 10, 2025Updated last year
Sckathach / subspace-rerouting
View on GitHub
Using Mechanistic Interpretability to Craft Adversarial Attacks against Large Language Models
☆15Jul 7, 2025Updated last year
GaryStack / MMR-V
View on GitHub
Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? [ICLR26]
☆40Jun 23, 2025Updated last year
curt-tigges / probity
View on GitHub
☆19Apr 10, 2025Updated last year
DYR1 / MoGU
View on GitHub
Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.
☆18Jan 14, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Jwoo5 / integrated-ehr-pipeline
View on GitHub
☆14Aug 9, 2024Updated last year
zthang / Focus
View on GitHub
☆24Feb 3, 2024Updated 2 years ago
ziweiji / Self_Reflection_Medical
View on GitHub
Code for paper Towards Mitigating LLM Hallucination via Self Reflection
☆30Oct 9, 2023Updated 2 years ago
francescortu / comp-mech
View on GitHub
Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals; ACL 2024
☆13May 24, 2024Updated 2 years ago
jczhang02 / MUSIC_dataset_script
View on GitHub
This repo contains script to download MUSIC dataset from youtube
☆12Jan 19, 2024Updated 2 years ago
zomux / lanmt-ebm
View on GitHub
lanmt ebm
☆12Jun 19, 2020Updated 6 years ago
Qwen-Applications / MARCH
View on GitHub
☆28Jun 9, 2026Updated last month