vr25/hallucination-foundation-model-survey

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vr25/hallucination-foundation-model-survey)

vr25 / hallucination-foundation-model-survey

A Survey of Hallucination in Large Foundation Models

☆56

Alternatives and similar repositories for hallucination-foundation-model-survey

Users that are interested in hallucination-foundation-model-survey are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hongbinye / Cognitive-Mirage-Hallucinations-in-LLMs
View on GitHub
Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"
☆49Oct 21, 2023Updated 2 years ago
pillowsofwind / Course-Correction
View on GitHub
[EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"
☆20Oct 2, 2024Updated last year
HillZhang1999 / llm-hallucination-survey
View on GitHub
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large …
☆1,085Sep 27, 2025Updated 10 months ago
launchnlp / LitCab
View on GitHub
☆25Jun 10, 2025Updated last year
zhukun1020 / NoiseFilter_IB
View on GitHub
☆19Sep 3, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
siyan-zhao / ICL_decision_boundary
View on GitHub
official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…
☆20Jul 27, 2025Updated last year
allenai / dream
View on GitHub
☆23Sep 2, 2024Updated last year
hrwise-nlp / Cue-CoT
View on GitHub
Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs [EMNLP 2023 Findings]
☆24Nov 18, 2023Updated 2 years ago
sauc-abadal / ALT
View on GitHub
Official repository for ALT (ALignment with Textual feedback).
☆10Jul 25, 2024Updated 2 years ago
ShannonAI / backdoor_nlg
View on GitHub
☆18Jul 1, 2021Updated 5 years ago
mahmoudkanazzal / PromSec
View on GitHub
☆12Dec 22, 2025Updated 7 months ago
francescortu / comp-mech
View on GitHub
Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals; ACL 2024
☆13May 24, 2024Updated 2 years ago
ChanLiang / ORIG
View on GitHub
[ACL 2023 findings] Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization
☆17Aug 26, 2023Updated 2 years ago
kaistAI / InstructIR
View on GitHub
IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…
☆32Jun 13, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
dreasysnail / converse_GAN
View on GitHub
☆20Dec 18, 2022Updated 3 years ago
EsYoon7 / RLHF-TLCR
View on GitHub
[ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"
☆12Dec 6, 2024Updated last year
AsaCooperStickland / situational-awareness-evals
View on GitHub
Measuring the situational awareness of language models
☆41Feb 12, 2024Updated 2 years ago
Hediby / fastsent_theano
View on GitHub
Implementing FastSent in theano
☆12May 2, 2016Updated 10 years ago
zjunlp / FactCHD
View on GitHub
[IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection
☆90Apr 28, 2024Updated 2 years ago
yizhongw / truthfulqa_reeval
View on GitHub
☆12Mar 7, 2024Updated 2 years ago
kaiwenzha / contrastive-poisoning
View on GitHub
[ICLR 2023, Spotlight] Indiscriminate Poisoning Attacks on Unsupervised Contrastive Learning
☆32Dec 2, 2023Updated 2 years ago
uclanlp / ParaBART
View on GitHub
Code for our NAACL-2021 paper "Disentangling Semantics and Syntax in Sentence Embeddings with Pre-trained Language Models".
☆23Nov 8, 2021Updated 4 years ago
DanielJDufour / hatebase
View on GitHub
Python Version of Andrew Welter's Hatebase Wrapper
☆10Feb 20, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jiangjiechen / EDUCAT
View on GitHub
Resources for our AAAI 2022 paper: "Unsupervised Editing for Counterfactual Stories".
☆13Oct 25, 2022Updated 3 years ago
ZrW00 / MuScleLoRA
View on GitHub
The code implementation of MuScleLoRA (Accepted in ACL 2024)
☆10Dec 1, 2024Updated last year
zhao-ht / ConvexCertify
View on GitHub
This is the code of our work CISS Certified Robustness Against Natural Language Attacks by Causal Intervention published on ICML 2022
☆11Dec 6, 2022Updated 3 years ago
RUCAIBox / HaluEval-2.0
View on GitHub
☆50Jan 7, 2024Updated 2 years ago
allenai / FineGrainedRLHF
View on GitHub
☆283Jan 6, 2025Updated last year
2187Nick / ADAS
View on GitHub
Automated Design of Agentic Systems
☆10Sep 7, 2024Updated last year
XTxiatong / PaperArxiv
View on GitHub
This is an Uncertainty Study Arxiv
☆12Mar 4, 2025Updated last year
RJ-T / NIPS2022_EP_BNP
View on GitHub
Official Implementation of NIPS 2022 paper Pre-activation Distributions Expose Backdoor Neurons
☆15Jan 13, 2023Updated 3 years ago
Algorithmic-Alignment-Lab / CommonClaim
View on GitHub
Explore, Establish, Exploit: Red Teaming Language Models from Scratch
☆15Jun 21, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
mrcabbage972 / simple-toolformer
View on GitHub
A Python implementation of Toolformer using Huggingface Transformers
☆14Mar 20, 2023Updated 3 years ago
deeplearning-wisc / args
View on GitHub
☆47Feb 8, 2024Updated 2 years ago
dqxiu / KAssess
View on GitHub
☆14Oct 28, 2023Updated 2 years ago
ChanLiang / acl-emnlp-poster-templates
View on GitHub
Templates and examples for ACL and EMNLP conference posters.
☆15Oct 5, 2024Updated last year
krafton-ai / MPC
View on GitHub
The git repository of Modular Prompted Chatbot paper
☆35May 24, 2023Updated 3 years ago
UCSB-NLP-Chang / llm_uncertainty
View on GitHub
☆43Feb 2, 2024Updated 2 years ago
c-box / causalEval
View on GitHub
Code for ACL 2022 long paper: Can Prompt Probe Pretrained Language Models? Understanding the Invisible Risks from a Causal View
☆10May 17, 2022Updated 4 years ago