technion-cs-nlp / hallucination-mitigationLinks

☆22

Alternatives and similar repositories for hallucination-mitigation

Users that are interested in hallucination-mitigation are comparing it to the libraries listed below

Sorting:

abhika-m / FAVA
☆74Updated last year
allenai / noncompliance
This repository contains data, code and models for contextual noncompliance.
☆24Updated last year
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54Updated last year
GAIR-NLP / MetaCritique
Evaluate the Quality of Critique
☆36Updated last year
BunsenFeng / AbstainQA
AbstainQA, ACL 2024
☆28Updated last year
DAMO-NLP-SG / contrastive-cot
Contrastive Chain-of-Thought Prompting
☆68Updated 2 years ago
googleinterns / localizing-paragraph-memorization
☆15Updated last year
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆42Updated last year
UKPLab / acl2025-diverse-cot
Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"
☆33Updated 4 months ago
shizhediao / R-Tuning
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…
☆125Updated last year
Reason-Wang / NAT
[NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…
☆29Updated last year
eth-lre / LLM_ICL
ACL24
☆10Updated last year
wwxu21 / CUT
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆58Updated last year
Zayne-sprague / MuSR
☆55Updated last year
hkust-nlp / Activation_Decoding
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆62Updated last year
neulab / data-agora
[ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"
☆40Updated 11 months ago
Zce1112zslx / IKE
☆41Updated last year
GAIR-NLP / benbench
Benchmarking Benchmark Leakage in Large Language Models
☆56Updated last year
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆51Updated 5 months ago
hkust-nlp / B-STaR
B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
☆86Updated 5 months ago
THUNLP-MT / SKR
Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)
☆28Updated last year
SteveKGYang / MetaAligner
Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models
☆24Updated last year
psunlpgroup / ReaLMistake
This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".
☆30Updated last year
zthang / Focus
☆21Updated last year
ChengpengLi1003 / DotaMath
☆30Updated 10 months ago
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆67Updated 11 months ago
OSU-NLP-Group / LLM-Knowledge-Conflict
[ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"
☆77Updated last year
dmis-lab / CompAct
[EMNLP 2024] CompAct: Compressing Retrieved Documents Actively for Question Answering
☆36Updated last year
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆63Updated last year
OSU-NLP-Group / Middleware
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)
☆37Updated 10 months ago