baixianghuang / HalluEditBenchLinks

Can Knowledge Editing Really Correct Hallucinations? (ICLR 2025)

☆26

Alternatives and similar repositories for HalluEditBench

Users that are interested in HalluEditBench are comparing it to the libraries listed below

Sorting:

hkust-nlp / Activation_Decoding
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆61Updated last year
ChnQ / TracingLLM
☆30Updated last year
ChenmienTan / malmen
☆35Updated last year
yaojin17 / Unlearning_LLM
[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"
☆61Updated last year
jinhaoduan / SAR
[ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models
☆59Updated last year
jinzhuoran / RWKU
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
☆85Updated last year
Lordog / R-Judge
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)
☆91Updated 5 months ago
Zayne-sprague / To-CoT-or-not-to-CoT
☆25Updated 6 months ago
ozyyshr / RAST
Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)
☆19Updated 2 weeks ago
SafeAILab / RAIN
[ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning
☆99Updated last year
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆86Updated 3 weeks ago
kevinyaobytedance / llm_unlearn
LLM Unlearning
☆177Updated 2 years ago
RUCAIBox / HaluEval-2.0
☆47Updated last year
boyiwei / CoTaEval
[NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models
☆17Updated last year
MiaoXiong2320 / llm-uncertainty
code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"
☆135Updated last year
sail-sg / CPO
[NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.
☆131Updated 7 months ago
alisawuffles / proxy-tuning
Code associated with Tuning Language Models by Proxy (Liu et al., 2024)
☆121Updated last year
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆66Updated 11 months ago
zhiyuanhubj / UoT
[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
☆102Updated last year
zepingyu0512 / neuron-attribution
code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models
☆45Updated 11 months ago
rookie-joe / AutoPSV
☆50Updated last year
deeplearning-wisc / haloscope
source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"
☆60Updated 6 months ago
zjunlp / KnowledgeCircuits
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
☆159Updated 8 months ago
MingyuJ666 / The-Impact-of-Reasoning-Step-Length-on-Large-Language-Models
[ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…
☆46Updated 5 months ago
tatsu-lab / test_set_contamination
☆41Updated last year
Vaidehi99 / InfoDeletionAttacks
☆46Updated 8 months ago
Jometeorie / KnowledgeSpread
☆35Updated last year
princeton-nlp / benign-data-breaks-safety
☆41Updated last year
MingyuJ666 / LVLM-Safety
[FCS'24] LVLM Safety paper
☆19Updated 9 months ago
tmlr-group / NoisyRationales
[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"
☆37Updated 3 months ago