katiekang1998 / llm_hallucinations

☆17

Alternatives and similar repositories for llm_hallucinations

Users who are interested in llm_hallucinations are comparing it to the repositories listed below

GAIR-NLP / alignment-for-honesty
☆76 Updated last year
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆118 Updated last year
HillZhang1999 / ICD
Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"
☆69 Updated last year
Zce1112zslx / IKE
☆41 Updated 2 years ago
hkust-nlp / felm
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆61 Updated last year
BunsenFeng / AbstainQA
AbstainQA, ACL 2024
☆28 Updated last year
dannyallover / overthinking_the_truth
☆29 Updated last year
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆67 Updated last year
GAIR-NLP / BeHonest
BeHonest: Benchmarking Honesty in Large Language Models
☆34 Updated last year
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆63 Updated last year
yizhongw / llm-temporal-alignment
Methods and evaluation for aligning language models temporally
☆30 Updated last year
RUCAIBox / HaluEval-2.0
☆47 Updated last year
genglinliu / UnknownBench
Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge
☆14 Updated last year
F2-Song / ICDPO
The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…
☆16 Updated last year
eric-mitchell / serac
Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model
☆70 Updated 3 years ago
edenbiran / RippleEdits
Evaluating the Ripple Effects of Knowledge Editing in Language Models
☆55 Updated last year
swj0419 / in-context-pretraining
☆54 Updated last year
nayeon7lee / FactualityPrompt
☆87 Updated 3 years ago
ADaM-BJTU / W2SG
The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”
☆17 Updated last year
dqxiu / CaliNet
☆32 Updated 3 years ago
feyzaakyurek / dune
Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.
☆22 Updated last year
Re-Align / AlignTDS
Analyzing LLM Alignment via Token distribution shift
☆17 Updated last year
NJUNLP / QAlign
☆38 Updated last year
ellaneeman / disent_qa
This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.
☆16 Updated 2 years ago
ruiqi-zhong / nlparam
Augmenting Statistical Models with Natural Language Parameters
☆29 Updated last year
HKUST-KnowComp / Knowledge-Constrained-Decoding
Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detec…
☆30 Updated 2 years ago
Yangyi-Chen / PaperList-Trustworthy-Applications
Mostly recording papers about models' trustworthy applications. Intending to include topics like model evaluation & analysis, security, c…
☆21 Updated 2 years ago
hkust-nlp / PEM_composition
[NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"
☆61 Updated 2 years ago
Nanami18 / Snowballed_Hallucination
☆44 Updated last year
hanxuhu / SeqIns
The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…
☆30 Updated last year