genglinliu / UnknownBench

Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge

☆14

Alternatives and similar repositories for UnknownBench

Users who are interested in UnknownBench are comparing it to the libraries listed below.

hkust-nlp / felm
GitHub repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆59 Updated last year
GAIR-NLP / alignment-for-honesty
☆75 Updated last year
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆66 Updated 11 months ago
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆117 Updated last year
balevinstein / Probes
☆57 Updated 2 years ago
HillZhang1999 / ICD
Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"
☆69 Updated last year
dannyallover / overthinking_the_truth
☆29 Updated last year
eric-mitchell / serac
Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model
☆68 Updated 2 years ago
nayeon7lee / FactualityPrompt
☆86 Updated 2 years ago
Nanami18 / Snowballed_Hallucination
☆44 Updated last year
katiekang1998 / llm_hallucinations
☆17 Updated last year
yizhongw / truthfulqa_reeval
☆11 Updated last year
RUCAIBox / Language-Specific-Neurons
☆85 Updated 10 months ago
RUCAIBox / HaluEval-2.0
☆47 Updated last year
OSU-NLP-Group / LLM-Knowledge-Conflict
[ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"
☆77 Updated last year
GAIR-NLP / BeHonest
BeHonest: Benchmarking Honesty in Large Language Models
☆34 Updated last year
lifan-yuan / OOD_NLP
[NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…
☆35 Updated 2 years ago
declare-lab / resta
Restore safety in fine-tuned language models through task arithmetic
☆29 Updated last year
Zhou-Zoey / RMB-Reward-Model-Benchmark
☆43 Updated 7 months ago
ruiqi-zhong / nlparam
Augmenting Statistical Models with Natural Language Parameters
☆29 Updated last year
zthang / Focus
☆20 Updated last year
launchnlp / LitCab
☆25 Updated 4 months ago
ajyl / dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
☆83 Updated 7 months ago
ellaneeman / disent_qa
This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.
☆16 Updated 2 years ago
alisawuffles / proxy-tuning
Code associated with Tuning Language Models by Proxy (Liu et al., 2024)
☆121 Updated last year
BunsenFeng / AbstainQA
AbstainQA, ACL 2024
☆28 Updated last year
edenbiran / RippleEdits
Evaluating the Ripple Effects of Knowledge Editing in Language Models
☆56 Updated last year
Thartvigsen / GRACE
[NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
☆81 Updated 10 months ago
SALT-NLP / chain-of-thought-bias
☆28 Updated last year
zepingyu0512 / neuron-attribution
code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models
☆45 Updated 11 months ago