MiaoXiong2320 / llm-uncertainty

Code repository for the ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"

☆125

Alternatives and similar repositories for llm-uncertainty

Users who are interested in llm-uncertainty are comparing it to the repositories listed below.

jinhaoduan / SAR
[ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models
☆53 · Updated 11 months ago
UCSB-NLP-Chang / llm_uncertainty
☆32 · Updated last year
zlin7 / UQ-NLG
☆97 · Updated last year
lorenzkuhn / semantic_uncertainty
☆171 · Updated last year
deeplearning-wisc / haloscope
source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"
☆51 · Updated 3 months ago
hkust-nlp / Activation_Decoding
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆61 · Updated last year
ajyl / dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
☆74 · Updated 4 months ago
deeplearning-wisc / args
☆43 · Updated last year
dannyallover / overthinking_the_truth
☆29 · Updated last year
yaojin17 / Unlearning_LLM
[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"
☆59 · Updated 10 months ago
balevinstein / Probes
☆52 · Updated 2 years ago
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆62 · Updated 8 months ago
zhiyuanhubj / UoT
[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
☆100 · Updated last year
AlexanderVNikitin / kernel-language-entropy
Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)
☆27 · Updated 7 months ago
zjysteven / mink-plus-plus
[ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs
☆41 · Updated 2 months ago
javiferran / sae_entities
☆60 · Updated 4 months ago
activatedgeek / calibration-tuning
☆51 · Updated 3 months ago
zepingyu0512 / in-context-mechanism
code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…
☆13 · Updated 8 months ago
UCSB-NLP-Chang / causal_unlearn
[EMNLP 2024] "Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective"
☆25 · Updated last year
Jiuzhouh / Uncertainty-Aware-Language-Agent
This is the official repo for Towards Uncertainty-Aware Language Agent.
☆26 · Updated 11 months ago
princeton-nlp / benign-data-breaks-safety
☆41 · Updated 10 months ago
ZFancy / awesome-activation-engineering
A curated list of resources for activation engineering
☆99 · Updated 2 months ago
chrisliu298 / awesome-representation-engineering
A resource repository for representation engineering in large language models
☆129 · Updated 8 months ago
tatsu-lab / test_set_contamination
☆38 · Updated last year
SafeAILab / RAIN
[ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning
☆96 · Updated last year
Thartvigsen / GRACE
[NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
☆78 · Updated 7 months ago
Zayne-sprague / To-CoT-or-not-to-CoT
☆26 · Updated 3 months ago
abhishekpanigrahi1996 / Skill-Localization-by-grafting
☆51 · Updated last year
princeton-nlp / unintentional-unalignment
[ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
☆29 · Updated 6 months ago
fc2869 / lo-fit
LoFiT: Localized Fine-tuning on LLM Representations
☆39 · Updated 6 months ago