intuit-ai-research / SPUQ
SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models
☆16 · Updated last year
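Since the repository description is terse, here is a minimal sketch of the perturbation-based idea behind SPUQ, assuming a generic `ask(prompt)` LLM call; the `perturb` variants and the exact-match agreement score below are illustrative assumptions, not the repository's actual interface.

```python
# Minimal sketch of perturbation-based uncertainty quantification in the
# spirit of SPUQ: perturb the input, collect one answer per perturbation,
# and read uncertainty off the disagreement among answers.
from collections import Counter

def ask(prompt: str) -> str:
    """Stand-in for an LLM call; swap in a real client here."""
    raise NotImplementedError

def perturb(prompt: str) -> list[str]:
    """Cheap illustrative input perturbations; paraphrasing and
    sampling-temperature changes are natural additions."""
    return [
        prompt,
        prompt + " Answer concisely.",
        "Question: " + prompt,
    ]

def confidence(prompt: str) -> float:
    """Agreement rate of the majority answer across perturbed prompts;
    low agreement signals high uncertainty."""
    answers = [ask(p).strip().lower() for p in perturb(prompt)]
    majority_count = Counter(answers).most_common(1)[0][1]
    return majority_count / len(answers)  # 1.0 = full agreement
```

Exact string match is the crudest possible aggregator; for free-form outputs, a softer text-similarity score between answers is the natural refinement.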
Alternatives and similar repositories for SPUQ
Users who are interested in SPUQ are comparing it to the repositories listed below.
- Benchmarking LLMs via Uncertainty Quantification ☆247 · Updated last year
- Code repo for the ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs" ☆135 · Updated last year
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization ☆31 · Updated 9 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆118 · Updated last year
- ☆102 · Updated last year
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment ☆57 · Updated last year
- Lightweight Adapting for Black-Box Large Language Models ☆23 · Updated last year
- [ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models ☆59 · Updated last year
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives" ☆27 · Updated last year
- ☆40 · Updated last year
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training ☆35 · Updated 6 months ago
- Official repository for Beyond Binary Rewards: Training LMs to Reason about Their Uncertainty ☆38 · Updated 2 months ago
- Code for the paper: Aligning Large Language Models with Representation Editing: A Control Perspective ☆34 · Updated 9 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025] ☆31 · Updated 9 months ago
- ☆30 · Updated last year
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective" ☆20 · Updated 2 years ago
- ☆52 · Updated 6 months ago
- ☆57 · Updated 2 years ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024) ☆76 · Updated last year
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples ☆44 · Updated 3 months ago
- ☆46 · Updated last year
- ☆41 · Updated last year
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024) ☆40 · Updated 11 months ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025) ☆19 · Updated 2 weeks ago
- Directional Preference Alignment ☆57 · Updated last year
- This is an official implementation of the paper "Building Math Agents with Multi-Turn Iterative Preference Learning" with multi-turn DP… ☆30 · Updated 10 months ago
- ☆244 · Updated 5 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods ☆136 · Updated 4 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization ☆91 · Updated last year
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities ☆23 · Updated 7 months ago