shuoli90 / Rank-Calibration

This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.

☆13

Alternatives and similar repositories for Rank-Calibration

Users who are interested in Rank-Calibration are comparing it to the repositories listed below.

hartvigsen-group / composable-interventions
☆28 Updated 7 months ago
Varal7 / conformal-language-modeling
Conformal Language Modeling
☆32 Updated last year
UCSB-NLP-Chang / llm_uncertainty
☆40 Updated last year
alisawuffles / tokenizer-attack
Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"
☆15 Updated 5 months ago
formll / resolving-scaling-law-discrepancies
☆20 Updated last year
cambridgeltl / zepo
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)
☆13 Updated last year
HazyResearch / skill-it
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models
☆47 Updated last year
yfqiu-nlp / sea-llm
Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"
☆28 Updated 10 months ago
p-lambda / in-n-out
Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"
☆13 Updated 3 years ago
Hritikbansal / jpo
☆13 Updated 3 months ago
tatsu-lab / linguistic_calibration
Align your LM to express calibrated verbal statements of confidence in its long-form generations.
☆27 Updated last year
activatedgeek / calibration-tuning
☆52 Updated 6 months ago
probabilistic-inference-scaling / probabilistic-inference-scaling
☆51 Updated 7 months ago
haotiansun14 / BBox-Adapter
Lightweight Adapting for Black-Box Large Language Models
☆23 Updated last year
DeqingFu / transformers-icl-second-order
Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…
☆18 Updated 11 months ago
princeton-nlp / unintentional-unalignment
[ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
☆31 Updated 8 months ago
zlin7 / UQ-NLG
☆100 Updated last year
msakarvadia / AttentionLens
Interpreting the latent space representations of attention head outputs for LLMs
☆34 Updated last year
KempnerInstitute / llm_uncertainty
Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"
☆10 Updated last year
bhaweshiitk / ConformalLLM
Extending Conformal Prediction to LLMs
☆68 Updated last year
Bradley-Butcher / Conformers
Unofficial implementation of Conformal Language Modeling by Quach et al
☆29 Updated 2 years ago
LoryPack / LLM-LieDetector
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
☆71 Updated last year
stanfordnlp / axbench
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
☆136 Updated 3 months ago
srzer / MOD
Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".
☆26 Updated 11 months ago
lifan-yuan / OOD_NLP
[NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…
☆35 Updated 2 years ago
milesaturpin / cot-unfaithfulness
☆48 Updated last year
jiahai-feng / binding-iclr
☆15 Updated last year
alon-albalak / FLAD
Few-shot Learning with Auxiliary Data
☆31 Updated last year
nathanhu0 / CaMeLS
Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.
☆25 Updated last year
sylinrl / CalibratedMath
Teaching Models to Express Their Uncertainty in Words
☆39 Updated 3 years ago