KempnerInstitute / llm_uncertaintyLinks
Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"
☆11Updated last year
Alternatives and similar repositories for llm_uncertainty
Users that are interested in llm_uncertainty are comparing it to the libraries listed below
Sorting:
- ☆32Updated last year
- ☆13Updated 7 months ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆29Updated last year
- ☆51Updated 2 years ago
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…☆12Updated last year
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs.☆57Updated 3 months ago
- ☆20Updated 3 months ago
- Post-processing for fair classification☆16Updated 7 months ago
- Group-conditional DRO to alleviate spurious correlations☆15Updated 4 years ago
- ☆16Updated last year
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"☆13Updated 4 years ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆78Updated 2 years ago
- ☆37Updated last year
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated 2 years ago
- Understanding Rare Spurious Correlations in Neural Network☆12Updated 3 years ago
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)☆13Updated 3 years ago
- Code repo for the model organisms and convergent directions of EM papers.☆48Updated 4 months ago
- ☆29Updated last year
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆21Updated last year
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆18Updated 3 years ago
- Source code of "Calibrating Large Language Models Using Their Generations Only", ACL2024☆22Updated last year
- ☆52Updated 2 years ago
- Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"☆29Updated last year
- Self-Supervised Alignment with Mutual Information☆20Updated last year
- ☆14Updated 5 years ago
- Learning adapter weights from task descriptions☆19Updated 2 years ago
- ☆34Updated 2 years ago
- Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"☆21Updated 2 years ago
- ☆17Updated 2 years ago