KempnerInstitute/llm_uncertainty

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KempnerInstitute/llm_uncertainty)

KempnerInstitute / llm_uncertainty

Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"

☆11

Alternatives and similar repositories for llm_uncertainty

Users that are interested in llm_uncertainty are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Butanium / tiny-activation-dashboard
View on GitHub
A tiny easily hackable implementation of a feature dashboard.
☆17Oct 21, 2025Updated 9 months ago
matchten / LoRA-Models-for-SAEs
View on GitHub
Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"
☆17Mar 31, 2025Updated last year
ThirdAIResearch / Dessert
View on GitHub
DESSERT Effeciently Searches Sets of Embeddings via Retrieval Tables
☆18Feb 21, 2024Updated 2 years ago
XuchanBao / behavioral-self-awareness
View on GitHub
☆37Feb 20, 2025Updated last year
fengyzpku / Simple_Dataset_Distillation
View on GitHub
A new simple method for dataset distillation called Randomized Truncated Backpropagation Through Time (RaT-BPTT)
☆14Apr 21, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
JoshEngels / FLINNG
View on GitHub
A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing
☆23Dec 9, 2023Updated 2 years ago
j-cb / GOOD
View on GitHub
Provable Worst Case Guarantees for the Detection of Out-of-Distribution Data
☆13Sep 20, 2022Updated 3 years ago
NeoVand / Debater
View on GitHub
Simulates a debate between two AI agents on a given topic
☆16Oct 11, 2024Updated last year
JoshEngels / SAE-Dark-Matter
View on GitHub
Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"
☆23Feb 6, 2025Updated last year
yangarbiter / interpretable-robust-trees
View on GitHub
Connecting Interpretability and Robustness in Decision Trees through Separation
☆17May 8, 2021Updated 5 years ago
EngSalem / HaLo
View on GitHub
☆16Sep 27, 2023Updated 2 years ago
atticusg / MultiplyQuantifiedData
View on GitHub
☆10Nov 1, 2019Updated 6 years ago
harshays / inputgradients
View on GitHub
Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)
☆12Jan 10, 2023Updated 3 years ago
mymakar / causally_motivated_shortcut_removal
View on GitHub
☆14Jul 5, 2023Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
feyzaakyurek / bbnli
View on GitHub
Bias Benchmark for Natural Language Inference. Code repo for the Findings of NAACL 2022 paper "On Measuring Social Biases in Prompt-Based…
☆15Apr 28, 2022Updated 4 years ago
graphml-lab-pwr / lapeigvals
View on GitHub
Implementation of the paper "Hallucination Detection in LLMs Using Spectral Features of Attention Maps"
☆16Oct 18, 2025Updated 9 months ago
kukrishna / genaudit
View on GitHub
☆15Mar 29, 2025Updated last year
AngelaZZZ-611 / reasoning_models_probing
View on GitHub
☆21May 14, 2026Updated 2 months ago
csinva / mdl-complexity
View on GitHub
MDL Complexity computations and experiments from the paper "Revisiting complexity and the bias-variance tradeoff".
☆18Jun 12, 2023Updated 3 years ago
tml-epfl / sam-low-rank-features
View on GitHub
Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]
☆29Sep 22, 2023Updated 2 years ago
violet-zct / group-conditional-DRO
View on GitHub
Group-conditional DRO to alleviate spurious correlations
☆15Jul 15, 2021Updated 5 years ago
Yifan-Song793 / GoodBadGreedy
View on GitHub
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
☆31Jul 17, 2024Updated 2 years ago
JoshEngels / SAE-Probes
View on GitHub
Code for reproducing our paper "Are Sparse Autoencoders Useful? A Case Study in Sparse Probing"
☆33Mar 31, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
utrerf / robust_transfer_learning
View on GitHub
Accelerating Transfer Learning with Robust Neural Nets
☆11Oct 2, 2020Updated 5 years ago
nusnlp / FSPO
View on GitHub
Official code for our paper "Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models"
☆26Oct 31, 2025Updated 8 months ago
zzzace2000 / robust_cls_model
View on GitHub
The code to reproduce CVPR 2021 paper "Towards Robust Classification Model by Counterfactual and Invariant Data Generation"
☆16Jul 29, 2021Updated 4 years ago
debjitpaul / Causal_CoT
View on GitHub
About The corresponding code from our paper " Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning…
☆13Jan 14, 2026Updated 6 months ago
JoshEngels / MultiDimensionalFeatures
View on GitHub
Code for reproducing our paper "Not All Language Model Features Are Linear"
☆90Nov 27, 2024Updated last year
balevinstein / Probes
View on GitHub
☆58Jun 30, 2023Updated 3 years ago
eth-lre / LLM_ICL
View on GitHub
ACL24
☆11Jun 7, 2024Updated 2 years ago
YanNeu / spurious_imagenet
View on GitHub
Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet
☆32Aug 22, 2023Updated 2 years ago
mggg / GerryChainJulia
View on GitHub
A high-performance implementation of GerryChain in Julia
☆19Jan 3, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
abhishekpanigrahi1996 / transformer_in_transformer
View on GitHub
☆47Oct 11, 2023Updated 2 years ago
leonardo-blas / usc-tg-24-us-election
View on GitHub
☆24Apr 15, 2026Updated 3 months ago
noelshin / Deep-Learning-Bootcamp-with-PyTorch
View on GitHub
Deep learning introduction to beginners with PyTorch
☆12Apr 24, 2020Updated 6 years ago
GuoTianYu2000 / Active-Dormant-Attention
View on GitHub
codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"
☆11Dec 30, 2024Updated last year
SwordElucidator / nanoBackpackLM
View on GitHub
The simplest repository for training medium-sized BackpackLM for cs224n
☆25Aug 13, 2023Updated 2 years ago
epfml / pam
View on GitHub
☆16Dec 9, 2023Updated 2 years ago
jhayes14 / black-box-attacks
View on GitHub
Comparison of gradient estimation techniques for black-box adversarial examples
☆11Oct 31, 2018Updated 7 years ago