sambowyer / bayes_evalsLinks
A lightweight library for Bayesian analysis of LLM evals (ICML 2025 Spotlight Position Paper)
☆20Updated 3 months ago
Alternatives and similar repositories for bayes_evals
Users that are interested in bayes_evals are comparing it to the libraries listed below
Sorting:
- Extending Conformal Prediction to LLMs☆67Updated last year
- PyTorch library for Active Fine-Tuning☆90Updated last week
- Attribution-based Parameter Decomposition☆30Updated 2 months ago
- ☆22Updated 4 months ago
- ☆106Updated 6 months ago
- Probabilistic programming with large language models☆134Updated last month
- This is the repository for the CONFLARE (CONformal LArge language model REtrieval) Python package.☆20Updated last year
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆28Updated 9 months ago
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).☆83Updated 3 weeks ago
- SDLG is an efficient method to accurately estimate aleatoric semantic uncertainty in LLMs☆27Updated last year
- Understanding how features learned by neural networks evolve throughout training☆37Updated 10 months ago
- Sparse and discrete interpretability tool for neural networks☆63Updated last year
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆46Updated 9 months ago
- Portfolio REgret for Confidence SEquences☆20Updated 8 months ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆28Updated last year
- ☆69Updated last year
- Open source replication of Anthropic's Crosscoders for Model Diffing☆59Updated 10 months ago
- Interpret text data using LLMs (scikit-learn compatible).☆170Updated last week
- Learning to route instances for Human vs AI Feedback (ACL 2025 Main)☆23Updated last month
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆72Updated last year
- ☆141Updated 2 weeks ago
- Extract full next-token probabilities via language model APIs☆247Updated last year
- ☆40Updated last year
- Functional Benchmarks and the Reasoning Gap☆88Updated 11 months ago
- ☆31Updated 4 months ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆18Updated 2 months ago
- code for training & evaluating Contextual Document Embedding models☆197Updated 3 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 4 months ago
- ☆78Updated 2 weeks ago
- ☆28Updated 6 months ago