sambowyer / bayes_evalsLinks
A lightweight library for Bayesian analysis of LLM evals (ICML 2025 Spotlight Position Paper)
☆21Updated 5 months ago
Alternatives and similar repositories for bayes_evals
Users that are interested in bayes_evals are comparing it to the libraries listed below
Sorting:
- Extending Conformal Prediction to LLMs☆68Updated last year
- Attribution-based Parameter Decomposition☆31Updated 4 months ago
- Interpret text data with LLMs (sklearn compatible).☆171Updated last month
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆19Updated 4 months ago
- A package for statistically rigorous scientific discovery using machine learning. Implements prediction-powered inference.☆264Updated 2 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 6 months ago
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆30Updated last year
- PyTorch library for Active Fine-Tuning☆93Updated last month
- This is the repository for the CONFLARE (CONformal LArge language model REtrieval) Python package.☆20Updated last year
- An introduction to LLM Sampling☆79Updated 10 months ago
- ☆142Updated 2 months ago
- ☆230Updated last week
- Portfolio REgret for Confidence SEquences☆20Updated 10 months ago
- SDLG is an efficient method to accurately estimate aleatoric semantic uncertainty in LLMs☆26Updated last year
- relplot: Utilities for measuring calibration and plotting reliability diagrams☆170Updated this week
- Open source replication of Anthropic's Crosscoders for Model Diffing☆59Updated last year
- Because we don't want a jupyter notebook mess...☆61Updated 4 months ago
- ☆36Updated 6 months ago
- Erasing concepts from neural representations with provable guarantees☆239Updated 9 months ago
- ☆43Updated last year
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).☆91Updated 3 months ago
- Probabilistic programming with large language models☆141Updated last week
- Sparse Autoencoder Training Library☆55Updated 6 months ago
- ☆23Updated 6 months ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆129Updated 3 years ago
- ☆78Updated last year
- Unified access to Large Language Model modules using NNsight☆55Updated this week
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆28Updated last year
- 🧠 Starter templates for doing interpretability research☆75Updated 2 years ago
- Extract full next-token probabilities via language model APIs☆247Updated last year