sambowyer / bayes_evalsLinks
A lightweight library for Bayesian analysis of LLM evals (ICML 2025 Spotlight Position Paper)
☆21Updated 6 months ago
Alternatives and similar repositories for bayes_evals
Users that are interested in bayes_evals are comparing it to the libraries listed below
Sorting:
- Extending Conformal Prediction to LLMs☆68Updated last year
- Attribution-based Parameter Decomposition☆33Updated 6 months ago
- [ICML 2025] HypotheSAEs: Hypothesizing interpretable relationships in text datasets using sparse autoencoders. https://arxiv.org/abs/2502…☆66Updated last month
- Interpret text data with LLMs (sklearn compatible).☆172Updated 2 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 8 months ago
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).☆96Updated 4 months ago
- This is the repository for the CONFLARE (CONformal LArge language model REtrieval) Python package.☆21Updated last year
- A package for statistically rigorous scientific discovery using machine learning. Implements prediction-powered inference.☆269Updated 3 months ago
- relplot: Utilities for measuring calibration and plotting reliability diagrams☆175Updated last month
- PyTorch library for Active Fine-Tuning☆95Updated 2 months ago
- Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University☆227Updated 3 weeks ago
- SDLG is an efficient method to accurately estimate aleatoric semantic uncertainty in LLMs☆28Updated last year
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆32Updated last year
- A statistical toolkit for scientific discovery using machine learning☆79Updated last year
- Probabilistic programming with large language models☆154Updated last month
- Open source replication of Anthropic's Crosscoders for Model Diffing☆63Updated last year
- Testing Language Models for Memorization of Tabular Datasets.☆36Updated 10 months ago
- This repository contains a Jax implementation of conformal training corresponding to the ICLR'22 paper "learning optimal conformal classi…☆130Updated 3 years ago
- A package for conformal prediction with conditional guarantees.☆67Updated 2 months ago
- ☆24Updated 8 months ago
- A Natural Language Interface to Explainable Boosting Machines☆68Updated last year
- ☆44Updated last year
- Portfolio REgret for Confidence SEquences☆20Updated last year
- ☆88Updated last week
- Conformal prediction for controlling monotonic risk functions. Simple accompanying PyTorch code for conformal risk control in computer vi…☆73Updated 2 years ago
- ☆233Updated 3 weeks ago
- Extract full next-token probabilities via language model APIs☆248Updated last year
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆134Updated last year
- Discovering Data-driven Hypotheses in the Wild☆122Updated 6 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆51Updated last year