sambowyer / bayes_evals
A lightweight library for Bayesian analysis of LLM evals (ICML 2025 Spotlight Position Paper)
☆21Updated 4 months ago
Alternatives and similar repositories for bayes_evals
Users who are interested in bayes_evals are comparing it to the libraries listed below.
- Attribution-based Parameter Decomposition☆31Updated 4 months ago
- Extending Conformal Prediction to LLMs☆68Updated last year
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 5 months ago
- Code for "Counterfactual Token Generation in Large Language Models", arXiv 2024.☆29Updated 11 months ago
- we got you bro☆36Updated last year
- ☆142Updated last month
- PyTorch library for Active Fine-Tuning☆93Updated 3 weeks ago
- Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆18Updated 4 months ago
- ☆35Updated 6 months ago
- ☆109Updated 8 months ago
- relplot: Utilities for measuring calibration and plotting reliability diagrams☆170Updated 3 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆59Updated 11 months ago
- Understanding how features learned by neural networks evolve throughout training☆39Updated 11 months ago
- SDLG is an efficient method to accurately estimate aleatoric semantic uncertainty in LLMs☆27Updated last year
- Code for minimum-entropy coupling.☆32Updated last year
- ☆22Updated 6 months ago
- Sparse and discrete interpretability tool for neural networks☆64Updated last year
- ☆230Updated last week
- Probabilistic programming with large language models☆139Updated 2 months ago
- 🧠 Starter templates for doing interpretability research☆75Updated 2 years ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆216Updated last week
- Interpret text data using LLMs (scikit-learn compatible).☆170Updated last week
- Sparse Autoencoder Training Library☆55Updated 5 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆147Updated 2 weeks ago
- ☆77Updated last year
- Extract full next-token probabilities via language model APIs☆247Updated last year
- This is the repository for the CONFLARE (CONformal LArge language model REtrieval) Python package.☆20Updated last year
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆129Updated 3 years ago
- A package for statistically rigorous scientific discovery using machine learning. Implements prediction-powered inference.☆260Updated last month
- A Python package for generating concise, high-quality summaries of a probability distribution☆53Updated last week