sambowyer / bayes_evalsLinks
A lightweight library for Bayesian analysis of LLM evals (ICML 2025 Spotlight Position Paper)
☆19Updated 2 months ago
Alternatives and similar repositories for bayes_evals
Users that are interested in bayes_evals are comparing it to the libraries listed below
Sorting:
- SDLG is an efficient method to accurately estimate aleatoric semantic uncertainty in LLMs☆26Updated last year
- A collection of various LLM sampling methods implemented in pure Pytorch☆23Updated 8 months ago
- Extending Conformal Prediction to LLMs☆67Updated last year
- Portfolio REgret for Confidence SEquences☆20Updated 8 months ago
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).☆81Updated this week
- ☆136Updated 4 months ago
- Learning to route instances for Human vs AI Feedback (ACL 2025 Main)☆23Updated 2 weeks ago
- ☆62Updated 8 months ago
- ☆31Updated 3 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆143Updated 2 months ago
- ☆76Updated this week
- Sparse Autoencoder Training Library☆54Updated 3 months ago
- PyTorch library for Active Fine-Tuning☆88Updated 5 months ago
- Hypothesizing interpretable relationships in text datasets using sparse autoencoders.☆39Updated this week
- ☆43Updated 9 months ago
- Attribution-based Parameter Decomposition☆28Updated 2 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 3 months ago
- Erasing concepts from neural representations with provable guarantees☆232Updated 6 months ago
- ☆64Updated last week
- Discovering Data-driven Hypotheses in the Wild☆104Updated 2 months ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆28Updated last year
- Understanding how features learned by neural networks evolve throughout training☆36Updated 9 months ago
- Probabilistic programming with large language models☆129Updated 2 weeks ago
- This is the repository for the CONFLARE (CONformal LArge language model REtrieval) Python package.☆20Updated last year
- ☆104Updated 6 months ago
- ☆55Updated last week
- A mechanistic approach for understanding and detecting factual errors of large language models.☆47Updated last year
- Functional Benchmarks and the Reasoning Gap☆88Updated 10 months ago
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆28Updated 9 months ago
- ☆28Updated 6 months ago