A lightweight library for Bayesian analysis of LLM evals (ICML 2025 Spotlight Position Paper)
☆25May 28, 2025Updated 10 months ago
Alternatives and similar repositories for bayes_evals
Users that are interested in bayes_evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] ParallelBench: Understanding the Tradeoffs of Parallel Decoding in Diffusion LLMs☆45Mar 27, 2026Updated 3 weeks ago
- ☆15Aug 14, 2025Updated 8 months ago
- Portfolio REgret for Confidence SEquences☆21Jan 6, 2026Updated 3 months ago
- Oak National Academy's AI Auto Eval tools provide LLM as a judge evaluation on lesson plans and resources☆17Nov 4, 2025Updated 5 months ago
- Teaching Models to Express Their Uncertainty in Words☆38May 26, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Stochastic trace estimation using JAX☆17Aug 20, 2025Updated 7 months ago
- ☆10Feb 9, 2026Updated 2 months ago
- ☆15Nov 23, 2023Updated 2 years ago
- Course materials for PSYCH101-D. "Data Science for Research Psychology"☆12Apr 6, 2022Updated 4 years ago
- A manuscript exploring the effects of taxonomic bias on microbiome differential-abundance analysis.☆12Oct 30, 2023Updated 2 years ago
- An R package to help assess the sensitivity of a Bayesian model (fitted with Stan) to the specification of its likelihood and priors☆11Apr 8, 2025Updated last year
- Define command-line interfaces using ordinary dart methods and classes.☆22Dec 13, 2017Updated 8 years ago
- Materials for a 'Python for Science' bootcamp workshop.☆13Sep 23, 2018Updated 7 years ago
- Files required to follow along the introduction session to machine learning with sklearn and nilearn☆12Jun 18, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Streamlit Multi AI Platform Chat App☆10Nov 5, 2024Updated last year
- ☆13Updated this week
- Run LLMs on Replicate with vLLM☆26Jul 19, 2025Updated 9 months ago
- Nilearn tutorials for OHBM 2016 educational course☆13Jul 13, 2016Updated 9 years ago
- Implicit Deep Adaptive Design (iDAD): Policy-Based Experimental Design without Likelihoods☆23Dec 30, 2021Updated 4 years ago
- Course material for the Advanced Cognitive Modeling class (Master students, Aarhus University)☆22May 10, 2021Updated 4 years ago
- Train neural networks to use as SMC and importance sampling proposals☆24Dec 6, 2017Updated 8 years ago
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- Utility provides a more meaningful measure of forecast skill than goodness-of-fit☆18Apr 26, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Python Model Builder - fit statistical models using algorithmic differentiation☆13Mar 7, 2019Updated 7 years ago
- ☆12Sep 26, 2019Updated 6 years ago
- A simple R package for calculating (meta-) SDT measures☆19Feb 5, 2024Updated 2 years ago
- Tools and tutorials for multi-level regression and post-stratification of survey data☆11Mar 5, 2024Updated 2 years ago
- Code for simulations and empirical analyses for the article "How to control for confounds in decoding analyses of neuroimaging data"☆11Aug 24, 2018Updated 7 years ago
- A simple plugin for syncing movies from IMDb to Obsidian☆16May 8, 2024Updated last year
- Examples on INLA within MCMC☆12May 15, 2017Updated 8 years ago
- ☆10Apr 2, 2024Updated 2 years ago
- Super-Paramagnetic Clustering, Maximum entropy, Maximum Likelihood Methods.☆11Oct 18, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆28Sep 19, 2025Updated 7 months ago
- ☆11Nov 27, 2019Updated 6 years ago
- Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs (EMNLP 2024)☆16Nov 17, 2024Updated last year
- ☆13Dec 2, 2024Updated last year
- Support for the DIDE cluster☆10Nov 22, 2024Updated last year
- statistical models to analyze diagnostic tests☆16Nov 19, 2020Updated 5 years ago
- Code related to the paper "Time series classification with random convolution kernels: pooling operators and input representations matter…☆15Jan 14, 2026Updated 3 months ago