☆49 · May 17, 2026 · Updated last week
Alternatives and similar repositories for hibayes
Users that are interested in hibayes are comparing it to the libraries listed below.
- An Inspect extension for agentic cyber evaluations ☆29 · Apr 23, 2026 · Updated last month
- A Kubernetes sandbox environment for use with inspect_ai ☆31 · May 14, 2026 · Updated 2 weeks ago
- Collection of evals for Inspect AI ☆512 · Updated this week
- Deprecated-- this code has been moved into a class of ao_core, which requires a private beta license. This repo is kept up for posterity … ☆11 · Mar 5, 2025 · Updated last year
- LLM as World Models using Bayesian inference ☆18 · May 27, 2025 · Updated last year
- Sample Level Analysis of Pathway Alteration Enrichments ☆10 · Jan 21, 2019 · Updated 7 years ago
- Inspect: A framework for large language model evaluations ☆2,137 · Updated this week
- Redwood Research's transformer interpretability tools ☆15 · Apr 15, 2022 · Updated 4 years ago
- Forecastbench is a dynamic, contamination-free benchmark of LLM forecasting accuracy with human comparison groups, serving as a valuable … ☆64 · May 21, 2026 · Updated last week
- A Julia package for differentiating through expectations with Monte-Carlo estimates ☆16 · Nov 25, 2024 · Updated last year
- Forecastbench Datasets, updated nightly ☆28 · May 21, 2026 · Updated last week
- tugHall: a simulator of cancer cell evolution based on the hallmarks of cancer, linked to the mutational states of tumor-related genes. T… ☆13 · Dec 11, 2023 · Updated 2 years ago
- Data for Decision, Affordable Analytics for All ☆10 · Oct 6, 2024 · Updated last year
- A notebook that compares a reasoning model x a non reasoning model that runs a loop using logprobs found uncertainty ☆25 · Aug 18, 2025 · Updated 9 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆28 · Mar 6, 2024 · Updated 2 years ago
- A sample API that retrieves constellations as an example to demonstrate features in the OpenAPI 3.0 specification. ☆14 · Nov 12, 2024 · Updated last year
- Code for simulations and empirical analyses for the article "How to control for confounds in decoding analyses of neuroimaging data" ☆11 · Aug 24, 2018 · Updated 7 years ago
- Inference API for many LLMs and other useful tools for empirical research ☆122 · May 12, 2026 · Updated 2 weeks ago
- A toolkit for describing model features and intervening on those features to steer behavior. ☆241 · Mar 16, 2026 · Updated 2 months ago
- Examples of how-to use Azure OpenAI Log Probabilities (LogProbs) feature to enhance Generative AI - Q&A grounding. ☆23 · May 10, 2025 · Updated last year
- Code for reproducing the results from "CrAM: A Compression-Aware Minimizer" accepted at ICLR 2023 ☆10 · Mar 1, 2023 · Updated 3 years ago
- [ICML 2025] Official Implementation of "Hessian Geometry of Latent Space in Generative Models" ☆18 · Aug 16, 2025 · Updated 9 months ago
- A Grammar of Data Manipulation for Omics Data ☆21 · Aug 31, 2020 · Updated 5 years ago
- cancereffectsizeR: An R package for calculation of somatic mutation rates and quantification of selection in cancer ☆20 · Jan 6, 2026 · Updated 4 months ago
- ☆19 · Mar 25, 2024 · Updated 2 years ago
- Concept Relevance Propagation for Localization Models, accepted at SAIAD workshop at CVPR 2023. ☆15 · Jan 16, 2024 · Updated 2 years ago
- [NeurIPS 2024] CoSy is an automatic evaluation framework for textual explanations of neurons. ☆20 · Jan 28, 2026 · Updated 4 months ago
- A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc… ☆13 · Jun 16, 2023 · Updated 2 years ago
- Register of codecheckers for the community process ☆20 · Mar 9, 2026 · Updated 2 months ago
- ☆28 · Nov 28, 2024 · Updated last year
- A project comparing the implementations of a basic AI agent using Langchain and PydanticAI frameworks ☆18 · Jan 27, 2025 · Updated last year
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders" ☆23 · Feb 6, 2025 · Updated last year
- This crate is now part of the vm-virtio workspace: https://github.com/rust-vmm/vm-virtio ☆15 · Mar 2, 2022 · Updated 4 years ago
- Statistical analysis methods for comparing prompt and model performance in LLM evaluations. ☆104 · May 13, 2026 · Updated 2 weeks ago
- CLIP is an open source, multimodal computer vision model and it's awesome! ☆17 · Dec 16, 2024 · Updated last year
- Application Security Vulnerability Periodic Table ☆14 · Aug 25, 2014 · Updated 11 years ago
- Generate the Tracy-Widom distribution functions for beta = 1, 2, or 4 in Python ☆10 · Mar 15, 2025 · Updated last year
- Lightweight framework for structured and repeatable model validation ☆11 · Jan 8, 2026 · Updated 4 months ago
- Public repository containing METR's DVC pipeline for eval data analysis ☆286 · Mar 6, 2026 · Updated 2 months ago