UKGovernmentBEIS/hibayes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UKGovernmentBEIS/hibayes)

UKGovernmentBEIS / hibayes

☆53

Alternatives and similar repositories for hibayes

Users that are interested in hibayes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

UKGovernmentBEIS / inspect_cyber
View on GitHub
An Inspect extension for agentic cyber evaluations
☆38Jun 18, 2026Updated last month
koayon / atp_star
View on GitHub
PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)
☆20Jan 19, 2025Updated last year
UKGovernmentBEIS / inspect_evals
View on GitHub
Collection of evals for Inspect AI
☆592Updated this week
meridianlabs-ai / inspect_viz
View on GitHub
Data visualization for Inspect AI large language model evalutions.
☆21Updated this week
sambowyer / bayes_evals
View on GitHub
A lightweight library for Bayesian analysis of LLM evals (ICML 2025 Spotlight Position Paper)
☆25May 28, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
UKGovernmentBEIS / inspect_ai
View on GitHub
Inspect: A framework for large language model evaluations
☆2,386Updated this week
UKGovernmentBEIS / aisi-sandboxing
View on GitHub
The open-source AISI toolkit for sandboxing agentic evaluations
☆25Aug 7, 2025Updated 11 months ago
meridianlabs-ai / inspect_scout
View on GitHub
In-depth analysis of AI agent transcripts.
☆57Updated this week
UKGovernmentBEIS / control-arena
View on GitHub
ControlArena is a collection of settings, model organisms and protocols - for running control experiments.
☆210Updated this week
IINemo / llm-uncertainty-head
View on GitHub
☆26Feb 23, 2026Updated 4 months ago
kdu4108 / context-vs-prior-finetuning
View on GitHub
☆15May 27, 2025Updated last year
ndif-team / nnterp
View on GitHub
Unified access to Large Language Model modules using NNsight
☆116Jul 2, 2026Updated 2 weeks ago
meridianlabs-ai / inspect_flow
View on GitHub
Inspect Flow is a workflow stack built on Inspect AI that enables research organisations to run AI evaluations at scale.
☆16Updated this week
MilaNLProc / language-invariant-properties
View on GitHub
☆22Mar 31, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
thestephencasper / latent_adversarial_training
View on GitHub
☆24Jul 25, 2024Updated last year
METR / vivaria
View on GitHub
Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
☆140May 18, 2026Updated 2 months ago
davidstutz / cvpr2019-adversarial-robustness
View on GitHub
CVPR 2019 paper "Disentangling Adversarial Robustness and Generalization".
☆14Oct 28, 2019Updated 6 years ago
anthropic-experimental / automated-auditing
View on GitHub
Prompts used in the Automated Auditing Blog Post
☆166Jul 24, 2025Updated 11 months ago
safety-research / false-facts
View on GitHub
☆50Jul 4, 2025Updated last year
meridianlabs-ai / inspect_petri
View on GitHub
An alignment auditing agent capable of quickly exploring alignment hypothesis
☆1,263Updated this week
TransluceAI / observatory
View on GitHub
A toolkit for describing model features and intervening on those features to steer behavior.
☆247Mar 16, 2026Updated 4 months ago
jbkjr / train-procgen-pytorch
View on GitHub
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆14May 17, 2024Updated 2 years ago
goodfire-ai / scribe-task-suite
View on GitHub
A suite of interpretability tasks to evaluate agents using Scribe for notebook access
☆18Oct 2, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
simulation-based-inference / simulation-based-inference.github.io
View on GitHub
Website
☆12Jul 12, 2026Updated last week
ziquanliu / CVPR2023-TWINS
View on GitHub
Official code for "TWINS: A Fine-Tuning Framework for Improved Transferability of Adversarial Robustness and Generalization", CVPR 2023
☆13Apr 26, 2023Updated 3 years ago
koayon / phil-interp-papers
View on GitHub
A curated reading list for researchers in the Philosophy of Interpretability
☆17Aug 17, 2025Updated 11 months ago
safety-research / safety-tooling
View on GitHub
Inference API for many LLMs and other useful tools for empirical research
☆133May 29, 2026Updated last month
Butanium / tiny-activation-dashboard
View on GitHub
A tiny easily hackable implementation of a feature dashboard.
☆17Oct 21, 2025Updated 9 months ago
MinhxLe / subliminal-learning
View on GitHub
☆151Feb 10, 2026Updated 5 months ago
DanielPolatajko / inspect_wandb
View on GitHub
Integration between Inspect and Weights & Biases
☆24Updated this week
LukeBailey181 / obfuscated-activations
View on GitHub
Codebase for Obfuscated Activations Bypass LLM Latent-Space Defenses
☆31Feb 11, 2025Updated last year
LoryPack / LLM-LieDetector
View on GitHub
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
☆74Jun 19, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
METR / Measuring-Early-2025-AI-on-Exp-OSS-Devs
View on GitHub
Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity: https://metr.org/blog/2025-07-10-early-2025-ai-e…
☆16Feb 23, 2026Updated 4 months ago
thestephencasper / explore_establish_exploit_llms
View on GitHub
☆31Jul 14, 2023Updated 3 years ago
apartresearch / 3cb
View on GitHub
3cb: Catastrophic Cyber Capabilities Benchmarking of Large Language Models
☆16Oct 30, 2024Updated last year
microsoft / implicitMemory
View on GitHub
☆19Feb 12, 2026Updated 5 months ago
jessica-taylor / hashlattice
View on GitHub
A distributed network based on hash codes and lattices.
☆14Aug 16, 2016Updated 9 years ago
nagornovys / Cancer_cell_evolution
View on GitHub
tugHall: a simulator of cancer cell evolution based on the hallmarks of cancer, linked to the mutational states of tumor-related genes. T…
☆13Dec 11, 2023Updated 2 years ago
harish-kamath / rqae
View on GitHub
Residual Quantization Autoencoder, used for interpreting LLMs
☆14Jan 1, 2025Updated last year