safety-research/bloom

bloom - evaluate any behavior immediately 🌸🌱

☆1,371

Alternatives and similar repositories for bloom

Users who are interested in bloom often compare it to the repositories listed below.

meridianlabs-ai / inspect_petri
An alignment auditing agent capable of quickly exploring alignment hypothesis
☆1,273 · Updated this week
safety-research / safety-tooling
Inference API for many LLMs and other useful tools for empirical research
☆134 · May 29, 2026 · Updated last month
safety-research / assistant-axis
The Assistant Axis is a direction in activation space that captures how "Assistant-like" a model's behavior is. Models can drift away fro…
☆158 · Jan 20, 2026 · Updated 6 months ago
UKGovernmentBEIS / inspect_ai
Inspect: A framework for large language model evaluations
☆2,406 · Updated this week
safety-research / persona_vectors
Persona Vectors: Monitoring and Controlling Character Traits in Language Models
☆452 · Apr 22, 2026 · Updated 3 months ago
decoderesearch / circuit-tracer
☆2,875 · Jul 18, 2026 · Updated last week
anthropic-experimental / automated-auditing
Prompts used in the Automated Auditing Blog Post
☆167 · Jul 24, 2025 · Updated last year
google / langextract
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…
☆37,814 · Updated this week
OpenPipe / ART
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…
☆10,529 · Updated this week
safety-research / false-facts
☆50 · Jul 4, 2025 · Updated last year
microsoft / agent-lightning
The absolute trainer to light up AI agents.
☆17,424 · Jul 16, 2026 · Updated last week
confident-ai / deepeval
The LLM Evaluation Framework
☆17,111 · Updated this week
anthropic-experimental / agentic-misalignment
☆643 · Jun 19, 2025 · Updated last year
StarTrail-org / LEANN
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on …
☆12,728 · Updated this week
stanfordnlp / dspy
DSPy: The framework for programming—not prompting—language models
☆36,371 · Updated this week
ajobi-uhc / seer
This was designed for interp researchers who want to do research on or with interp agents to give quality of life improvements and fix …
☆146 · Feb 8, 2026 · Updated 5 months ago
UKGovernmentBEIS / inspect_evals
Collection of evals for Inspect AI
☆602 · Updated this week
safety-research / auditing-agents
☆28 · Jul 1, 2026 · Updated 3 weeks ago
getzep / graphiti
Build Real-Time Knowledge Graphs for AI Agents
☆29,191 · Updated this week
NVIDIA-NeMo / Gym
Evaluate and improve models and agents using environments
☆1,066 · Updated this week
a2ui-project / a2ui
☆15,893 · Updated this week
ndif-team / nnterp
Unified access to Large Language Model modules using NNsight
☆116 · Updated this week
emcie-co / parlant
Build reliable customer-facing AI agents with Parlant: an interaction control harness optimized for controlled, consistent, and predictab…
☆18,178 · Jul 12, 2026 · Updated last week
PrimeIntellect-ai / verifiers
Our library for RL environments + evals
☆4,400 · Updated this week
humanlayer / humanlayer
The best way to get AI coding agents to solve hard problems in complex codebases.
☆11,160 · Jun 19, 2026 · Updated last month
thinking-machines-lab / tinker-cookbook
Post-training with Tinker
☆3,911 · Updated this week
modelcontextprotocol / registry
A community driven registry service for Model Context Protocol (MCP) servers.
☆7,067 · Updated this week
cocoindex-io / cocoindex
Incremental engine for long horizon agents 🌟 Star if you like it!
☆11,052 · Updated this week
promptfoo / promptfoo
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, De…
☆23,579 · Updated this week
ApolloResearch / deception-detection
☆44 · Feb 11, 2025 · Updated last year
TransluceAI / introspective-interp
Repository for "Training Language Models To Explain Their Own Computations"
☆23 · Jul 7, 2026 · Updated 2 weeks ago
letta-ai / letta
Platform for stateful agents: AI with advanced memory that can learn and self-improve over time.
☆23,955 · Updated this week
safety-research / finetuning-auditor
Auditing agents for fine-tuning safety
☆21 · Oct 21, 2025 · Updated 9 months ago
safety-research / open-source-alignment-faking
Open Source Replication of Anthropic's Alignment Faking Paper
☆58 · Apr 4, 2025 · Updated last year
UKGovernmentBEIS / control-arena
ControlArena is a collection of settings, model organisms and protocols - for running control experiments.
☆213 · Updated this week
cywinski / eliciting-secret-knowledge
Code repository for "Eliciting Secret Knowledge from Language Models"
☆24 · Mar 30, 2026 · Updated 3 months ago
simstudioai / sim
Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.
☆29,220 · Updated this week
agno-agi / agno
Build, run, and manage agent platforms.
☆41,417 · Updated this week
BerriAI / litellm
The fastest, litest AI Gateway. Rust core with Python SDK. Call 100+ LLM APIs in OpenAI (or native) format with cost tracking, guardrails…
☆54,668 · Updated this week