safety-research / bloomLinks
bloom - evaluate any behavior immediately Β πΈπ±
β1,136Updated 2 weeks ago
Alternatives and similar repositories for bloom
Users that are interested in bloom are comparing it to the libraries listed below
Sorting:
- An alignment auditing agent capable of quickly exploring alignment hypothesisβ863Updated last week
- β1,066Updated 2 weeks ago
- Evolve your language agent with Agentic Context Engineering (ACE)β547Updated last week
- An agentic Machine Learning Engineerβ1,189Updated last month
- Standards for building agents, betterβ1,454Updated 2 weeks ago
- Semantic search and document parsing tools for the command lineβ1,572Updated this week
- Lighweight CLI to interact with MCP serversβ807Updated last week
- This repository allows reproduction of Poetiq's record-breaking submission to the ARC-AGI-1 and ARC-AGI-2 benchmarks.β1,179Updated last month
- AI Agent Evaluator & Red Team Platformβ947Updated last week
- An open source implementation of code execution with MCP (Programatic Tool Calling)β644Updated last week
- A Tree Search Library with Flexible API for LLM Inference-Time Scalingβ513Updated last month
- Official repo for spec & SDK of MCP Apps protocol - standard for UIs embedded AI chatbots, served by MCP serversβ1,159Updated this week
- Build common agent use-cases with deepagents libraryβ463Updated last week
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agentsβ548Updated 2 weeks ago
- General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.β1,872Updated this week
- Verbalized Sampling, a training-free prompting strategy to mitigate mode collapse in LLMs by requesting responses with probabilities. Achβ¦β674Updated 3 weeks ago
- LLMRouter: An Open-Source Library for LLM Routingβ1,185Updated last week
- The memory-first coding agentβ914Updated this week
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocolβ373Updated 5 months ago
- Data platform for context engineering. Context data platform that stores, observes and learns. Join the communityβ€οΈ: https://discord.aconβ¦β2,758Updated last week
- Build RL environments for LLM trainingβ617Updated last week
- Gemini-cli or claude code? Why not both? LangCode combines all CLI capabilities and models in one place βοΈ!β437Updated 2 months ago
- β722Updated this week
- Salesforce Enterprise Deep Researchβ1,049Updated 2 weeks ago
- β1,231Updated this week
- Local persistent memory store for LLM applications including claude desktop, github copilot, codex, antigravity, etc.β3,111Updated this week
- Optimize prompts, code, and more with AI-powered Reflective Text Evolutionβ2,167Updated last week
- β269Updated last week
- Agentic Web: Weaving the Next Web with AI Agents.β406Updated last week
- Reached #13 on Stanford's Terminal Bench leaderboard. Orchestrator, explorer & coder agents working together with intelligent context shaβ¦β1,321Updated 2 months ago