safety-research / bloomLinks
bloom - evaluate any behavior immediately Β πΈπ±
β1,165Updated 3 weeks ago
Alternatives and similar repositories for bloom
Users that are interested in bloom are comparing it to the libraries listed below
Sorting:
- Evolve your language agent with Agentic Context Engineering (ACE)β596Updated 3 weeks ago
- β1,205Updated this week
- An alignment auditing agent capable of quickly exploring alignment hypothesisβ887Updated this week
- An agentic Machine Learning Engineerβ1,204Updated 2 months ago
- This repository allows reproduction of Poetiq's record-breaking submission to the ARC-AGI-1 and ARC-AGI-2 benchmarks.β1,197Updated last month
- Official repo for spec & SDK of MCP Apps protocol - standard for UIs embedded AI chatbots, served by MCP serversβ1,416Updated last week
- β757Updated last week
- AI Agent Evaluator & Red Team Platformβ995Updated this week
- Standards for building agents, betterβ1,463Updated 3 weeks ago
- Lighweight CLI to interact with MCP serversβ845Updated last week
- Build common agent use-cases with deepagents libraryβ468Updated 2 weeks ago
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agentsβ556Updated this week
- An open source implementation of code execution with MCP (Programatic Tool Calling)β646Updated 3 weeks ago
- Verbalized Sampling, a training-free prompting strategy to mitigate mode collapse in LLMs by requesting responses with probabilities. Achβ¦β684Updated last month
- Build, enrich, and transform datasets using AI models with no codeβ1,623Updated 3 months ago
- General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.β2,028Updated this week
- Context Data Platform for AI Agentsβ2,906Updated this week
- Self-learning data agent that grounds its answers in 6 layers of context. Inspired by OpenAI's in-house implementation.β1,439Updated last week
- Semantic search and document parsing tools for the command lineβ1,604Updated this week
- Reached #13 on Stanford's Terminal Bench leaderboard. Orchestrator, explorer & coder agents working together with intelligent context shaβ¦β1,323Updated 3 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scalingβ520Updated last week
- The memory-first coding agentβ1,023Updated this week
- π§ Make your agents learn from experience. Based on the Agentic Context Engineering (ACE) framework.β1,834Updated this week
- The lightweight framework for building agentsβ298Updated last week
- Salesforce Enterprise Deep Researchβ1,107Updated last week
- LLMRouter: An Open-Source Library for LLM Routingβ1,314Updated this week
- Build RL environments for LLM trainingβ636Updated this week
- Aware - Deep Code Research Agent for Complex Codebase & Knowledge that βAct As Your Agentic Principal Engineerββ437Updated 3 months ago
- β1,257Updated last week
- Local persistent memory store for LLM applications including claude desktop, github copilot, codex, antigravity, etc.β3,217Updated 2 weeks ago