safety-research / bloomLinks
bloom - evaluate any behavior immediately Β πΈπ±
β1,027Updated last week
Alternatives and similar repositories for bloom
Users that are interested in bloom are comparing it to the libraries listed below
Sorting:
- β801Updated this week
- Evolve your language agent with Agentic Context Engineering (ACE)β480Updated last month
- An alignment auditing agent capable of quickly exploring alignment hypothesisβ791Updated this week
- AI Agent Evaluator & Red Team Platformβ771Updated this week
- π MassGen is an open-source multi-agent scaling system that runs in your terminal, autonomously orchestrating frontier models and agentsβ¦β680Updated this week
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agentsβ536Updated 2 weeks ago
- An agentic Machine Learning Engineerβ1,168Updated last month
- A Tree Search Library with Flexible API for LLM Inference-Time Scalingβ512Updated last month
- Salesforce Enterprise Deep Researchβ1,027Updated last month
- The memory-first coding agentβ815Updated this week
- This repository allows reproduction of Poetiq's record-breaking submission to the ARC-AGI-1 and ARC-AGI-2 benchmarks.β1,110Updated 3 weeks ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocolβ370Updated 4 months ago
- Standards for building agents, betterβ1,442Updated last week
- An open source implementation of code execution with MCP (Programatic Tool Calling)β631Updated this week
- Build, enrich, and transform datasets using AI models with no codeβ1,616Updated 2 months ago
- Build RL environments for LLM trainingβ568Updated this week
- The lightweight framework for building agentsβ231Updated this week
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and versβ¦β984Updated last month
- Gemini-cli or claude code? Why not both? LangCode combines all CLI capabilities and models in one place βοΈ!β435Updated last month
- Agentic Web: Weaving the Next Web with AI Agents.β402Updated 3 months ago
- Build common agent use-cases with deepagents libraryβ431Updated last month
- A powerful Python library for creating and managing isolated desktop environments using Docker containers.β442Updated 4 months ago
- MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Serversβ420Updated 3 months ago
- Verbalized Sampling, a training-free prompting strategy to mitigate mode collapse in LLMs by requesting responses with probabilities. Achβ¦β647Updated last week
- Reached #13 on Stanford's Terminal Bench leaderboard. Orchestrator, explorer & coder agents working together with intelligent context shaβ¦β1,304Updated 2 months ago
- Agent Development Kit Web (adk web) is the built-in developer UI that is integrated with Agent Development Kit for easier agent developmeβ¦β794Updated this week
- β318Updated last month
- Semantic search and document parsing tools for the command lineβ1,512Updated last month
- Claude Code SDK Demosβ1,086Updated this week
- π§ Make your agents learn from experience. Based on the Agentic Context Engineering (ACE) framework.β1,700Updated 3 weeks ago