safety-research / bloomLinks
bloom - evaluate any behavior immediately 🌸🌱
☆28Updated last week
Alternatives and similar repositories for bloom
Users that are interested in bloom are comparing it to the libraries listed below
Sorting:
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆722Updated this week
- Evolve your language agent with Agentic Context Engineering (ACE)☆423Updated last month
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents☆525Updated last week
- Prompts used in the Automated Auditing Blog Post☆127Updated 4 months ago
- ☆616Updated last week
- Vibe-coding tools for the LlamaIndex ecosystem☆173Updated last month
- ☆221Updated this week
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆270Updated 2 months ago
- Verbalized Sampling, a training-free prompting strategy to mitigate mode collapse in LLMs by requesting responses with probabilities. Ach…☆606Updated 3 weeks ago
- Context Engineering Course with DSPy☆204Updated 4 months ago
- Agentic Web: Weaving the Next Web with AI Agents.☆397Updated 2 months ago
- A clean, modular SDK for building AI agents with OpenHands V1.☆337Updated this week
- This repository allows reproduction of Poetiq's record-breaking submission to the ARC-AGI-1 and ARC-AGI-2 benchmarks.☆867Updated this week
- 🚀 MassGen is an open-source multi-agent scaling system that runs in your terminal, autonomously orchestrating frontier models and agents…☆653Updated this week
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆504Updated last week
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆370Updated 3 months ago
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆327Updated 3 months ago
- Build common agent use-cases with deepagents library☆394Updated 2 weeks ago
- MCP (Model Context Protocol) server for Weaviate☆160Updated 7 months ago
- ☆539Updated 6 months ago
- Together Open Deep Research☆356Updated 8 months ago
- Ranking LLMs on agentic tasks☆204Updated last month
- A framework for optimizing DSPy programs with RL☆298Updated last month
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆241Updated last week
- ☆239Updated last month
- Inference-time scaling for LLMs-as-a-judge.☆316Updated last month
- The State Of The Art, intelligence☆157Updated 4 months ago
- Gemini-cli or claude code? Why not both? LangCode combines all CLI capabilities and models in one place ☂️!☆427Updated last month
- Aware - Deep Code Research Agent for Complex Codebase & Knowledge that “Act As Your Agentic Principal Engineer”☆387Updated last month
- ☆235Updated 3 weeks ago