AndonLabs / multiagent-inspectLinks

☆11

Alternatives and similar repositories for multiagent-inspect

Users that are interested in multiagent-inspect are comparing it to the libraries listed below

Sorting:

lmnr-ai / lmnr-python
☆49Updated this week
haizelabs / verdict
Scale your LLM-as-a-judge.
☆232Updated last week
zenbase-ai / core
Prompt engineering, automated.
☆325Updated last month
jerber / lang-jepa
☆111Updated 5 months ago
Ziems / arbor
A framework for optimizing DSPy programs with RL
☆58Updated this week
haizelabs / sphynx
Sphynx Hallucination Induction
☆54Updated 4 months ago
The-LLM-Data-Company / doteval
doteval
☆20Updated last month
haizelabs / Awesome-LLM-Judges
⚖️ Awesome LLM Judges ⚖️
☆103Updated last month
agentic-labs / lsproxy
Multi-language code navigation API in a container
☆77Updated 2 weeks ago
nuanced-dev / nuanced
Enriched Python function call graphs for agents and coding assistants
☆96Updated last week
willccbb / claude-code-mcp
Letting Claude Code develop his own MCP tools :)
☆105Updated 2 months ago
zenbase-ai / py-priompt
Prompt design in Python
☆59Updated 6 months ago
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆140Updated last month
simplifine-llm / Simplifine
🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨
☆94Updated 9 months ago
Darius-Labs / pydantic-to-typescript2
CLI Tool for converting pydantic models into typescript definitions
☆35Updated 7 months ago
haizelabs / j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆74Updated last week
freestyle-sh / cloudstate
Cloudstate is a JavaScript database runtime.
☆178Updated last month
cased / kit
The toolkit for codebase mapping, symbol extraction, and many kinds of code search. Build AI-powered devtools
☆443Updated this week
humanlayer / agentcontrolplane
ACP is the Agent Control Plane - a distributed agent scheduler optimized for simplicity, clarity, and control. It is designed for outer-l…
☆81Updated 2 weeks ago
JasonGross / autoformalization-transpilation
Using LLMs to transpile from Coq to Lean (public version, may be out of date)
☆19Updated 2 months ago
athina-ai / ariadne
LLM Evals for Text Summarization and RAG use-cases.
☆35Updated last year
defog-ai / introspect
Deep Research for your internal data
☆320Updated this week
freestyle-sh / freestyle-auth
Open Source Auth Built on Freestyle: own your auth + data https://docs.freestyle.dev/guides/authentication/
☆23Updated 11 months ago
567-labs / kura
Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…
☆151Updated this week
proxis-dev / vscode-triton
vscode extension to convert computationally intensive pytorch kernels to triton
☆22Updated 7 months ago
nuwandavek / karpathify
☆93Updated 7 months ago
OpenPipe / ART
Agent Reinforcement Trainer for training multi-turn agents using GRPO
☆628Updated this week
haizelabs / get-haized
A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.
☆91Updated last month
irthomasthomas / llm-consortium
llm-consortium orchestrates mulitple LLMs, iteratively refines & achieves consensus.
☆248Updated 2 weeks ago
athina-ai / athina-evals
Python SDK for running evaluations on LLM generated responses
☆280Updated 2 weeks ago