AndonLabs / multiagent-inspect
☆11Updated 4 months ago
Alternatives and similar repositories for multiagent-inspect
Users that are interested in multiagent-inspect are comparing it to the libraries listed below
- ☆49Updated this week
- Scale your LLM-as-a-judge.☆232Updated last week
- Prompt engineering, automated.☆325Updated last month
- ☆111Updated 5 months ago
- A framework for optimizing DSPy programs with RL☆58Updated this week
- Sphynx Hallucination Induction☆54Updated 4 months ago
- doteval☆20Updated last month
- ⚖️ Awesome LLM Judges ⚖️☆103Updated last month
- Multi-language code navigation API in a container☆77Updated 2 weeks ago
- Enriched Python function call graphs for agents and coding assistants☆96Updated last week
- Letting Claude Code develop his own MCP tools :)☆105Updated 2 months ago
- Prompt design in Python☆59Updated 6 months ago
- Open source interpretability artefacts for R1.☆140Updated last month
- 🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨☆94Updated 9 months ago
- CLI Tool for converting pydantic models into typescript definitions☆35Updated 7 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆74Updated last week
- Cloudstate is a JavaScript database runtime.☆178Updated last month
- The toolkit for codebase mapping, symbol extraction, and many kinds of code search. Build AI-powered devtools☆443Updated this week
- ACP is the Agent Control Plane - a distributed agent scheduler optimized for simplicity, clarity, and control. It is designed for outer-l…☆81Updated 2 weeks ago
- Using LLMs to transpile from Coq to Lean (public version, may be out of date)☆19Updated 2 months ago
- LLM Evals for Text Summarization and RAG use-cases.☆35Updated last year
- Deep Research for your internal data☆320Updated this week
- Open Source Auth Built on Freestyle: own your auth + data https://docs.freestyle.dev/guides/authentication/☆23Updated 11 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆151Updated this week
- vscode extension to convert computationally intensive pytorch kernels to triton☆22Updated 7 months ago
- ☆93Updated 7 months ago
- Agent Reinforcement Trainer for training multi-turn agents using GRPO☆628Updated this week
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆91Updated last month
- llm-consortium orchestrates multiple LLMs, iteratively refines & achieves consensus.☆248Updated 2 weeks ago
- Python SDK for running evaluations on LLM generated responses☆280Updated 2 weeks ago