invariantlabs-ai / explorerLinks
A better way of testing, inspecting, and analyzing AI Agent traces.
☆46Updated 3 weeks ago
Alternatives and similar repositories for explorer
Users that are interested in explorer are comparing it to the libraries listed below
Sorting:
- Let Claude control a web browser on your machine.☆40Updated 7 months ago
- Guardrails for secure and robust agent development☆384Updated 3 weeks ago
- Sphynx Hallucination Induction☆52Updated last year
- ToolFuzz is a fuzzing framework designed to test your LLM Agent tools.☆37Updated 6 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆99Updated 4 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆100Updated 9 months ago
- A Text-Based Environment for Interactive Debugging☆293Updated this week
- ☆37Updated 6 months ago
- Red-Teaming Language Models with DSPy☆250Updated 11 months ago
- 🤖 Headless IDE for AI agents☆200Updated 3 months ago
- ☆76Updated last year
- A framework for optimizing DSPy programs with RL☆308Updated 3 weeks ago
- Enriched Python function call graphs for agents and coding assistants☆126Updated 7 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆90Updated last month
- Code interpreter support for o1☆31Updated last year
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX☆44Updated last week
- Letting Claude Code develop his own MCP tools :)☆123Updated 10 months ago
- Anthropic Computer Use with Modal Sandboxes☆43Updated last year
- ☆85Updated 5 months ago
- ☆51Updated 5 months ago
- Test Generation for Prompts☆149Updated 2 weeks ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 8 months ago
- ☆47Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆73Updated 3 months ago
- Inference-time scaling for LLMs-as-a-judge.☆327Updated 3 months ago
- a Python library that uses Reinforcement Learning (RL) to train LLMs.☆42Updated 6 months ago
- Contains the prompts we use to talk to various LLMs for different utilities inside the editor☆84Updated 2 years ago
- ☆54Updated 9 months ago
- The theory of mind module for the SWE agent☆71Updated 3 weeks ago
- Prompts used in the Automated Auditing Blog Post☆137Updated 6 months ago