invariantlabs-ai / explorer
A better way of testing, inspecting, and analyzing AI Agent traces.
☆40 Updated 3 weeks ago
Alternatives and similar repositories for explorer
Users who are interested in explorer are comparing it to the libraries listed below.
- Let Claude control a web browser on your machine. ☆39 Updated 5 months ago
- Sphynx Hallucination Induction ☆53 Updated 9 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆88 Updated this week
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆92 Updated last month
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite. ☆98 Updated 7 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆59 Updated 6 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆71 Updated last week
- Red-Teaming Language Models with DSPy ☆235 Updated 9 months ago
- A Text-Based Environment for Interactive Debugging ☆276 Updated this week
- ☆74 Updated last year
- A Python library that uses Reinforcement Learning (RL) to train LLMs. ☆42 Updated 3 months ago
- Inference-time scaling for LLMs-as-a-judge. ☆308 Updated last week
- Anthropic Computer Use with Modal Sandboxes ☆41 Updated last year
- Simple examples using Argilla tools to build AI ☆56 Updated 11 months ago
- Verbosity control for AI agents ☆64 Updated last year
- ☆84 Updated last year
- ☆35 Updated 3 months ago
- The theory of mind module for the SWE agent ☆37 Updated 3 weeks ago
- Specification for creating reliable LLM-based conversational agents ☆63 Updated 3 weeks ago
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX ☆40 Updated last week
- Guardrails for secure and robust agent development ☆364 Updated 3 months ago
- Small, simple agent task environments for training and evaluation ☆19 Updated last year
- Tools for LLM agents. ☆60 Updated 10 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper. ☆57 Updated 8 months ago
- Code interpreter support for o1 ☆32 Updated last year
- LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores a… ☆66 Updated 2 weeks ago
- Letting Claude Code develop his own MCP tools :) ☆123 Updated 8 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you… ☆79 Updated 10 months ago
- ☆47 Updated last year
- Agent computer interface for AI software engineer. ☆112 Updated last month