invariantlabs-ai / explorerLinks
A better way of testing, inspecting, and analyzing AI Agent traces.
☆40Updated this week
Alternatives and similar repositories for explorer
Users that are interested in explorer are comparing it to the libraries listed below
Sorting:
- Let Claude control a web browser on your machine.☆36Updated 4 months ago
- Sphynx Hallucination Induction☆53Updated 8 months ago
- Guardrails for secure and robust agent development☆348Updated 2 months ago
- ☆73Updated 11 months ago
- Red-Teaming Language Models with DSPy☆214Updated 7 months ago
- A Text-Based Environment for Interactive Debugging☆268Updated this week
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆56Updated 4 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆96Updated 5 months ago
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX☆39Updated last week
- ☆52Updated 5 months ago
- Inference-time scaling for LLMs-as-a-judge.☆299Updated last month
- Routing on Random Forest (RoRF)☆211Updated last year
- Letting Claude Code develop his own MCP tools :)☆121Updated 6 months ago
- A framework for optimizing DSPy programs with RL☆185Updated last week
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆89Updated this week
- ☆41Updated last month
- Code interpreter support for o1☆32Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆86Updated 3 weeks ago
- a Python library that uses Reinforcement Learning (RL) to train LLMs.☆41Updated 2 months ago
- Verbosity control for AI agents☆65Updated last year
- 🤖 Headless IDE for AI agents☆202Updated 5 months ago
- Test Generation for Prompts☆141Updated last week
- Tools for LLM agents.☆62Updated 9 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆56Updated 6 months ago
- auto fine tune of models with synthetic data☆76Updated last year
- ☆47Updated last year
- Enriched Python function call graphs for agents and coding assistants☆119Updated 3 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆70Updated last year
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆24Updated 2 weeks ago
- This codebase demonstrates various DSPy functionalities through practical examples.☆48Updated 7 months ago