invariantlabs-ai / explorer
A better way of testing, inspecting, and analyzing AI Agent traces.
☆40 · Updated this week
Alternatives and similar repositories for explorer
Users who are interested in explorer are comparing it to the repositories listed below.
- Let Claude control a web browser on your machine. ☆39 · Updated 4 months ago
- Sphynx Hallucination Induction ☆53 · Updated 8 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆88 · Updated last month
- Red-Teaming Language Models with DSPy ☆221 · Updated 8 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite. ☆97 · Updated 6 months ago
- A framework for optimizing DSPy programs with RL ☆208 · Updated this week
- Guardrails for secure and robust agent development ☆354 · Updated 2 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆58 · Updated 5 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆90 · Updated 3 weeks ago
- Inference-time scaling for LLMs-as-a-judge. ☆303 · Updated 3 weeks ago
- ☆73 · Updated last year
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX ☆40 · Updated this week
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper. ☆57 · Updated 7 months ago
- Verbosity control for AI agents ☆65 · Updated last year
- ☆52 · Updated 6 months ago
- A Text-Based Environment for Interactive Debugging ☆272 · Updated this week
- ☆68 · Updated 5 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses. ☆119 · Updated 2 weeks ago
- ToolFuzz is a fuzzing framework designed to test your LLM Agent tools. ☆30 · Updated 3 months ago
- Simple examples using Argilla tools to build AI ☆56 · Updated 11 months ago
- auto fine tune of models with synthetic data ☆75 · Updated last year
- a simple example demonstrating MCP + ag2 (autogen) integration ☆41 · Updated 3 months ago
- ☆47 · Updated last year
- Code interpreter support for o1 ☆32 · Updated last year
- 🤖 Headless IDE for AI agents ☆200 · Updated 2 weeks ago
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆204 · Updated last week
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆33 · Updated 6 months ago
- A system that tries to resolve all issues on a github repo with OpenHands. ☆114 · Updated 11 months ago
- Agent computer interface for AI software engineer. ☆111 · Updated last month
- ☆47 · Updated 2 months ago