invariantlabs-ai / explorerLinks
A better way of testing, inspecting, and analyzing AI Agent traces.
☆40Updated last month
Alternatives and similar repositories for explorer
Users that are interested in explorer are comparing it to the libraries listed below
Sorting:
- Let Claude control a web browser on your machine.☆39Updated 6 months ago
- Sphynx Hallucination Induction☆53Updated 10 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆93Updated 2 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆88Updated last week
- Red-Teaming Language Models with DSPy☆240Updated 9 months ago
- Guardrails for secure and robust agent development☆368Updated 4 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆72Updated last month
- A Text-Based Environment for Interactive Debugging☆282Updated this week
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆100Updated 7 months ago
- Inference-time scaling for LLMs-as-a-judge.☆314Updated last month
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆81Updated 9 months ago
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX☆42Updated 2 weeks ago
- ☆74Updated last year
- A system that tries to resolve all issues on a github repo with OpenHands.☆117Updated last year
- Code interpreter support for o1☆31Updated last year
- 🤖 Headless IDE for AI agents☆200Updated last month
- Anthropic Computer Use with Modal Sandboxes☆40Updated last year
- Simple examples using Argilla tools to build AI☆56Updated last year
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆132Updated last year
- LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores a…☆68Updated last month
- ☆84Updated last year
- auto fine tune of models with synthetic data☆76Updated last year
- Policy Synth is a TypeScript class library designed to streamline and enhance decision-making processes through multi-scale AI agent logi…☆54Updated last week
- Test Generation for Prompts☆143Updated this week
- Agent computer interface for AI software engineer.☆114Updated 2 months ago
- Verbosity control for AI agents☆64Updated last year
- Small, simple agent task environments for training and evaluation☆19Updated last year
- Get a markdown version of any webpage with a keyboard shortcut.☆67Updated 9 months ago
- Simple Graph Memory for AI applications☆89Updated 6 months ago
- Official Repo for CRMArena and CRMArena-Pro☆126Updated 3 weeks ago