invariantlabs-ai / explorerLinks
A better way of testing, inspecting, and analyzing AI Agent traces.
☆39Updated 3 weeks ago
Alternatives and similar repositories for explorer
Users that are interested in explorer are comparing it to the libraries listed below
Sorting:
- Let Claude control a web browser on your machine.☆36Updated 2 months ago
- Sphynx Hallucination Induction☆53Updated 6 months ago
- Guardrails for secure and robust agent development☆327Updated last week
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆87Updated 10 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆95Updated 3 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆83Updated 4 months ago
- A framework for optimizing DSPy programs with RL☆94Updated this week
- Red-Teaming Language Models with DSPy☆203Updated 5 months ago
- Inference-time scaling for LLMs-as-a-judge.☆267Updated 3 weeks ago
- Agent computer interface for AI software engineer.☆92Updated this week
- 🤖 Headless IDE for AI agents☆196Updated 3 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆53Updated 2 months ago
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX☆18Updated this week
- ☆52Updated 3 months ago
- Routing on Random Forest (RoRF)☆181Updated 10 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆67Updated last year
- Enriched Python function call graphs for agents and coding assistants☆101Updated last month
- A system that tries to resolve all issues on a github repo with OpenHands.☆110Updated 8 months ago
- Simple examples using Argilla tools to build AI☆53Updated 8 months ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆273Updated last week
- A Text-Based Environment for Interactive Debugging☆250Updated this week
- ☆78Updated 9 months ago
- ☆96Updated 10 months ago
- Anthropic Computer Use with Modal Sandboxes☆37Updated 9 months ago
- a Python library that uses Reinforcement Learning (RL) to train LLMs.☆38Updated this week
- Letting Claude Code develop his own MCP tools :)☆122Updated 4 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆78Updated 5 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆76Updated 7 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆91Updated last month
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆133Updated last month