invariantlabs-ai / explorerLinks
A better way of testing, inspecting, and analyzing AI Agent traces.
☆40Updated 2 months ago
Alternatives and similar repositories for explorer
Users that are interested in explorer are comparing it to the libraries listed below
Sorting:
- Let Claude control a web browser on your machine.☆36Updated 3 months ago
- Sphynx Hallucination Induction☆53Updated 7 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆89Updated 11 months ago
- Guardrails for secure and robust agent development☆344Updated last month
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆69Updated last year
- Code interpreter support for o1☆32Updated last year
- A framework for optimizing DSPy programs with RL☆172Updated this week
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆96Updated 5 months ago
- Red-Teaming Language Models with DSPy☆212Updated 7 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆85Updated this week
- A system that tries to resolve all issues on a github repo with OpenHands.☆113Updated 9 months ago
- Routing on Random Forest (RoRF)☆203Updated 11 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆54Updated 4 months ago
- ☆73Updated 10 months ago
- A Text-Based Environment for Interactive Debugging☆262Updated this week
- ☆31Updated last month
- Verbosity control for AI agents☆65Updated last year
- Specification for creating reliable LLM-based conversational agents☆54Updated last month
- Anthropic Computer Use with Modal Sandboxes☆37Updated 10 months ago
- ☆81Updated 10 months ago
- LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores a…☆65Updated last week
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆129Updated 11 months ago
- Letting Claude Code develop his own MCP tools :)☆120Updated 6 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code generation of LLMs.☆179Updated 4 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆78Updated 7 months ago
- Inference-time scaling for LLMs-as-a-judge.☆293Updated 2 weeks ago
- ☆99Updated last year
- ☆104Updated 3 months ago
- ☆52Updated 5 months ago