invariantlabs-ai / invariantLinks
Guardrails for secure and robust agent development
☆285Updated 2 weeks ago
Alternatives and similar repositories for invariant
Users that are interested in invariant are comparing it to the libraries listed below
Sorting:
- A better way of testing, inspecting, and analyzing AI Agent traces.☆37Updated this week
- Scale your LLM-as-a-judge.☆232Updated last week
- Red-Teaming Language Models with DSPy☆193Updated 3 months ago
- ☆106Updated last week
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.☆169Updated 2 weeks ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆204Updated last week
- LLM proxy to observe and debug what your AI agents are doing.☆30Updated this week
- Collection of evals for Inspect AI☆139Updated this week
- Python SDK for running evaluations on LLM generated responses☆280Updated 2 weeks ago
- ☆72Updated 7 months ago
- Code snippets to reproduce MCP tool poisoning attacks.☆132Updated last month
- A repository of Language Model Vulnerabilities and Exposures (LVEs).☆110Updated last year
- Prompt engineering, automated.☆321Updated last month
- ☆416Updated this week
- ☆86Updated 3 weeks ago
- 🤖 Headless IDE for AI agents☆188Updated last month
- The LLM Red Teaming Framework☆228Updated this week
- Simple AI coder that can do most of my work for me, including working on himself.☆235Updated last month
- ☆44Updated 10 months ago
- A plugin-based gateway that orchestrates other MCPs and allows developers to build upon it enterprise-grade agents.☆174Updated last month
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆94Updated this week
- Constrain, log and scan your MCP connections for security vulnerabilities.☆720Updated this week
- AWM: Agent Workflow Memory☆271Updated 4 months ago
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"☆340Updated 5 months ago
- TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a…☆55Updated 2 months ago
- EnrichMCP is a python framework for building data driven MCP servers☆199Updated this week
- A benchmark for prompt injection detection systems.☆115Updated 3 weeks ago
- ⚖️ Awesome LLM Judges ⚖️☆103Updated last month
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆136Updated this week
- A Text-Based Environment for Interactive Debugging☆217Updated this week