invariantlabs-ai / invariantLinks

Guardrails for secure and robust agent development

☆355

Alternatives and similar repositories for invariant

Users that are interested in invariant are comparing it to the libraries listed below

Sorting:

haizelabs / dspy-redteam
Red-Teaming Language Models with DSPy
☆235Updated 8 months ago
andyzorigin / cybench
☆165Updated 4 months ago
invariantlabs-ai / explorer
A better way of testing, inspecting, and analyzing AI Agent traces.
☆40Updated last week
invariantlabs-ai / invariant-gateway
LLM proxy to observe and debug what your AI agents are doing.
☆51Updated 3 months ago
ZenGuard-AI / fast-llm-security-guardrails
The fastest Trust Layer for AI Agents
☆144Updated 5 months ago
safety-research / petri
An alignment auditing agent capable of quickly exploring alignment hypothesis
☆609Updated last week
google-research / camel-prompt-injection
Code for the paper "Defeating Prompt Injections by Design"
☆138Updated 4 months ago
haizelabs / verdict
Inference-time scaling for LLMs-as-a-judge.
☆304Updated 3 weeks ago
splx-ai / agentic-radar
A security scanner for your LLM agentic workflows
☆772Updated this week
lve-org / lve
A repository of Language Model Vulnerabilities and Exposures (LVEs).
☆112Updated last year
confident-ai / deepteam
DeepTeam is a framework to red team LLMs and LLM systems.
☆799Updated last week
UKGovernmentBEIS / inspect_evals
Collection of evals for Inspect AI
☆264Updated this week
ozyyshr / RepoGraph
Enhancing AI Software Engineering with Repository-level Code Graph
☆217Updated 6 months ago
SWE-agent / SWE-ReX
Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.
☆349Updated this week
princeton-pli / hal-harness
☆172Updated this week
lasso-security / mcp-gateway
A plugin-based gateway that orchestrates other MCPs and allows developers to build upon it enterprise-grade agents.
☆299Updated 3 months ago
microsoft / debug-gym
A Text-Based Environment for Interactive Debugging
☆272Updated last week
athina-ai / athina-evals
Python SDK for running evaluations on LLM generated responses
☆292Updated 4 months ago
Puliczek / awesome-mcp-security
🔥🔒 Awesome MCP (Model Context Protocol) Security 🖥️
☆574Updated 2 weeks ago
invariantlabs-ai / mcp-scan
Constrain, log and scan your MCP connections for security vulnerabilities.
☆1,166Updated this week
METR / vivaria
Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
☆116Updated this week
haizelabs / Awesome-LLM-Judges
⚖️ Awesome LLM Judges ⚖️
☆132Updated 6 months ago
ServiceNow / TapeAgents
TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle
☆298Updated last week
haizelabs / get-haized
A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.
☆97Updated 6 months ago
agencyenterprise / PromptInject
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to a…
☆427Updated last year
haizelabs / sphynx
Sphynx Hallucination Induction
☆53Updated 8 months ago
invariantlabs-ai / playwright-computer-use
Let Claude control a web browser on your machine.
☆39Updated 4 months ago
amazon-science / CodeSage
CodeSage: Code Representation Learning At Scale (ICLR 2024)
☆113Updated last year
SWE-bench / SWE-smith
[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents
☆432Updated last week
symflower / eval-dev-quality
DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code generation of LLMs.
☆182Updated 5 months ago