invariantlabs-ai / invariant
A trace analysis tool for AI agents.
☆118Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for invariant
- Red-Teaming Language Models with DSPy☆142Updated 6 months ago
- Sphynx Hallucination Induction☆47Updated 3 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆85Updated 4 months ago
- A repository of Language Model Vulnerabilities and Exposures (LVEs).☆106Updated 7 months ago
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".☆84Updated 8 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆96Updated this week
- Enhancing AI Software Engineering with Repository-level Code Graph☆90Updated 2 months ago
- ☆253Updated last month
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [arXiv, Apr 2024]☆211Updated last month
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 3 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆109Updated 4 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆710Updated last week
- Improving Alignment and Robustness with Circuit Breakers☆152Updated last month
- Code for the paper 🌳 Tree Search for Language Model Agents☆138Updated 3 months ago
- ☆99Updated 3 months ago
- ☆57Updated last week
- 🤖🌊 aiFlows: The building blocks of your collaborative AI☆238Updated 6 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆62Updated this week
- AWM: Agent Workflow Memory☆203Updated last month
- CodeSage: Code Representation Learning At Scale (ICLR 2024)☆82Updated last week
- Just a bunch of benchmark logs for different LLMs☆113Updated 3 months ago
- EvoEval: Evolving Coding Benchmarks via LLM☆60Updated 7 months ago
- ☆38Updated 3 months ago
- Prompt engineering, automated.☆240Updated this week
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆62Updated last month
- [ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use☆113Updated 7 months ago
- r2e: turn any github repository into a programming agent environment☆87Updated last week
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆201Updated 5 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆91Updated 4 months ago
- Small, simple agent task environments for training and evaluation☆16Updated last week