haizelabs / get-haized
A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.
☆91Updated last month
Alternatives and similar repositories for get-haized
Users that are interested in get-haized are comparing it to the libraries listed below
Sorting:
- Red-Teaming Language Models with DSPy☆192Updated 3 months ago
- Sphynx Hallucination Induction☆54Updated 3 months ago
- Verdict is a library for scaling judge-time compute.☆211Updated 2 weeks ago
- ☆22Updated 6 months ago
- ⚖️ Awesome LLM Judges ⚖️☆97Updated 2 weeks ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 9 months ago
- Track the progress of LLM context utilisation☆54Updated last month
- explore token trajectory trees on instruct and base models☆106Updated this week
- A framework for orchestrating AI agents using a mermaid graph☆75Updated last year
- ☆48Updated last year
- they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*?☆120Updated 2 weeks ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 2 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆35Updated last week
- Verbosity control for AI agents☆63Updated 11 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆52Updated 2 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 9 months ago
- ☆47Updated last year
- ☆31Updated last month
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆92Updated last week
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated last year
- papers.day☆93Updated last year
- Interactive timeline of AI history☆51Updated last month
- A reading list of relevant papers and projects on foundation model annotation☆27Updated 2 months ago
- ☆148Updated 2 months ago
- ☆46Updated this week
- This repository explains and provides examples for "concept anchoring" in GPT4.☆72Updated last year
- ☆125Updated last month
- Letting Claude Code develop his own MCP tools :)☆100Updated 2 months ago
- Functional Benchmarks and the Reasoning Gap☆86Updated 7 months ago
- ☆54Updated 3 months ago