haizelabs / get-haized
A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.
☆89Updated 8 months ago
Alternatives and similar repositories for get-haized:
Users that are interested in get-haized are comparing it to the libraries listed below
- Red-Teaming Language Models with DSPy☆169Updated 2 weeks ago
- Sphynx Hallucination Induction☆52Updated last month
- ☆20Updated 4 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆31Updated this week
- ☆50Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Updated 7 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 7 months ago
- Track the progress of LLM context utilisation☆53Updated 7 months ago
- look how they massacred my boy☆63Updated 4 months ago
- A framework-less approach to robust agent development.☆154Updated this week
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆50Updated 3 months ago
- ☆60Updated last month
- Synthetic Data for LLM Fine-Tuning☆111Updated last year
- ☆111Updated 2 months ago
- ☆48Updated last year
- A better way of testing, inspecting, and analyzing AI Agent traces.☆28Updated last week
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆79Updated this week
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆63Updated 4 months ago
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API.☆95Updated last year
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆86Updated last month
- Prompt design in Python☆54Updated 3 months ago
- Functional Benchmarks and the Reasoning Gap☆84Updated 5 months ago
- Get a markdown version of any webpage with a keyboard shortcut.☆58Updated 2 weeks ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆421Updated 5 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆101Updated 2 months ago
- A framework for orchestrating AI agents using a mermaid graph☆74Updated 9 months ago