haizelabs / get-haizedLinks
A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.
☆100Updated 7 months ago
Alternatives and similar repositories for get-haized
Users that are interested in get-haized are comparing it to the libraries listed below
Sorting:
- Sphynx Hallucination Induction☆53Updated 10 months ago
- Red-Teaming Language Models with DSPy☆244Updated 9 months ago
- ☆35Updated 6 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- ☆26Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 2 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆40Updated last month
- ⚖️ Awesome LLM Judges ⚖️☆146Updated 7 months ago
- explore token trajectory trees on instruct and base models☆149Updated 6 months ago
- A framework for orchestrating AI agents using a mermaid graph☆77Updated last year
- Track the progress of LLM context utilisation☆55Updated 7 months ago
- An automated tool for discovering insights from research papaer corpora☆137Updated last year
- Inference-time scaling for LLMs-as-a-judge.☆316Updated last month
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆93Updated 2 months ago
- look how they massacred my boy☆63Updated last year
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API.☆101Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 7 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated last year
- ☆117Updated 11 months ago
- ☆47Updated last year
- ☆45Updated 2 years ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆57Updated 8 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆146Updated 9 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆49Updated last year
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆99Updated 4 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated last year
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆120Updated 3 weeks ago
- ☆136Updated 8 months ago
- Verbosity control for AI agents☆64Updated last year
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆31Updated 8 months ago