haizelabs / get-haized

A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.

☆95

Alternatives and similar repositories for get-haized

Users who are interested in get-haized are comparing it to the libraries listed below.

haizelabs / dspy-redteam
Red-Teaming Language Models with DSPy
☆203 Updated 5 months ago
haizelabs / sphynx
Sphynx Hallucination Induction
☆53 Updated 6 months ago
redteaming-arena / redteam-arena
☆34 Updated last month
haizelabs / bijection-learning
☆24 Updated 9 months ago
haizelabs / Awesome-LLM-Judges
⚖️ Awesome LLM Judges ⚖️
☆108 Updated 3 months ago
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119 Updated last year
haizelabs / verdict
Inference-time scaling for LLMs-as-a-judge.
☆267 Updated 3 weeks ago
vgel / logitloom
explore token trajectory trees on instruct and base models
☆134 Updated 2 months ago
ai8hyf / OpenResearchAssistant
An automated tool for discovering insights from research paper corpora
☆138 Updated last year
interstellarninja / MeeseeksAI
A framework for orchestrating AI agents using a mermaid graph
☆77 Updated last year
javirandor / anthropic-tokenizer
Approximation of the Claude 3 tokenizer by inspecting generation stream
☆131 Updated last year
BBischof / yapping
Verbosity control for AI agents
☆64 Updated last year
invariantlabs-ai / explorer
A better way of testing, inspecting, and analyzing AI Agent traces.
☆39 Updated 3 weeks ago
Ziems / arbor
A framework for optimizing DSPy programs with RL
☆96 Updated this week
JD-P / RetroInstruct
Synthetic data derived by templating, few-shot prompting, transformations on public domain corpora, and Monte Carlo tree search.
☆32 Updated 5 months ago
haizelabs / j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆95 Updated 2 weeks ago
plastic-labs / yousim
they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*?
☆124 Updated 3 months ago
Columbia-NLP-Lab / PAPILLON
Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles
☆53 Updated 2 months ago
haizelabs / thorn-in-haizestack
Thorn in a HaizeStack test for evaluating long-context adversarial robustness.
☆26 Updated last year
stunningpixels / lou-eval
Track the progress of LLM context utilisation
☆55 Updated 3 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63 Updated 9 months ago
teknium1 / transformers-gptq-quant
☆47 Updated last year
METR / vivaria
Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
☆103 Updated last week
zbambergerNLP / strategic-debate-tot
A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments
☆87 Updated 10 months ago
SpellcraftAI / oaib
Use the OpenAI Batch tool to make async batch requests to the OpenAI API.
☆99 Updated last year
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆62 Updated 9 months ago
hwchase17 / adversarial-prompts
Curation of prompts that are known to be adversarial to large language models
☆184 Updated 2 years ago
AK391 / dailypapersHN
☆86 Updated 10 months ago
Nearcyan / papers.day
papers.day
☆91 Updated last year
hrishioa / ipgu
☆28 Updated 3 months ago