haizelabs / llama3-jailbreakLinks
A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
☆553Updated 4 months ago
Alternatives and similar repositories for llama3-jailbreak
Users that are interested in llama3-jailbreak are comparing it to the libraries listed below
Sorting:
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆474Updated 11 months ago
- Parseltongue is a powerful prompt hacking tool/browser extension for real-time tokenization visualization and seamless text conversion, s…☆454Updated 4 months ago
- A benchmark for emotional intelligence in large language models☆302Updated 10 months ago
- Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!☆302Updated 7 months ago
- Automatically evaluate your LLMs in Google Colab☆631Updated last year
- A library for making RepE control vectors☆595Updated 4 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆239Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated last year
- ☆447Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆239Updated 3 months ago
- ☆157Updated 10 months ago
- Fast parallel LLM inference for MLX☆189Updated 11 months ago
- A multimodal, function calling powered LLM webui.☆214Updated 8 months ago
- Guide for fine-tuning Llama/Mistral/CodeLlama models and more☆596Updated last month
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆371Updated last year
- An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning☆167Updated last year
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]☆313Updated 4 months ago
- ☆291Updated 2 months ago
- Code release for Best-of-N Jailbreaking☆512Updated 4 months ago
- Red-Teaming Language Models with DSPy☆195Updated 3 months ago
- ☆560Updated 6 months ago
- ☆437Updated 8 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆91Updated last month
- Low-Rank adapter extraction for fine-tuned transformers models☆171Updated last year
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆446Updated 8 months ago
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,014Updated last month
- llama.cpp with BakLLaVA model describes what does it see☆383Updated last year
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".☆106Updated last year
- a curated list of data for reasoning ai☆136Updated 10 months ago
- function calling-based LLM agents☆285Updated 8 months ago