haizelabs / llama3-jailbreakLinks
A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
☆564Updated 9 months ago
Alternatives and similar repositories for llama3-jailbreak
Users that are interested in llama3-jailbreak are comparing it to the libraries listed below
Sorting:
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆518Updated last year
- A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information☆449Updated 3 months ago
- ☆446Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆221Updated last year
- Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!☆330Updated 2 weeks ago
- Red-Teaming Language Models with DSPy☆235Updated 8 months ago
- A benchmark for emotional intelligence in large language models☆370Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆246Updated 8 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆97Updated 6 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆451Updated last year
- Mistral7B playing DOOM☆138Updated last year
- ☆162Updated 2 months ago
- Parseltongue is a powerful prompt hacking tool/browser extension for real-time tokenization visualization and seamless text conversion, s…☆526Updated 9 months ago
- An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning☆169Updated last year
- Approximation of the Claude 3 tokenizer by inspecting generation stream☆143Updated last year
- Code release for Best-of-N Jailbreaking☆542Updated 8 months ago
- ☆416Updated last year
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆316Updated 2 years ago
- A library for making RepE control vectors☆655Updated last month
- function calling-based LLM agents☆289Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s☆714Updated 2 years ago
- Automatically evaluate your LLMs in Google Colab☆664Updated last year
- Visualize the intermediate output of Mistral 7B☆375Updated 9 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆239Updated last year
- ☆34Updated 4 months ago
- ☆415Updated 2 years ago
- ☆319Updated 3 months ago
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆369Updated last year
- Fast parallel LLM inference for MLX☆224Updated last year
- Implementation of Google's SELF-DISCOVER☆298Updated last year