haizelabs / llama3-jailbreak
A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
☆567 Updated last year
Alternatives and similar repositories for llama3-jailbreak
Users who are interested in llama3-jailbreak are comparing it to the repositories listed below.
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens ☆562 Updated last year
- Persuasive Jailbreaker: we can persuade LLMs to jailbreak them! ☆349 Updated 3 months ago
- ☆417 Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆266 Updated 2 weeks ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite. ☆100 Updated 9 months ago
- A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information ☆455 Updated 5 months ago
- Code release for Best-of-N Jailbreaking ☆552 Updated 11 months ago
- ☆446 Updated last year
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆457 Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 Updated last year
- Approximation of the Claude 3 tokenizer by inspecting generation stream ☆150 Updated last year
- ☆165 Updated 5 months ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs. ☆315 Updated 7 months ago
- A benchmark for emotional intelligence in large language models ☆397 Updated last year
- An mlx project to train a base model on your WhatsApp chats using (Q)LoRA finetuning ☆172 Updated 2 years ago
- Red-Teaming Language Models with DSPy ☆250 Updated 11 months ago
- function calling-based LLM agents ☆289 Updated last year
- Parseltongue is a powerful prompt hacking tool/browser extension for real-time tokenization visualization and seamless text conversion, s… ☆541 Updated last year
- Convert all of libgen to high quality markdown ☆254 Updated 2 years ago
- Implementation of Google's SELF-DISCOVER ☆300 Updated last year
- AgentSearch is a framework for powering search agents and enabling customizable local search. ☆515 Updated last year
- Instructions on how to run LLMs on Raspberry PI ☆210 Updated last year
- Automatically evaluate your LLMs in Google Colab ☆683 Updated last year
- ☆434 Updated last year
- ☆334 Updated 5 months ago
- A benchmark to evaluate language models on questions I've previously asked them to solve. ☆1,041 Updated 9 months ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆240 Updated last year
- llama.cpp with BakLLaVA model describes what it sees ☆380 Updated 2 years ago
- This repository explains and provides examples for "concept anchoring" in GPT4. ☆71 Updated 2 years ago
- Stop messing around with finicky sampling parameters and just use DRµGS! ☆360 Updated last year