haizelabs / llama3-jailbreak
A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
☆554 Updated 5 months ago
Alternatives and similar repositories for llama3-jailbreak
Users who are interested in llama3-jailbreak are comparing it to the libraries listed below.
- Persuasive Jailbreaker: we can persuade LLMs to jailbreak them! ☆307 Updated 8 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆448 Updated 9 months ago
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens ☆480 Updated last year
- A benchmark for emotional intelligence in large language models ☆306 Updated 11 months ago
- ☆157 Updated 11 months ago
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025] ☆319 Updated 5 months ago
- Red-Teaming Language Models with DSPy ☆201 Updated 4 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆221 Updated last year
- ☆302 Updated 2 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite. ☆91 Updated 2 months ago
- A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information ☆427 Updated last week
- A multimodal, function calling powered LLM webui. ☆214 Updated 9 months ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆173 Updated last year
- Arena-Hard-Auto: An automatic LLM benchmark. ☆855 Updated last week
- Aidan Bench attempts to measure <big_model_smell> in LLMs. ☆305 Updated this week
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆239 Updated 4 months ago
- ☆435 Updated 8 months ago
- ☆411 Updated 10 months ago
- ☆565 Updated 6 months ago
- ☆446 Updated last year
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM. ☆483 Updated 10 months ago
- Parseltongue is a powerful prompt hacking tool/browser extension for real-time tokenization visualization and seamless text conversion, s… ☆459 Updated 5 months ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆239 Updated last year
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs" ☆464 Updated last year
- LLM Analytics ☆668 Updated 8 months ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o… ☆347 Updated 5 months ago
- From anywhere you can type, query and stream the output of an LLM or any other script ☆496 Updated last year
- Start a server from the MLX library. ☆187 Updated 11 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you ☆1,407 Updated 6 months ago
- Inference code for Mistral and Mixtral hacked up into original Llama implementation ☆371 Updated last year