haizelabs / llama3-jailbreakLinks
A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
☆562Updated 7 months ago
Alternatives and similar repositories for llama3-jailbreak
Users that are interested in llama3-jailbreak are comparing it to the libraries listed below
Sorting:
- A benchmark for emotional intelligence in large language models☆348Updated last year
- Red-Teaming Language Models with DSPy☆212Updated 6 months ago
- ☆414Updated last year
- ☆446Updated last year
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆450Updated 11 months ago
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆504Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated last year
- Code release for Best-of-N Jailbreaking☆533Updated 6 months ago
- Parseltongue is a powerful prompt hacking tool/browser extension for real-time tokenization visualization and seamless text conversion, s…☆513Updated 7 months ago
- ☆433Updated 10 months ago
- Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!☆317Updated 10 months ago
- A library for making RepE control vectors☆626Updated 7 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆243Updated 6 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆95Updated 4 months ago
- A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information☆439Updated last month
- ☆161Updated 3 weeks ago
- Implementation of Google's SELF-DISCOVER☆299Updated last year
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆312Updated 2 months ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o…☆350Updated 7 months ago
- function calling-based LLM agents☆287Updated 11 months ago
- Official inference library for pre-processing of Mistral models☆784Updated this week
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,029Updated 4 months ago
- Fast parallel LLM inference for MLX☆209Updated last year
- ☆314Updated last month
- Automatically evaluate your LLMs in Google Colab☆655Updated last year
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆316Updated last year
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.☆491Updated last year
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆469Updated last year
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆290Updated 2 weeks ago
- Approximation of the Claude 3 tokenizer by inspecting generation stream☆131Updated last year