haizelabs / llama3-jailbreakLinks

A trivial programmatic Llama 3 jailbreak. Sorry Zuck!

☆565

Alternatives and similar repositories for llama3-jailbreak

Users that are interested in llama3-jailbreak are comparing it to the libraries listed below

Sorting:

FailSpy / abliterator
Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens
☆530Updated last year
migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆221Updated last year
mistralai-sf24 / hackathon
☆446Updated last year
NousResearch / Open-Reasoning-Tasks
A comprehensive repository of reasoning tasks for LLMs (and beyond)
☆450Updated last year
teknium1 / Prompt-Engineering-Toolkit
☆416Updated last year
Mihaiii / llm_steer
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆248Updated 9 months ago
BASI-LABS / parseltongue
Parseltongue is a powerful prompt hacking tool/browser extension for real-time tokenization visualization and seamless text conversion, s…
☆529Updated 10 months ago
galatolofederico / microchain
function calling-based LLM agents
☆289Updated last year
EQ-bench / EQ-Bench
A benchmark for emotional intelligence in large language models
☆382Updated last year
haizelabs / dspy-redteam
Red-Teaming Language Models with DSPy
☆235Updated 9 months ago
cpldcpu / MisguidedAttention
A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information
☆452Updated 3 months ago
gavi / mlx-whatsapp
An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning
☆171Updated last year
haizelabs / get-haized
A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.
☆98Updated 7 months ago
vgel / repeng
A library for making RepE control vectors
☆662Updated last month
CHATS-lab / persuasive_jailbreaker
Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!
☆334Updated last month
sam-paech / antislop-sampler
☆326Updated 3 months ago
itsme2417 / PolyMind
A multimodal, function calling powered LLM webui.
☆217Updated last year
jplhughes / bon-jailbreaking
Code release for Best-of-N Jailbreaking
☆546Updated 9 months ago
QuixiAI / OpenChatML
☆163Updated 3 months ago
willccbb / mlx_parallm
Fast parallel LLM inference for MLX
☆232Updated last year
javirandor / anthropic-tokenizer
Approximation of the Claude 3 tokenizer by inspecting generation stream
☆147Updated last year
mlabonne / llm-autoeval
Automatically evaluate your LLMs in Google Colab
☆670Updated last year
aidanmclaughlin / AidanBench
Aidan Bench attempts to measure <big_model_smell> in LLMs.
☆315Updated 4 months ago
SkunkworksAI / hydra-moe
☆415Updated 2 years ago
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
Leeroo-AI / mergoo
A library for easily merging multiple LLM experts, and efficiently train the merged LLM.
☆495Updated last year
OneInterface / realtime-bakllava
llama.cpp with BakLLaVA model describes what does it see
☆381Updated 2 years ago
IST-DASLab / PanzaMail
☆297Updated 7 months ago
umuthopeyildirim / DOOM-Mistral
Mistral7B playing DOOM
☆138Updated last year
EGjoni / DRUGS
Stop messing around with finicky sampling parameters and just use DRµGS!
☆358Updated last year