A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
☆568Jan 26, 2025Updated last year
Alternatives and similar repositories for llama3-jailbreak
Users that are interested in llama3-jailbreak are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Red-Teaming Language Models with DSPy☆259Feb 13, 2025Updated last year
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆98Apr 13, 2025Updated last year
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,402Dec 10, 2024Updated last year
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Aug 3, 2024Updated last year
- ☆29Oct 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆16May 30, 2024Updated 2 years ago
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆61Dec 17, 2024Updated last year
- ☆864Jan 22, 2025Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,179Mar 31, 2026Updated 2 months ago
- ☆15Apr 26, 2025Updated last year
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be…☆3,081Apr 24, 2025Updated last year
- A fast + lightweight implementation of the GCG algorithm in PyTorch☆333May 13, 2025Updated last year
- Universal and Transferable Attacks on Aligned Language Models☆4,690Aug 2, 2024Updated last year
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.☆117Jun 13, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Python package for measuring memorization in LLMs.☆190Jul 16, 2025Updated 10 months ago
- [CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak…☆3,689Dec 24, 2024Updated last year
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆183Apr 23, 2025Updated last year
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆17Apr 15, 2025Updated last year
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆87Dec 17, 2024Updated last year
- Large Action Model framework to develop AI Web Agents☆6,361Jan 21, 2025Updated last year
- We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20…☆351Feb 23, 2024Updated 2 years ago
- [ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…☆445Jan 22, 2025Updated last year
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,342Jul 1, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Fine-tune LLM agents with online reinforcement learning☆1,248Mar 19, 2024Updated 2 years ago
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersec…☆19,436Updated this week
- Official inference library for pre-processing of Mistral models☆901Updated this week
- TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models☆19Aug 17, 2025Updated 9 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆986Jul 23, 2024Updated last year
- Verbosity control for AI agents☆66May 23, 2024Updated 2 years ago
- ☆444Apr 1, 2024Updated 2 years ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,414Updated this week
- LLM Analytics☆713Oct 19, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- AI powered one-click comprehensive docs from transcripts and text.☆1,694Feb 11, 2025Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆90Jul 21, 2024Updated last year
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆6,258Apr 8, 2026Updated 2 months ago
- Prompt engineering, automated.☆353Apr 22, 2025Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆279Jan 10, 2026Updated 5 months ago
- WIP - Allows you to create DSPy pipelines using ComfyUI☆202Dec 1, 2024Updated last year
- ☆3,094Nov 21, 2025Updated 6 months ago