A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
☆567Jan 26, 2025Updated last year
Alternatives and similar repositories for llama3-jailbreak
Users that are interested in llama3-jailbreak are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Red-Teaming Language Models with DSPy☆258Feb 13, 2025Updated last year
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆98Apr 13, 2025Updated last year
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,404Dec 10, 2024Updated last year
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Aug 3, 2024Updated last year
- ☆28Oct 22, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆16May 30, 2024Updated last year
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆61Dec 17, 2024Updated last year
- ☆865Jan 22, 2025Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,174Mar 31, 2026Updated last month
- ☆15Apr 26, 2025Updated last year
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be…☆3,073Apr 24, 2025Updated last year
- A fast + lightweight implementation of the GCG algorithm in PyTorch☆331May 13, 2025Updated last year
- Universal and Transferable Attacks on Aligned Language Models☆4,674Aug 2, 2024Updated last year
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.☆117Jun 13, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Python package for measuring memorization in LLMs.☆190Jul 16, 2025Updated 10 months ago
- [CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak…☆3,680Dec 24, 2024Updated last year
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆183Apr 23, 2025Updated last year
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆17Apr 15, 2025Updated last year
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆86Dec 17, 2024Updated last year
- Large Action Model framework to develop AI Web Agents☆6,351Jan 21, 2025Updated last year
- We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20…☆350Feb 23, 2024Updated 2 years ago
- [ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…☆443Jan 22, 2025Updated last year
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,338Jul 1, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Fine-tune LLM agents with online reinforcement learning☆1,251Mar 19, 2024Updated 2 years ago
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersec…☆19,258Updated this week
- Official inference library for pre-processing of Mistral models☆890May 13, 2026Updated last week
- TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models☆19Aug 17, 2025Updated 9 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆986Jul 23, 2024Updated last year
- Verbosity control for AI agents☆66May 23, 2024Updated last year
- ☆445Apr 1, 2024Updated 2 years ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,364May 1, 2026Updated 2 weeks ago
- LLM Analytics☆713Oct 19, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- AI powered one-click comprehensive docs from transcripts and text.☆1,696Feb 11, 2025Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆90Jul 21, 2024Updated last year
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆6,187Apr 8, 2026Updated last month
- Prompt engineering, automated.☆353Apr 22, 2025Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆276Jan 10, 2026Updated 4 months ago
- WIP - Allows you to create DSPy pipelines using ComfyUI☆202Dec 1, 2024Updated last year
- ☆3,091Nov 21, 2025Updated 6 months ago