A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
☆568Jan 26, 2025Updated last year
Alternatives and similar repositories for llama3-jailbreak
Users that are interested in llama3-jailbreak are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Red-Teaming Language Models with DSPy☆253Feb 13, 2025Updated last year
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆98Apr 13, 2025Updated 11 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,404Dec 10, 2024Updated last year
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Aug 3, 2024Updated last year
- ☆28Oct 22, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆16May 30, 2024Updated last year
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆61Dec 17, 2024Updated last year
- ☆867Jan 22, 2025Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,166Mar 31, 2026Updated last week
- ☆15Apr 26, 2025Updated 11 months ago
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be…☆3,064Apr 24, 2025Updated 11 months ago
- A fast + lightweight implementation of the GCG algorithm in PyTorch☆324May 13, 2025Updated 10 months ago
- Universal and Transferable Attacks on Aligned Language Models☆4,601Aug 2, 2024Updated last year
- MAexp is a generic platform for RL-based multi-agent exploration☆108Aug 25, 2025Updated 7 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Python package for measuring memorization in LLMs.☆185Jul 16, 2025Updated 8 months ago
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.☆115Jun 13, 2024Updated last year
- [CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak…☆3,629Dec 24, 2024Updated last year
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆178Apr 23, 2025Updated 11 months ago
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆17Apr 15, 2025Updated 11 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆84Dec 17, 2024Updated last year
- Large Action Model framework to develop AI Web Agents☆6,311Jan 21, 2025Updated last year
- We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20…☆345Feb 23, 2024Updated 2 years ago
- [ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…☆438Jan 22, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,330Jul 1, 2024Updated last year
- Fine-tune LLM agents with online reinforcement learning☆1,250Mar 19, 2024Updated 2 years ago
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersec…☆18,957Updated this week
- Official inference library for pre-processing of Mistral models☆875Apr 1, 2026Updated last week
- TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models☆19Aug 17, 2025Updated 7 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆988Jul 23, 2024Updated last year
- Verbosity control for AI agents☆66May 23, 2024Updated last year
- ☆446Apr 1, 2024Updated 2 years ago
- maze datasets for investigating OOD behavior of ML systems☆75Jan 19, 2026Updated 2 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,280Updated this week
- LLM Analytics☆709Oct 19, 2024Updated last year
- AI powered one-click comprehensive docs from transcripts and text.☆1,696Feb 11, 2025Updated last year
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆6,003Updated this week
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆92Jul 21, 2024Updated last year
- Prompt engineering, automated.☆352Apr 22, 2025Updated 11 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆275Jan 10, 2026Updated 3 months ago