A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
☆567 · Jan 26, 2025 · Updated last year
Alternatives and similar repositories for llama3-jailbreak
Users that are interested in llama3-jailbreak are comparing it to the libraries listed below.
- Red-Teaming Language Models with DSPy ☆255 · Feb 13, 2025 · Updated last year
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite. ☆98 · Apr 13, 2025 · Updated last year
- Llama-3 agents that can browse the web by following instructions and talking to you ☆1,404 · Dec 10, 2024 · Updated last year
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness. ☆26 · Aug 3, 2024 · Updated last year
- ☆28 · Oct 22, 2024 · Updated last year
- ☆16 · May 30, 2024 · Updated last year
- (AAAI'25) Training-and-prompt Free General Painterly Image Harmonization Using image-wise attention sharing ☆61 · Dec 17, 2024 · Updated last year
- ☆866 · Jan 22, 2025 · Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆3,171 · Mar 31, 2026 · Updated last month
- ☆15 · Apr 26, 2025 · Updated last year
- ☆23 · Dec 28, 2023 · Updated 2 years ago
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be… ☆3,070 · Apr 24, 2025 · Updated last year
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024 ☆11 · Jun 6, 2024 · Updated last year
- A fast + lightweight implementation of the GCG algorithm in PyTorch ☆330 · May 13, 2025 · Updated 11 months ago
- Universal and Transferable Attacks on Aligned Language Models ☆4,638 · Aug 2, 2024 · Updated last year
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024. ☆115 · Jun 13, 2024 · Updated last year
- Python package for measuring memorization in LLMs. ☆188 · Jul 16, 2025 · Updated 9 months ago
- [CCS'24] A dataset consisting of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak… ☆3,652 · Dec 24, 2024 · Updated last year
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep ☆179 · Apr 23, 2025 · Updated last year
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting ☆17 · Apr 15, 2025 · Updated last year
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you… ☆86 · Dec 17, 2024 · Updated last year
- Large Action Model framework to develop AI Web Agents ☆6,328 · Jan 21, 2025 · Updated last year
- We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20… ☆348 · Feb 23, 2024 · Updated 2 years ago
- [ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M… ☆439 · Jan 22, 2025 · Updated last year
- A framework for Claude Opus to intelligently orchestrate subagents. ☆4,337 · Jul 1, 2024 · Updated last year
- Fine-tune LLM agents with online reinforcement learning ☆1,251 · Mar 19, 2024 · Updated 2 years ago
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersec… ☆19,101 · Updated this week
- Official inference library for pre-processing of Mistral models ☆885 · Apr 1, 2026 · Updated last month
- TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models ☆19 · Aug 17, 2025 · Updated 8 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars ☆988 · Jul 23, 2024 · Updated last year
- Verbosity control for AI agents ☆66 · May 23, 2024 · Updated last year
- ☆446 · Apr 1, 2024 · Updated 2 years ago
- maze datasets for investigating OOD behavior of ML systems ☆77 · Jan 19, 2026 · Updated 3 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆13,326 · Updated this week
- LLM Analytics ☆711 · Oct 19, 2024 · Updated last year
- AI powered one-click comprehensive docs from transcripts and text. ☆1,695 · Feb 11, 2025 · Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings. ☆90 · Jul 21, 2024 · Updated last year
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆6,085 · Apr 8, 2026 · Updated 3 weeks ago
- Prompt engineering, automated. ☆354 · Apr 22, 2025 · Updated last year