jplhughes / bon-jailbreaking
Code release for Best-of-N Jailbreaking
☆524 Updated 4 months ago
Alternatives and similar repositories for bon-jailbreaking
Users who are interested in bon-jailbreaking are comparing it to the libraries listed below
- The LLM Red Teaming Framework ☆477 Updated this week
- Parseltongue is a powerful prompt hacking tool/browser extension for real-time tokenization visualization and seamless text conversion, s… ☆459 Updated 5 months ago
- agent q - oss advanced reasoning and learning for autonomous ai agents ☆473 Updated 9 months ago
- A security scanner for your LLM agentic workflows ☆605 Updated last week
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions" ☆343 Updated 6 months ago
- Force DeepSeek r1 models to think for as long as you wish ☆368 Updated 4 months ago
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025] ☆319 Updated 5 months ago
- A steganography tool for automatically encoding images that act as prompt injections/jailbreaks for AIs with code interpreter and vision. ☆99 Updated 8 months ago
- This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model) ☆320 Updated 9 months ago
- Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to pote… ☆173 Updated 2 months ago
- Unlock 650+ MCP servers tools in your favorite agentic framework. ☆361 Updated last week
- ☆565 Updated 6 months ago
- HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal ☆667 Updated 10 months ago
- Make any LLM think like OpenAI o1 and DeepSeek R1 ☆490 Updated 4 months ago
- E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can connect to any LLM for secure computer use. ☆986 Updated last week
- Atom of Thoughts for Markov LLM Test-Time Scaling ☆577 Updated last week
- Together Open Deep Research ☆314 Updated 2 months ago
- Surf is a computer use AI agent powered by OpenAI that interacts with E2B's virtual desktop environment through natural language instru… ☆453 Updated last month
- A powerful tool for automated LLM fuzzing. It is designed to help developers and security researchers identify and mitigate potential jai… ☆622 Updated 3 weeks ago
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens ☆480 Updated last year
- Deeper Seeker is a simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT. It is an agentic research tool to reason, cr… ☆408 Updated last month
- An agent benchmark with tasks in a simulated software company. ☆407 Updated this week
- the simplest self-building general autonomous agent ☆311 Updated 8 months ago
- ☆211 Updated last week
- Persuasive Jailbreaker: we can persuade LLMs to jailbreak them! ☆307 Updated 8 months ago
- A very fast, very minimal prompt optimizer ☆267 Updated 5 months ago
- A list of curated resources for people interested in AI Red Teaming, Jailbreaking, and Prompt Injection ☆208 Updated last month
- ☆314 Updated 6 months ago
- ☆183 Updated 7 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments ☆273 Updated this week