haizelabs / BEAST-implementation
☆16Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for BEAST-implementation
- ☆62Updated last month
- General research for Dreadnode☆17Updated 5 months ago
- Tree of Attacks (TAP) Jailbreaking Implementation☆95Updated 9 months ago
- ☆36Updated this week
- A utility to inspect, validate, sign and verify machine learning model files.☆42Updated 2 weeks ago
- ☆15Updated last week
- ☆26Updated this week
- Implementation of BEAST adversarial attack for language models (ICML 2024)☆73Updated 6 months ago
- A steganography tool for automatically encoding images that act as prompt injections/jailbreaks for AIs with code interpreter and vision.☆39Updated last month
- Stage 1: Sensitive Email/Chat Classification for Adversary Agent Emulation (espionage). This project is meant to extend Red Reaper v1 whi…☆23Updated 2 months ago
- A collection of prompt injection mitigation techniques.☆18Updated last year
- using ML models for red teaming☆39Updated last year
- Repo with random useful scripts, utilities, prompts and stuff☆19Updated last month
- A library for red-teaming LLM applications with LLMs.☆22Updated last month
- Payloads for Attacking Large Language Models☆64Updated 4 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆86Updated 5 months ago
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆14Updated last month
- Data Scientists Go To Jupyter☆57Updated last week
- 🤖🛡️🔍🔒🔑 Tiny package designed to support red teams and penetration testers in exploiting large language model AI solutions.☆16Updated 6 months ago
- A YAML based format for describing tools to LLMs, like man pages but for robots!☆26Updated last week
- ☆63Updated this week
- future-proof vulnerability detection benchmark, based on CVEs in open-source repos☆44Updated last week
- De-redacting Elon's Email with Character-count Constrained Llama2 Decoding☆10Updated 8 months ago
- Improve prompts for e.g. GPT3 and GPT-J using templates and hyperparameter optimization.☆41Updated last year
- ☆72Updated last year
- A trace analysis tool for AI agents.☆124Updated last month
- ☆15Updated 7 months ago
- Minimal workflows☆14Updated 8 months ago
- A Completely Modular LLM Reverse Engineering, Red Teaming, and Vulnerability Research Framework.☆26Updated last week