haizelabs / BEAST-implementation
☆16Updated 7 months ago
Alternatives and similar repositories for BEAST-implementation:
Users that are interested in BEAST-implementation are comparing it to the libraries listed below
- A utility to inspect, validate, sign and verify machine learning model files.☆52Updated 2 months ago
- General research for Dreadnode☆19Updated 7 months ago
- Tree of Attacks (TAP) Jailbreaking Implementation☆98Updated 11 months ago
- ☆63Updated 3 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 5 months ago
- A YAML based format for describing tools to LLMs, like man pages but for robots!☆52Updated last month
- ☆28Updated 2 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆88Updated 7 months ago
- CLI and API server for https://github.com/dreadnode/robopages☆24Updated last month
- A library for red-teaming LLM applications with LLMs.☆24Updated 3 months ago
- Manual Prompt Injection / Red Teaming Tool☆14Updated 3 months ago
- Improve prompts for e.g. GPT3 and GPT-J using templates and hyperparameter optimization.☆41Updated 2 years ago
- ☆26Updated 2 months ago
- ☆17Updated last year
- ☆45Updated last month
- PAL: Proxy-Guided Black-Box Attack on Large Language Models☆48Updated 5 months ago
- https://arxiv.org/abs/2412.02776☆41Updated last month
- future-proof vulnerability detection benchmark, based on CVEs in open-source repos☆46Updated this week
- A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)☆76Updated last month
- Red-Teaming Language Models with DSPy☆154Updated 9 months ago
- Code for the paper "Fishing for Magikarp"☆140Updated this week
- CompChomper is a framework for measuring how LLMs perform at code completion.☆14Updated 2 months ago
- An interactive CLI application for interacting with authenticated Jupyter instances.☆50Updated 10 months ago
- ☆14Updated 6 months ago
- ☆48Updated 3 months ago
- direct preference optimization with only 1 model copy :)☆12Updated last year
- Data Scientists Go To Jupyter☆62Updated last month
- The jailbreak-evaluation is an easy-to-use Python package for language model jailbreak evaluation.☆19Updated 2 months ago
- Sphynx Hallucination Induction☆51Updated 5 months ago
- A collection of prompt injection mitigation techniques.☆20Updated last year