dreadnode / researchLinks
General research for Dreadnode
☆23Updated 11 months ago
Alternatives and similar repositories for research
Users that are interested in research are comparing it to the libraries listed below
Sorting:
- Tree of Attacks (TAP) Jailbreaking Implementation☆109Updated last year
- Implementation of BEAST adversarial attack for language models (ICML 2024)☆87Updated last year
- Adversarial Tokenization☆22Updated last month
- [IJCAI 2024] Imperio is an LLM-powered backdoor attack. It allows the adversary to issue language-guided instructions to control the vict…☆42Updated 3 months ago
- ☆40Updated 8 months ago
- ☆71Updated 6 months ago
- The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".☆48Updated 7 months ago
- ☆88Updated last year
- PAL: Proxy-Guided Black-Box Attack on Large Language Models☆51Updated 9 months ago
- ☆16Updated last year
- ☆14Updated 5 months ago
- ☆65Updated 4 months ago
- ☆13Updated 11 months ago
- A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)☆113Updated 5 months ago
- Source code of "TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification", ACL2024 (findings)☆11Updated 6 months ago
- ☆34Updated 6 months ago
- Fine-tuning base models to build robust task-specific models☆30Updated last year
- ☆63Updated 11 months ago
- Minimal workflows☆19Updated last year
- Data Scientists Go To Jupyter☆64Updated 3 months ago
- A collection of prompt injection mitigation techniques.☆23Updated last year
- An interactive CLI application for interacting with authenticated Jupyter instances.☆53Updated 3 weeks ago
- [ArXiv 2024] Denial-of-Service Poisoning Attacks on Large Language Models☆18Updated 7 months ago
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives☆69Updated last year
- [ICML 2024] COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability☆151Updated 5 months ago
- A utility to inspect, validate, sign and verify machine learning model files.☆57Updated 4 months ago
- [EMNLP 2024] Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction☆14Updated 6 months ago
- A prompt injection game to collect data for robust ML research☆61Updated 4 months ago
- ☆16Updated last year
- ☆45Updated last year