parameterlab / trap
Source code of "TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification", ACL 2024 (Findings)
☆11 Updated 7 months ago
Alternatives and similar repositories for trap
Users who are interested in trap are comparing it to the repositories listed below
- ☆42 Updated 8 months ago
- General research for Dreadnode ☆23 Updated last year
- A collection of prompt injection mitigation techniques. ☆23 Updated last year
- DEF CON 31 AI Village - LLMs: Loose Lips Multipliers ☆10 Updated last year
- Implementation of BEAST adversarial attack for language models (ICML 2024) ☆88 Updated last year
- [IJCAI 2024] Imperio is an LLM-powered backdoor attack. It allows the adversary to issue language-guided instructions to control the vict… ☆42 Updated 4 months ago
- Risks and targets for assessing LLMs & LLM vulnerabilities ☆30 Updated last year
- Adversarial Tokenization ☆23 Updated last month
- using ML models for red teaming ☆43 Updated last year
- Code for shelLM tool ☆55 Updated 4 months ago
- PAL: Proxy-Guided Black-Box Attack on Large Language Models ☆51 Updated 10 months ago
- Tree of Attacks (TAP) Jailbreaking Implementation ☆109 Updated last year
- A CLI wrapper for libmodsecurity (v3.0.10) ☆13 Updated last year
- Security Weaknesses in Machine Learning ☆15 Updated last year
- ☆20 Updated last year
- Official code for the paper entitled "Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense" ☆14 Updated 2 months ago
- CyberBench: A Multi-Task Cyber LLM Benchmark ☆17 Updated last month
- CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on… ☆30 Updated this week
- ☆37 Updated this week
- AI fun ☆25 Updated 4 months ago
- A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks. ☆69 Updated last year
- Code for the paper "EMBERSim: A Large-Scale Databank for Boosting Similarity Search in Malware Analysis" ☆29 Updated last year
- ☆65 Updated 5 months ago
- The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models". ☆49 Updated 8 months ago
- A Python client for the Global CVE Allocation System. ☆13 Updated this week
- ☆13 Updated 2 years ago
- future-proof vulnerability detection benchmark, based on CVEs in open-source repos ☆56 Updated last week
- ☆26 Updated last year
- ☆66 Updated 11 months ago
- Data Scientists Go To Jupyter ☆63 Updated 3 months ago